The dictionary contains 72,375 words, totalling 591,570 letters. Its entries combine the Romanian Scrabble Association's official word list with the entries of a 15,517-word dictionary developed according to the SpeechDat specifications. The phonetic transcriptions are in SAMPA format.
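As a rough illustration of how such a dictionary could be loaded and its word and letter counts checked, here is a minimal Python sketch. The on-disk layout is an assumption, not something stated here: one entry per line, with the word, a tab, and then space-separated SAMPA symbols. The function names and the two sample lines are hypothetical and are not taken from the dictionary itself.

```python
from typing import Dict, Iterable, List, Tuple


def parse_entries(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Parse 'word<TAB>symbol symbol ...' lines into {word: [SAMPA symbols]}.

    The tab-separated layout is an assumption made for this sketch.
    """
    lexicon: Dict[str, List[str]] = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        word, _, transcription = line.partition("\t")
        lexicon[word] = transcription.split()
    return lexicon


def count_words_and_letters(lexicon: Dict[str, List[str]]) -> Tuple[int, int]:
    """Return (number of words, total number of letters over all words)."""
    return len(lexicon), sum(len(word) for word in lexicon)


# Made-up sample lines in the assumed format (not real dictionary entries).
sample = ["casa\tk a s a", "mare\tm a r e", ""]
lexicon = parse_entries(sample)
n_words, n_letters = count_words_and_letters(lexicon)
print(n_words, n_letters)
```

To run this on a real file, pass an open file handle to `parse_entries` in place of `sample`, for example `open(path, encoding="utf-8")`, so that Romanian diacritics are read correctly. If the actual file uses a different separator, only the `partition` call needs to change.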
If you use the MaRePhoR dictionary in your work, please read the license first and cite the following paper:
Ştefan-Adrian Toma, Adriana Stan, Mihai-Lică Pura, Traian Bârsan, "MaRePhoR – An Open Access Machine-Readable Phonetic Dictionary for Romanian", in Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue, Bucharest, Romania, July 6-9, 2017.
This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
THE CONTRIBUTORS TO THIS WORK DISCLAIM ALL WARRANTIES WITH REGARD TO THIS DATA, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE CONTRIBUTORS BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS DATA.