CMU Pronouncing Dictionary

Last updated
CMU Pronouncing Dictionary
Developer(s) Carnegie Mellon University
Stable release
0.7b / November 19, 2014;10 years ago (2014-11-19)
Available in English
License BSD
Website www.speech.cs.cmu.edu/cgi-bin/cmudict

The CMU Pronouncing Dictionary (also known as CMUdict) is an open-source pronouncing dictionary originally created by the Speech Group at Carnegie Mellon University (CMU) for use in speech recognition research.

Contents

CMUdict provides a mapping orthographic/phonetic for English words in their North American pronunciations. It is commonly used to generate representations for speech recognition (ASR), e.g. the CMU Sphinx system, and speech synthesis (TTS), e.g. the Festival system. CMUdict can be used as a training corpus for building statistical grapheme-to-phoneme (g2p) models [1] that will generate pronunciations for words not yet included in the dictionary.

The most recent release is 0.7b; it contains over 134,000 entries. An interactive lookup version is available. [2]

Database format

The database is distributed as a plain text file with one entry to a line in the format "WORD  <pronunciation>" with a two-space separator between the parts. If multiple pronunciations are available for a word, variants are identified using numbered versions (e.g. WORD(1)). The pronunciation is encoded using a modified form of the ARPABET system, with the addition of stress marks on vowels of levels 0, 1, and 2. A line-initial ;;; token indicates a comment. A derived format, directly suitable for speech recognition engines is also available as part of the distribution; this format collapses stress distinctions (typically not used in ASR).

The following is a table of phonemes used by CMU Pronouncing Dictionary. [2]

Vowels
ARPABET Rspl. IPA Example
AAah ɑ odd
AEa æ at
AH0ə ə about
AHuh ʌ hut
AOaw ɔ ought, story
AWowcow
AYeyehide
EHeh ɛ Ed
Vowels
ARPABET Rspl. IPA Example
ERur, ər ɝ , ɚ hurt
EYayate
IHi, ih ɪ it
IYee i eat
OWohoat
OYoyɔɪtoy
UHuu ʊ hood
UWoo u two
Stress
ABDescription
0No stress
1 Primary stress
2 Secondary stress
Consonants
ARPABET Rspl. IPA Example
Bb b be
CHch, tch cheese
Dd d dee
DHdh ð thee
Ff f fee
Gg ɡ green
HHh h he
JHj gee
Consonants
ARPABET Rspl. IPA Example
Kk k key
Ll l lee
Mm m me
Nn n knee
NGng ŋ ping
Pp p pee
Rr r read
Ss, ss s sea
Consonants
ARPABET Rspl. IPA Example
SHsh ʃ she
Tt t tea
THth θ theta
Vv v vee
Ww, wh w we
Yy j yield
Zz z zee
ZHzh ʒ seizure

History

VersionRelease date [3] License
0.116 September 1993Public Domain
0.210 March 1994Public Domain
0.328 September 1994Public Domain
0.48 November 1995Public Domain
0.5No public releasePublic Domain
0.611 August 1998Public Domain
0.7No public releasePublic Domain
0.7a18 February 2008 2-clause BSD
0.7b19 November 2014 [4] 2-clause BSD
GitHub (unversioned)26 May 2021 2-clause BSD

Applications

See also

References

  1. "Sequitur G2P - A trainable Grapheme-to-Phoneme converter".
  2. 1 2 "The CMU Pronouncing Dictionary". CMU Pronouncing Dictionary. 2015-07-16. Archived from the original on 2022-06-03. Retrieved 2022-06-04.
  3. "FTP link". ftp.cs.cmu.edu (FTP).[dead ftp link](To view documents see Help:FTP)
  4. "CMUdict". svn.code.sf.net.
  5. "Cmusphinx - Revision 10973: /Trunk/Logios". Archived from the original on 2011-05-20. Retrieved 2009-12-19.