singhranjodh / cmudict-ipa

The CMU Pronouncing Dictionary converted to IPA


CMU Pronouncing Dictionary converted to IPA by Alexander Piperski

cmudict-0.7b-ipa.txt contains all the entries from the CMU Pronouncing Dictionary (version 0.7b), automatically converted to the International Phonetic Alphabet (IPA). Each line consists of a headword and its transcription(s), separated by a tab character; variant transcriptions are separated by ", ".


  • ALPINE ˈælˌpaɪn
  • APPALACHIAN ˌæpəˈleɪtʃən, ˌæpəˈleɪʃən, ˌæpəˈlætʃən
  • HIMALAYAN ˌhɪməˈleɪən, ˌhɪməˈlaɪən
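The line format described above can be read with a few lines of Python. This is only a minimal sketch: the function names `parse_entry` and `load_dictionary` are made up for illustration, and the file is assumed to be UTF-8 encoded.

```python
def parse_entry(line):
    """Split one dictionary line into (headword, list of IPA variants)."""
    # The headword and the transcription(s) are separated by a tab.
    headword, _, transcriptions = line.rstrip("\n").partition("\t")
    # Variant transcriptions are separated by ", ".
    variants = [v.strip() for v in transcriptions.split(", ") if v.strip()]
    return headword, variants


def load_dictionary(path):
    """Read the whole file into a dict mapping headword -> list of variants."""
    entries = {}
    # Assumption: the file is UTF-8 encoded.
    with open(path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue  # skip blank lines
            headword, variants = parse_entry(line)
            entries[headword] = variants
    return entries
```

For example, `load_dictionary("cmudict-0.7b-ipa.txt")["HIMALAYAN"]` would give a list with the two variants shown above, provided the file is in the current directory.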

Apart from the standard Arpabet to IPA conversion rules, the following ones were observed:

  • AH is converted to ʌ when bearing primary stress and to ə otherwise (AH1 -> ʌ; AH0, AH2 -> ə)
  • Vowel length is indicated (AA -> ɑː, ER -> ɝː, IY -> iː, UW -> uː). However, unstressed word-final ER and IY are short (i.e., ER0 -> ɝ and IY0 -> i when word-final).
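The two rules above could be expressed roughly as follows. This is an illustrative sketch, not the script that produced the file: `convert_special_vowel` is a hypothetical helper, it covers only the vowels named in the two rules (everything else returns `None` and would be handled by the standard Arpabet mapping), and "word-final" is assumed to mean that the vowel is the last phoneme of the word.

```python
# Vowels that are written long by default (second rule).
LONG_VOWELS = {"AA": "ɑː", "ER": "ɝː", "IY": "iː", "UW": "uː"}
# Exceptions: unstressed ER and IY are short when word-final.
SHORT_FINAL = {"ER0": "ɝ", "IY0": "i"}


def convert_special_vowel(phoneme, word_final=False):
    """Return the IPA symbol for an Arpabet vowel covered by the two rules.

    `phoneme` is an Arpabet vowel with its stress digit, e.g. "AH1".
    Returns None for consonants and for vowels the rules do not mention.
    """
    if not phoneme or not phoneme[-1].isdigit():
        return None  # consonants carry no stress digit
    base, stress = phoneme[:-1], phoneme[-1]
    if base == "AH":
        # First rule: primary stress gives ʌ, anything else gives ə.
        return "ʌ" if stress == "1" else "ə"
    if word_final and phoneme in SHORT_FINAL:
        return SHORT_FINAL[phoneme]
    return LONG_VOWELS.get(base)
```

Called as `convert_special_vowel("IY0", word_final=True)`, the helper returns the short vowel, while the same phoneme in any other position comes out long.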

brown-frequency-list-with-ipa.txt contains 33862 words occurring in the Brown Corpus of American English, sorted by their frequency of occurrence. This list may be of use to English learners who want to pronounce the most frequent words correctly.

