mewo2 / syllpos

Wordlists by part of speech and syllable count

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wordlists by part of speech and syllable count

This is a collection of wordlists, taken from the Brown University Standard Corpus of Present-Day American English. Filenames have the form postag-syllablecount.txt, where postag is the part of speech tag, and syllable count is the number of syllables in the word.

The part-of-speech tags form part of the corpus, and are described further here.

The syllable counts are taken from the pronunciations in the CMU Pronouncing Dictionary. Words not included in the CMU dictionary are ignored. In cases where there is more than one pronunciation listed, the first is used.

About

Wordlists by part of speech and syllable count

License:Other


Languages

Language:Python 100.0%