Language files for WordDumb.
Wiktionary data come from kaikki.org and Dbnary, Chinese and French Wiktionary data are created with the Wiktextract tool. Word difficulty data sources are listed in each language subfolders.
-
Python
-
lemminflect: inflect English words
-
Open Chinese Convert: convert Chinese characters
-
wordfreq: get word frequency data
-
wiktextract-lemmatization: remove stress
-
Perl: Remove invalid text in Dbnary files
-
lbzip2 or bzip2
-
pigz or gzip
$ python -m venv .venv
$ source .venv/bin/activate.fish
$ python -m pip install .
$ proficiency en
Change the venv invoke command according to your shell.
This work is licensed under GPL version 3 or later.