mvpvipliu / HSK-3.0-words-list

HSK 3.0 words list

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

HSK 3.0 words list

The words list with meaning generated using

How it generated?

The meanings in HSK list with meaning taken in following order

  1. if found in wiktionary then get meaning
  2. else take meaning from CC-CEDICT
  3. if not found then translate using Google Translate

Scripts and data

The Scripts and data contains files that used to create HSK 3.0 lists.

The main.py is used to create HSK 3.0 word list with meaning. The script is not so optimized, it may need improvements.

Data

The following data used to generate list

The following data generated (view in the folder)

  • HSK 1 to HSK 7-9 with clear meaning .txt files
  • tsv list for importing in Anki

For running the script

  1. Install Python
  2. Install following python modules using pip
pinyin
pycedict
hanziconv
googletrans
pinyin_jyutping_sentence
  1. The script reads characters/words per line and fetch meaning. Then it write the data .txt files
  2. Uncomment functions to use the script
# uncomment below and run
# get_meaning()


# some other helper functions
# find_dup()

# get_sound()
# count_field()

Note:

  • The meaning translated using Google when not found in CC-CEDICT.
  • This generated using python program, may contain errors and need improvements.

License

View License

About

HSK 3.0 words list

License:Other


Languages

Language:Python 94.1%Language:JavaScript 5.9%