ssunqf's repositories
rust-sbert
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
awsomes
awsome list
BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Chinese-Mandarin-Dictionaries
中文词典 / 中文詞典。
Chinese-Names-Corpus
中文人名语料库。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。
chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
Chinese_financial_sentiment_dictionary
A Chinese financial sentiment word dictionary
chinese_sentiment_dictionary
该仓库收集了常用的中文情感词典,仅供学习
ECDICT
Free English to Chinese Dictionary Database
English-to-IPA
Converts English text to IPA notation
english-wordnet
The Open English WordNet
etymology-db
An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.
fastBPE
Fast BPE
Final_word_Similarity
综合了同义词词林扩展版与知网(Hownet)的词语相似度计算方法,词汇覆盖更多、结果更准确。
ipa-dict
Monolingual wordlists with pronunciation information in IPA
IPRE
IPRE: a Dataset for Inter-Personal Relationship Extraction
mana
scrape infohashes and names passively from the global distributed hash table
OpenCorpus
A collection of freely available corpora.
pgx
Build Postgres Extensions with Rust!
PinYinSound
汉语拼音MP3语音文件
python-pinyin
汉字转拼音(pypinyin)
ssunqf.github.io
Privacy Policy Template for website or app
stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
WikipediaHomographData
Labeled data for homograph disambiguation