Karol Chang's repositories
Administrative-divisions-of-China
中华人民共和国行政区划:省级(省份直辖市自治区)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,**省市区镇村二级三级四级五级联动地址数据 Node.js 爬虫。
Alibaba-MIT-Speech
Alibaba speech technology
ASR_Theory
语音识别理论,论文和PPT
awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
chinese-corpus
中文相关词典和语料库。
ChineseNlpCorpus
搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。
ChineseNLPCorpus-1
中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
correction
Chinese "spelling" error correction
ctc_beam_search_lm
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统
Dialog_Corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
kaldi
This is the official location of the Kaldi project.
LeetCode
My leetcode solution
nlp-datasets
A list of datasets/corpora for NLP tasks, in reverse chronological order.
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
pysrilm-1
Simple Python wrapper for SRILM with Python 2.x and 3.x supported
Restaurant-List
This is an exercise of swift
setk
Tools for Speech Enhancement integrated with Kaldi
Small-Chinese-Corpus
Some useful Chinese corpus datasets 中文语料小数据
youtube-dl
Command-line program to download videos from YouTube.com and other video sites