Infinity Future's repositories
ChineseNER
Chinese named entity recognition, entity extraction, TensorFlow, PyTorch, BiLSTM+CRF
sklearn-crfsuite
scikit-learn inspired API for CRFsuite
insuranceqa-corpus-zh
Open data in the insurance domain for machine learning tasks
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition
THULAC-Python
An Efficient Lexical Analyzer for Chinese
dgk_lost_conv
dgk_lost_conv: a Chinese dialogue (conversation) corpus
chinese-corpus-1
Chinese single-turn and multi-turn dialogue corpus
Chinese-Names-Corpus
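Chinese names corpus: Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names.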
Chinese-Literature-NER-RE-Dataset
A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
ChineseNlpCorpus
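Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded contributors to advance Chinese natural language processing.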
Chinese_Corpus
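Chinese corpus collection: sentiment lexicons, sentiment analysis, text classification, single-turn dialogue, Chinese dictionaries, and Zhihu data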
TorchGlove
PyTorch implementation of Global Vectors for Word Representation.
tvsub
TVsub: DCU-Tencent Chinese-English Dialogue Corpus
Chinese-Lyric-Corpus
A Chinese lyric corpus which contains nearly 50,000 lyrics from 500 artists
Chinese-abbreviation-dataset
A corpus of Chinese abbreviations, including negative full forms.
ccsa
A Chinese Conversation Corpus for Sentiment Analysis
douban_group_convs
A dataset of online conversations in Chinese from the DASFAA 2017 paper "Learning the Structures of Online Asynchronous Conversations".
doc-han-att
Hierarchical Attention Networks for Chinese Sentiment Classification
Chinese_conversation_sentiment
A Chinese sentiment dataset that may be useful for sentiment analysis.
chinese-corpus
Chinese-related dictionaries and corpora.