Dreamxu's repositories
Pinyin2Hanzi
拼音转汉字, 拼音输入法引擎, pin yin -> 拼音
-Learning-materials-
各种学习资料,包括一些百度云视频链接还有pdf资料 --搬运工
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型集合
BERT-BiLSTM-CRF-NER
Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services
books-making-you-better
A list of classic books make better you understand not only how it works, but why it works.
chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
ChatterBot
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
DrQA
Reading Wikipedia to Answer Open-Domain Questions
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报告、自然语言生成、NLP太难了系列、自动对联数据及机器人、用户名黑名单列表、罪名法务名词及分类模型、微信公众号语料、cs224n深度学习自然语言处理课程、中文手写汉字识别、中文自然语言处理 语料/数据集、变量命名神器
gpt2-ml
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
HanLP
自然语言处理 中文分词 词性标注 命名实体识别 依存句法分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁
Keyword_Extraction
神策杯2018高校算法大师赛(中文关键词提取)第二名代码方案
lazynlp
Library to scrape and clean web pages to create massive datasets.
MatchPyramid-TensorFlow
A simple version of MatchPyramid implement in TensorFlow. Paper https://arxiv.org/abs/1602.06359.
NCRFpp
NCRF++, an Open-source Neural Sequence Labeling Toolkit. It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components. (code for COLING/ACL 2018 paper)
nlp-beginner
NLP上手教程
nlp-library
curated collection of papers for the nlp practitioner 📖👩🔬
NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
nlp_course
YSDA course in Natural Language Processing
nsfw_data_scrapper
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
pandownload-fake-server
重定向服务器复活 Pandownload
PyTorch_Tutorial
《Pytorch模型训练实用教程》中配套代码
SPM_toolkit
Neural network toolkit for sentence pair modeling.
Task-Oriented-Dialogue-Dataset-Survey
A dataset survey about task-oriented dialogue, including recent datasets.
tensorflow-DSMM
Tensorflow implementations of various Deep Semantic Matching Models