xiaoduozhou's repositories
wikipedia-word-frequency
Gather modern English word frequencies from all enwiki articles.
Task-Oriented-Dialogue-Dataset-Survey
A survey of task-oriented dialogue datasets, including recent datasets and state-of-the-art results and papers.
geohash
Python module to decode/encode Geohashes to/from latitude and longitude. See http://en.wikipedia.org/wiki/Geohash
practical-pytorch
PyTorch tutorials demonstrating modern techniques with readable code
NMT
PyTorch implementation of Neural Machine Translation with seq2seq and attention (en-zh)
FrequencyWords
Repository for Frequency Word List Generator and processed files
CS-Notes
:books: Essential fundamentals for technical interviews
bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
ant-design
🐜 A UI Design Language
arXivNotes
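Summaries of NLP papers I have read (mostly dialogue-related), written up in Issues. They are rough. The 🚧 mark denotes papers whose notes are still being edited (many are effectively abandoned). The 🍡 mark denotes papers with only an overview written (a dango, in the sense that it is quick to look at).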
bert_chinese_pytorch
BERT for Chinese text classification
Name-Entity-Recognition
LSTM-CRF, Lattice-CRF, BERT-NER, and a running list of recent NER papers
LeetCode
Working through LeetCode problems
bert
TensorFlow code and pre-trained models for BERT
pytorch-pretrained-BERT
A PyTorch implementation of Google AI's BERT model with script to load Google's pre-trained models
MutiTreading_craw
Multithreaded crawler that fetches dynamic, asynchronously loaded Ajax page content
pdf2txt
A small tool for converting PDF to TXT
haipproxy
:sparkling_heart: Highly available distributed IP proxy pool, powered by Scrapy and Redis
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Skill-Tree
🐼 Preparing for autumn campus recruitment; feel free to pick fruit from the tree
chinese_chatbot_corpus
Public Chinese chat corpus
stopwords
Commonly used Chinese stopword lists (Harbin Institute of Technology stopword list, Baidu stopword list, etc.)
MachineLearning
Basic Machine Learning and Deep Learning
e2e_dialog_challenge
End-To-End Task-Completion Dialogue Challenge
EasyJob
:fire: 2018 review notes for internet-industry job interviews
HanLP
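Natural language processing: Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, keyword extraction, new word discovery, phrase extraction, automatic summarization, text classification, pinyin and Simplified/Traditional Chinese conversion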
zhihu-oauth
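An attempt to reverse-engineer Zhihu's official but unpublished OAuth2 API and provide an elegant way to use it, as a replacement for the zhihu-py3 project; currently still experimental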