Xuemin Zhao's repositories
alexa-dataset-contextual-query-rewrite
This repo includes extensions to the Stanford Dialogue Corpus. It contains crowd-sourced rewrites to facilitate research in dialogue state tracking using natural language as the interface.
backchannel-prediction
Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor
Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
couplet-clean-dataset
Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。
icassp2019-ood-dataset
dialog system, icassp, dataset
LatticeLSTM
Chinese NER using Lattice LSTM. Code for ACL 2018 paper.
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
multisense-prob-fasttext
ACL 2018 paper: Probabilistic FastText for Multi-Sense Word Embeddings (Athiwaratkun et al., 2018)
ood_robust_hcn
Code for the paper "Improving Robustness of Dialog Systems in a Data-Efficient Way with Turn Dropout" by Igor Shalyminov and Sungjin Lee
poetry-dataset
Chinese classical poetry dataset. 中文绝句诗歌数据集,欢迎使用。