Beast code in Giters

Xuemin Zhao's repositories

ape210k

This is the repository of the Ape210K dataset and baseline models.

Language:Python100

bert

TensorFlow code and pre-trained models for BERT

Language:PythonApache-2.01 20

Synonyms

中文近义词工具包

Language:PythonMIT1 20

alexa-dataset-contextual-query-rewrite

This repo includes extensions to the Stanford Dialogue Corpus. It contains crowd-sourced rewrites to facilitate research in dialogue state tracking using natural language as the interface.

MIT-0010

backchannel-prediction

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor

Language:Python000

bert-dst

BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer

Language:Python010

chat

010

Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Language:PythonApache-2.0010

coco-dst

Language:PythonBSD-3-Clause000

couplet-clean-dataset

Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。

MIT000

dats

Language:PythonApache-2.0000

DIRT

DIRT:Deep Learning Enhanced Item Response Theory for Cognitive Diagnosis

Language:Python010

dkt

Our implementation of the LSTM version of Deep Knowledge Tracing (DKT)

Language:PythonMIT010

ekt

Language:Python010

glyph

Which Encoding is the Best for Text Classification in Chinese, English, Japanese and Korean?

Language:ShellBSD-3-Clause010

icassp2019-ood-dataset

dialog system, icassp, dataset

010

LatticeLSTM

Chinese NER using Lattice LSTM. Code for ACL 2018 paper.

Language:Python010

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

000

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language:PythonMIT000

MSParS

020

multisense-prob-fasttext

ACL 2018 paper: Probabilistic FastText for Multi-Sense Word Embeddings (Athiwaratkun et al., 2018)

Language:C++NOASSERTION020

NeuralCD

Language:Python010

NeurIPSEducation2020

Language:PythonMIT000

ood_robust_hcn

Code for the paper "Improving Robustness of Dialog Systems in a Data-Efficient Way with Turn Dropout" by Igor Shalyminov and Sungjin Lee

Language:Jupyter Notebook010

poetry-dataset

Chinese classical poetry dataset. 中文绝句诗歌数据集，欢迎使用。

000

pyBKT

Python implementation of Bayesian Knowledge Tracing and extensions

Language:C++MIT010

rasa_nlu

turn natural language into structured data

Language:PythonApache-2.0020

simjoin

Language:Jupyter NotebookApache-2.0010

simsearch

Language:JavaApache-2.0010

StarSpace

Learning embeddings for classification, retrieval and ranking.

Language:C++MIT020