MilkWYX's repositories
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
bert
TensorFlow code and pre-trained models for BERT
BilibiliWordCloud
制作B站弹幕词云图
capsule-networks
A PyTorch implementation of the NIPS 2017 paper "Dynamic Routing Between Capsules".
Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit
JointModelNSP
联合模型:a joint model for text normalization, segmention, POS tagging.
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
MELD
多模态情绪分析:MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
ntua-slp-semeval2018
Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.
Seq2Set
Code for the paper "A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification"
seq2set-semantic-tagging
End-to-end semantic tagging.
SGM
seq2seq多标签文本分类:Sequence Generation Model for Multi-label Classification (COLING 2018)
stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
SU4MLC
Code for the article "Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification" (EMNLP 2018)