daishu7's repositories
TurboTransformers
A fast and user-friendly runtime for transformer inference on CPU and GPU
sentence-transformers
Sentence Embeddings with BERT & XLNet
deformer
[ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
linformer-pytorch
My take on a practical implementation of Linformer for Pytorch. https://arxiv.org/pdf/2006.04768.pdf
fastHan
fastHan is a Chinese natural language processing toolkit built on fastNLP and PyTorch, as convenient to call as spaCy.
reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
longformer
Longformer: The Long-Document Transformer
lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
Chinese-ELECTRA
Pre-trained Chinese ELECTRA models
gpt2-ml
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter pretrained Chinese model
HarvestText
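A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, syntactic parsing, etc.) using unsupervised or weakly supervised (seed word) methods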
pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
PolyEncoder
An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)
ALBERT
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
PyTorch_Tutorial
Companion code for "A Practical Tutorial on PyTorch Model Training" (《Pytorch模型训练实用教程》)
elasticsearch-analysis-hanlp
HanLP Analyzer for Elasticsearch
BiDAF-pytorch
Re-implementation of BiDAF (Bidirectional Attention Flow for Machine Comprehension, Minjoon Seo et al., ICLR 2017) on PyTorch.
subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
learning_to_rank
Learning to rank with LightGBM
ctrl
Conditional Transformer Language Model for Controllable Generation
GPT2-Chinese
Chinese version of GPT2 training code, using BERT or BPE tokenizer.
conversational-QG
Implementation for our ACL 2019 paper: Interconnected Question Generation with Coreference Alignment and Conversation Flow Modeling
HDSA-Dialog
Code and Data for ACL 2019 "Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention"
albert_zh
A Lite BERT for Self-supervised Learning of Language Representations; a large collection of pre-trained Chinese ALBERT models
albert_pytorch
The PyTorch version corresponding to albert_zh
AutoNER
Learning Named Entity Tagger from Domain-Specific Dictionary
text-segmentation
Implementation of the paper: Text Segmentation as a Supervised Learning Task