XingWu_UCAS's repositories
ir
ConTextual Mask Auto-Encoder for Dense Passage Retrieval
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Prompt-BERT
Prompt-BERT: Prompt makes BERT Better at Sentence Embeddings
CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
TransformersDataAugmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
WoBERT
以词为基本单位的中文BERT
GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
english-wordlists
常用英语词汇表
DeepLearningExamples
Deep Learning Examples
sentence-transformers
Sentence Embeddings with BERT & XLNet
lihang-code
《统计学习方法》的代码实现
CUDA-Programming
Sample codes for my CUDA programming book
nlp
兜哥出品 <一本开源的NLP入门书籍>
sent-conv-torch
Text classification using a convolutional neural network.
PALM
Paddle for Multi-task
awesome-data-augmentation
Papers and repos for data augmentation
TransSent_dataset
dataset constructed for sentence transfer task
PBT-paddle
Population Based Training in PaddlePaddle
1024er.github.io
1024er's homepage
MLM_transfer
Implemetation of MLM_transfer