tongshoujie

tongshoujie

Geek Repo

Github PK Tool:Github PK Tool

tongshoujie's starred repositories

python-small-examples

告别枯燥,致力于打造 Python 实用小例子,更多Python良心教程见 https://ai-jupyter.com

nlpaug

Data augmentation for NLP

Language:Jupyter NotebookLicense:MITStargazers:4405Issues:41Issues:221

mixup-cifar10

mixup: Beyond Empirical Risk Minimization

Language:PythonLicense:NOASSERTIONStargazers:1156Issues:21Issues:16

CLUENER2020

A PyTorch implementation of a BiLSTM\BERT\Roberta(+CRF) model for Named Entity Recognition.

MixText

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

Language:Jupyter NotebookLicense:MITStargazers:349Issues:6Issues:34

WordSeg

A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .

GAIN

Source code for EMNLP 2020 paper: Double Graph Based Reasoning for Document-level Relation Extraction

Language:PythonLicense:MITStargazers:142Issues:6Issues:40

GIT

Source code for ACL-IJCNLP 2021 Long paper: Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.

TaCL

[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

ChildTuning

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

ParaSCI

a large scientific paraphrase dataset for longer paraphrase generation

ContrastivePruning

Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》

HCL-Text2AMR

Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"

Behind-the-Scenes

Code and data for CIKM-2021 paper《Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification》