smj0's repositories
Adaptive-Decision-Boundary
Deep Open Intent Classification with Adaptive Decision Boundary (AAAI2021)
AwesomeMRC
This repo is our research summary and playground for MRC. More features are coming.
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
BERT-whitening
简单的向量白化改善句向量质量
ConSERT
Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
CrossWeigh
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
DC-Match
ACL-2022 paper: Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents.
DeepAligned-Clustering
Discovering New Intents with Deep Aligned Clustering (AAAI 2021)
dialogue-utterance-rewriter-transformer
ACL 2019论文复现:Improving Multi-turn Dialogue Modelling with Utterance ReWriter
DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
DuReader
Baseline Systems of DuReader Dataset
GAR
Code for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021
GOT
Source code for 《Energy-based Unknown Intent Detection with Data Manipulation》, which is accepted by Findings of ACL, 2021.
IS-BERT
An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020)
KMRC-Papers
A list of recent papers on knowledge-based machine reading comprehension.
LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF with QLoRA)
out-of-scope-intent-detection
Out-of-Scope Intent Detection
PolyEncoder
An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)
Quasi-Attention-ABSA
The codebase for a new quasi-attention BERT model for TABSA tasks
sccl
Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
Task-Oriented-Dialogue-Dataset-Survey
A dataset survey about task-oriented dialogue, including recent datasets and SoA results & papers.
TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
toutiao-text-classfication-dataset
今日头条中文新闻(文本)分类数据集
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.