Repositories under the pre-trained-language-models topic:
Chinese LLaMA & Alpaca large language models with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
The official GitHub page for the survey paper "A Survey of Large Language Models".
An Open-Source Framework for Prompt-Learning.
Must-read papers on prompt-based tuning for pre-trained language models.
RoBERTa Chinese pre-trained models: RoBERTa for Chinese
Must-read Papers on Knowledge Editing for Large Language Models.
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
A novel method to tune language models. Code and datasets for the paper "GPT Understands, Too".
Awesome papers on Language-Model-as-a-Service (LMaaS)
Keyphrase or keyword extraction: a Chinese keyword extraction method based on pre-trained models (Chinese-language code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model")
A PyTorch-based model pruning toolkit for pre-trained language models
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformers. Happy hugging for NLP! 😊 HugNLP will be released to @HugAILab
[ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
Must-read papers on improving efficiency for pre-trained language models.
The Paper List on Data Contamination for Large Language Models Evaluation.
We tackle a company-name recognition task with small-scale, low-quality training data, then apply techniques to speed up model training and improve prediction performance with minimal manual effort. The methods include lightweight pre-trained models such as ALBERT-small and ELECTRA-small trained on a financial corpus, knowledge distillation, and multi-stage learning. As a result, recall on the company-name recognition task improves from 0.73 to 0.92, and inference runs about 4 times faster than a BERT-BiLSTM-CRF model.
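As a rough illustration of the knowledge-distillation component mentioned in the entry above, the sketch below shows a standard soft-target distillation loss in PyTorch. The tag count, tensor shapes, temperature, and weighting are placeholder assumptions for illustration, not code from the repository.

```python
# Illustrative knowledge-distillation loss (assumed hyperparameters, not repo code).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend the soft-target KL loss (teacher -> student) with the hard-label CE loss."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Toy usage: a larger teacher's logits guide a small student (e.g. an ALBERT-small
# tagging head) on the same batch; shapes and tag set are made up for the example.
student_logits = torch.randn(8, 5, requires_grad=True)  # 8 tokens, 5 entity tags
teacher_logits = torch.randn(8, 5)
labels = torch.randint(0, 5, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```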
📔 Usage instructions and core-code annotations for Chinese-LLaMA-Alpaca
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
PLM-based Korean named entity recognition (NER)
A repository listing important datasets for multimodal recommender systems
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
LingLong (玲珑): a small-scale Chinese pretrained language model
Calculating FLOPs of Pre-trained Models in NLP
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
Question and answer generation (QAG) is a natural language processing (NLP) task that generates a question and an answer at the same time from context information. The input context can be represented as structured information in a database or as raw text. The outputs of QAG systems can be directly applied to several NLP applications...
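To make the QAG task description above concrete, here is a minimal sketch of generating a question from a raw-text context with a HuggingFace text2text pipeline. The checkpoint name and prompt prefix are hypothetical placeholders, not the interface of this repository.

```python
# Illustrative QAG sketch; the model checkpoint and prompt format are assumptions.
from transformers import pipeline

context = (
    "William Turner was an English painter who specialised in watercolour "
    "landscapes. He is one of the most well-known painters of his era."
)

# A text2text model fine-tuned for answer-aware question generation would take the
# context (optionally with a highlighted answer span) and emit a question string.
qag = pipeline("text2text-generation", model="your-org/qag-model")  # placeholder checkpoint
question = qag("generate question: " + context)[0]["generated_text"]
print(question)
```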