Yilin Niu's starred repositories
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ColossalAI
Making large AI models cheaper, faster and more accessible
google-research
Google Research
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
flash-attention
Fast and memory-efficient exact attention
DeepSpeedExamples
Example models using DeepSpeed
ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
ChatGLM-Finetuning
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Chain-of-ThoughtsPapers
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
Task-Oriented-Dialogue-Research-Progress-Survey
A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.
summarize-from-feedback
Code for "Learning to summarize from human feedback"
perspectiveapi
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
Awesome-Code-LLM
👨💻 An awesome and curated list of best code-LLM for research.
Question-Generation-Paper-List
A summary of must-read papers for Neural Question Generation (NQG)
transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as statistical significance Smatch.
unsupervised-passage-reranking
Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"