heyLinsir

followers

following

stars

Yilin Niu's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.040447 394 1294

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.038662 384 1652

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.034889 342 2739

google-research

Google Research

Language:Jupyter NotebookApache-2.033903 750 1245

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonNOASSERTION15700 132 615

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause13559 115 1039

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonApache-2.013355 98 777

ShiArthur03

Language:MATLABGPL-3.010377 32 1357

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Language:PythonApache-2.07652 99 198

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.06009 74 534

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonApache-2.05228 49 187

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT4457 49 289

FlagAI

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Language:PythonApache-2.03815 43 210

ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Language:PythonApache-2.03650 32 374

ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等

Language:Python2638 15 146

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonApache-2.02044 19 81

Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

MOSS-RLHF

MOSS-RLHF

Language:PythonApache-2.01272 34 52

Task-Oriented-Dialogue-Research-Progress-Survey

A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.

sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Language:PythonApache-2.01045 19 156

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language:PythonApache-2.0983 17 61

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonNOASSERTION976 148 21

perspectiveapi

Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.

Apache-2.0885 500

Awesome-Code-LLM

👨‍💻 An awesome and curated list of best code-LLM for research.

Question-Generation-Paper-List

A summary of must-read papers for Neural Question Generation (NQG)

MetaICL

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Language:PythonNOASSERTION251 9 21

FineGrainedRLHF

Language:PythonApache-2.0248 8 13

transition-amr-parser

SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as statistical significance Smatch.

Language:PythonApache-2.0237 13 48

amrlib

A python library that makes AMR parsing, generation and visualization simple.

Language:PythonMIT219 6 70

unsupervised-passage-reranking

Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"

Language:Python92 5 3