BaohaoLiao

Baohao Liao's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0134922 1124 16128

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT30502 426 4201

generative-models

Generative Models by Stability AI

Language:PythonMIT24596 258 311

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonApache-2.018374 184 731

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT6958 38 1141

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonApache-2.04570 77 89

llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

Language:HTML3670 18 6

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonMIT3418 28 271

LLaVA-NeXT

Language:PythonApache-2.02854 33 299

open-instruct

Language:PythonApache-2.01266 16 116

LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Language:PythonApache-2.01076 11 59

SimSiam

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Language:PythonMIT817 9 42

Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

755 16 3

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonMIT725 16 82

unify-parameter-efficient-tuning

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Language:PythonApache-2.0516 7 16

ALMA

State-of-the-art LLM-based translation models.

Language:RubyMIT433 13 62

minimal-text-diffusion

A minimal implementation of diffusion models for text generation

Language:PythonMIT312 8 19

academic-budget-bert

Repository containing code for "How to Train BERT with an Academic Budget" paper

Language:PythonApache-2.0309 16 22

DinkyTrain

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Language:PythonMIT112 4 9

DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Language:PythonMIT99 1 3

SeqDiffuSeq

Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]

Language:Python91 5 26

PrefixQuant

An algorithm for static activation quantization of LLMs

Language:PythonApache-2.067 5 10

difformer

The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)

Language:PythonMIT52 4 9

collages-dataset

Language:MATLAB15 2 2

StableMask

PyTorch implementation of StableMask (ICML'24)

Language:Python11 20

3ml

Official code for 3ML (EMNLP 2022)

7 1 2

SIFo

Evaluating large language models for sequential instruction following (SIFo).

Language:Python5 20

chat-task-2024-data

Data for WMT 2024 Chat Shared Task

Language:PythonNOASSERTION500

NLP-reproduction

Offer straightforward guidance to reproduce the results of NLP papers.

Language:PythonMIT3 20

baohaoliao.github.io

Language:CSSNOASSERTION1 10