Baohao Liao (BaohaoLiao)

BaohaoLiao

Geek Repo

Company:University of Amsterdam

Location:Netherlands

Home Page:https://baohaoliao.github.io/

Github PK Tool:Github PK Tool

Baohao Liao's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:134922Issues:1124Issues:16128

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30502Issues:426Issues:4201

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:24596Issues:258Issues:311

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18374Issues:184Issues:731

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6958Issues:38Issues:1141

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4570Issues:77Issues:89

llm_interview_note

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonLicense:MITStargazers:3418Issues:28Issues:271
Language:PythonLicense:Apache-2.0Stargazers:2854Issues:33Issues:299
Language:PythonLicense:Apache-2.0Stargazers:1266Issues:16Issues:116

LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:1076Issues:11Issues:59

SimSiam

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Language:PythonLicense:MITStargazers:817Issues:9Issues:42

Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonLicense:MITStargazers:725Issues:16Issues:82

unify-parameter-efficient-tuning

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Language:PythonLicense:Apache-2.0Stargazers:516Issues:7Issues:16

ALMA

State-of-the-art LLM-based translation models.

Language:RubyLicense:MITStargazers:433Issues:13Issues:62

minimal-text-diffusion

A minimal implementation of diffusion models for text generation

Language:PythonLicense:MITStargazers:312Issues:8Issues:19

academic-budget-bert

Repository containing code for "How to Train BERT with an Academic Budget" paper

Language:PythonLicense:Apache-2.0Stargazers:309Issues:16Issues:22

DinkyTrain

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Language:PythonLicense:MITStargazers:112Issues:4Issues:9

DuQuant

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Language:PythonLicense:MITStargazers:99Issues:1Issues:3

SeqDiffuSeq

Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]

PrefixQuant

An algorithm for static activation quantization of LLMs

Language:PythonLicense:Apache-2.0Stargazers:67Issues:5Issues:10

difformer

The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)

Language:PythonLicense:MITStargazers:52Issues:4Issues:9

StableMask

PyTorch implementation of StableMask (ICML'24)

Language:PythonStargazers:11Issues:2Issues:0

3ml

Official code for 3ML (EMNLP 2022)

SIFo

Evaluating large language models for sequential instruction following (SIFo).

Language:PythonStargazers:5Issues:2Issues:0

chat-task-2024-data

Data for WMT 2024 Chat Shared Task

Language:PythonLicense:NOASSERTIONStargazers:5Issues:0Issues:0

NLP-reproduction

Offer straightforward guidance to reproduce the results of NLP papers.

Language:PythonLicense:MITStargazers:3Issues:2Issues:0
Language:CSSLicense:NOASSERTIONStargazers:1Issues:1Issues:0