Tianjian Li (tianjianl)


Company: Johns Hopkins University

Location: Baltimore, MD

Home Page: tianjianl.github.io

Twitter: @tli104


Tianjian Li's starred repositories

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 188 · Issues: 0
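Dataset Cartography characterizes each training example by its training dynamics: "confidence" (the mean probability the model assigns to the gold label across epochs) and "variability" (the standard deviation of that probability). A minimal sketch of those two statistics, not the repository's own code; the function name and input format are illustrative:

```python
import statistics

def cartography_stats(gold_probs_per_epoch):
    """Per-example training dynamics in the Dataset Cartography sense.

    gold_probs_per_epoch: the model's probability of the gold label for
    one example, recorded once per training epoch.
    Returns (confidence, variability).
    """
    # Confidence: mean gold-label probability over epochs.
    confidence = statistics.fmean(gold_probs_per_epoch)
    # Variability: population standard deviation over epochs.
    variability = statistics.pstdev(gold_probs_per_epoch)
    return confidence, variability
```

Examples with high confidence and low variability are "easy to learn"; high variability marks "ambiguous" examples, which the paper finds most useful for generalization.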

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python · License: Apache-2.0 · Stars: 2056 · Issues: 0
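DPO optimizes a simple pairwise objective: the negative log-sigmoid of a scaled margin between the policy-vs-reference log-ratios of a chosen and a rejected response. A minimal pure-Python sketch of the per-pair loss, assuming summed response log-probabilities are already computed (the function name and arguments are illustrative, not the repository's API):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a response under
    the trainable policy or the frozen reference model.
    """
    # Implicit reward margin: beta * (chosen log-ratio - rejected log-ratio).
    logits = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # Negative log-sigmoid of the margin.
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

When the policy matches the reference, the margin is zero and the loss equals log 2; it falls as the policy raises the chosen response's likelihood relative to the rejected one.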

pytorch-lightning

Pretrain, fine-tune, and deploy AI models on multiple GPUs and TPUs with zero code changes.

Language: Python · License: Apache-2.0 · Stars: 28085 · Issues: 0

ParroT

The ParroT framework enhances and regulates the translation abilities of chat-oriented open-source LLMs (e.g., LLaMA-7B, BLOOMZ-7B1-mt) using human-written translation and evaluation data.

Language: Python · Stars: 166 · Issues: 0

ALMA

State-of-the-art LLM-based translation models.

Language: Ruby · License: MIT · Stars: 395 · Issues: 0

Tk-Instruct

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

Language: Python · License: MIT · Stars: 177 · Issues: 0

Glot500

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages (ACL 2023)

Language: Python · License: NOASSERTION · Stars: 96 · Issues: 0

TaiLr

Tailoring Language Generation Models under Total Variation Distance (ICLR 2023)

Language: Python · License: MIT · Stars: 21 · Issues: 0
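TaiLr builds a training objective around total variation distance (TVD) rather than KL divergence. The paper's estimator is more involved, but the underlying distance is simple: half the L1 distance between two distributions over the same support. A minimal sketch of TVD itself (illustrative, not the repository's code):

```python
def total_variation_distance(p, q):
    """TVD between two discrete distributions over the same support:
    0.5 * sum_i |p_i - q_i|, bounded in [0, 1]."""
    return 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))
```

TVD is 0 for identical distributions and 1 for distributions with disjoint support, which is part of why it penalizes low-quality samples less aggressively than KL.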

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python · License: Apache-2.0 · Stars: 2183 · Issues: 0

DITTO

Code for the paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" (NeurIPS 2022).

Language: Python · License: MIT · Stars: 39 · Issues: 0

COMET

A Neural Framework for MT Evaluation

Language: Python · License: Apache-2.0 · Stars: 492 · Issues: 0

EKFAC-pytorch

PyTorch code for the EKFAC and K-FAC preconditioners.

Language: Python · License: MIT · Stars: 139 · Issues: 0

LASER

Language-Agnostic SEntence Representations

Language: Jupyter Notebook · License: NOASSERTION · Stars: 3583 · Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP and PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, along with several inference solutions (e.g., HF TGI, vLLM) for local or cloud deployment, plus demo apps showcasing Meta Llama for WhatsApp and Messenger.

Language: Jupyter Notebook · Stars: 11937 · Issues: 0

flores

Facebook Low Resource (FLoRes) MT Benchmark

Language: Python · License: NOASSERTION · Stars: 694 · Issues: 0

mtdata

A tool that locates, downloads, and extracts machine translation corpora

Language: Python · License: Apache-2.0 · Stars: 147 · Issues: 0

llama

Inference code for Llama models

Language: Python · License: NOASSERTION · Stars: 55781 · Issues: 0

stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Language: Python · License: MIT · Stars: 247 · Issues: 0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache-2.0 · Stars: 3285 · Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python · License: NOASSERTION · Stars: 10144 · Issues: 0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Language: Python · License: Apache-2.0 · Stars: 6855 · Issues: 0

xmtf

Crosslingual Generalization through Multitask Finetuning

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 513 · Issues: 0

Intra-Distillation

Repository for the EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".

Language: Python · Stars: 10 · Issues: 0

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language: Python · License: Apache-2.0 · Stars: 6281 · Issues: 0

pytorch-pruning

PyTorch implementation of "Pruning Convolutional Neural Networks for Resource Efficient Inference" (arXiv:1611.06440).

Language: Python · Stars: 873 · Issues: 0
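The pruning criterion in that paper is a first-order Taylor estimate of each unit's effect on the loss, proportional to |activation × gradient|: units whose removal barely changes the loss are pruned first. A minimal pure-Python sketch of ranking units by that saliency and building a keep-mask (illustrative, not the repository's code, which operates on whole convolutional filters):

```python
def taylor_prune_mask(activations, gradients, prune_ratio):
    """Rank units by the first-order Taylor saliency |a * g| and
    return a 0/1 keep-mask that drops the lowest-scoring fraction."""
    # Saliency per unit: magnitude of activation times its gradient.
    scores = [abs(a * g) for a, g in zip(activations, gradients)]
    n_prune = int(len(scores) * prune_ratio)
    # Indices sorted by ascending saliency; the smallest get pruned.
    order = sorted(range(len(scores)), key=scores.__getitem__)
    pruned = set(order[:n_prune])
    return [0 if i in pruned else 1 for i in range(len(scores))]
```

In practice the paper prunes iteratively, re-estimating saliencies and fine-tuning between pruning steps rather than masking once.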

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python · License: MIT · Stars: 6576 · Issues: 0

XLM

Original PyTorch implementation of Cross-lingual Language Model (XLM) Pretraining.

Language: Python · License: NOASSERTION · Stars: 2877 · Issues: 0

unify-parameter-efficient-tuning

Implementation of the paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022).

Language: Python · License: Apache-2.0 · Stars: 514 · Issues: 0