Tianjian Li (tianjianl)

tianjianl

Geek Repo

Company:Johns Hopkins University

Location:Baltimore, MD

Home Page:tianjianl.github.io

Twitter:@tli104

Github PK Tool:Github PK Tool

Tianjian Li's starred repositories

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27817Issues:247Issues:7025

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25323Issues:219Issues:4090

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9081Issues:74Issues:1051

OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Language:PythonLicense:MITStargazers:6695Issues:176Issues:1443

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2186Issues:32Issues:102

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2152Issues:25Issues:54

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1968Issues:19Issues:79

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonLicense:MITStargazers:920Issues:13Issues:192

P-tuning

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Language:PythonLicense:MITStargazers:911Issues:23Issues:50

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language:PythonLicense:NOASSERTIONStargazers:668Issues:12Issues:48
Language:PythonLicense:Apache-2.0Stargazers:549Issues:9Issues:18

dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

ALMA

State-of-the-art LLM-based translation models.

Language:RubyLicense:MITStargazers:375Issues:12Issues:51

BRIO

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

pytorch_influence_functions

This is a PyTorch reimplementation of Influence Functions from the ICML2017 best paper: Understanding Black-box Predictions via Influence Functions by Pang Wei Koh and Percy Liang.

Language:PythonLicense:NOASSERTIONStargazers:311Issues:7Issues:32

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLLicense:MITStargazers:284Issues:5Issues:29

dsir

DSIR large-scale data selection framework for language model training

Language:PythonLicense:MITStargazers:211Issues:21Issues:7

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:188Issues:7Issues:9

Tk-Instruct

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

Language:PythonLicense:MITStargazers:176Issues:4Issues:26

ParroT

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

wimbd

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Language:PythonLicense:Apache-2.0Stargazers:162Issues:6Issues:10

QuRating

[ICML 2024] Selecting High-Quality Data for Training Language Models

Glot500

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

Language:PythonLicense:NOASSERTIONStargazers:96Issues:8Issues:7

submodlib

Summarize Massive Datasets using Submodular Optimization

Language:Jupyter NotebookLicense:MITStargazers:85Issues:7Issues:17

Gradient_Starvation

Gradient Starvation: A Learning Proclivity in Neural Networks

Language:PythonLicense:MITStargazers:59Issues:6Issues:3

TaiLr

ICLR2023 - Tailoring Language Generation Models under Total Variation Distance

Language:PythonLicense:MITStargazers:20Issues:2Issues:0

InstructMT

A collection of instruction data and scripts for machine translation.

Language:PythonStargazers:19Issues:1Issues:0

ParetoMNMT

Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023

Language:PythonStargazers:15Issues:1Issues:0

Integer_Addition

✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks

Language:Jupyter NotebookLicense:MITStargazers:12Issues:3Issues:1