gaotianyu1350

followers

following

stars

@princeton-nlp

https://gaotianyu.xyz/about/

Organizations

princeton-nlp

Tianyu Gao's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION67758 558 711

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.029390 339 268

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonMIT10478 283 1544

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonApache-2.09124 111 81

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookApache-2.06960 74 205

metaseq

Repo for external large-scale work

Language:PythonMIT6459 112 294

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonMIT5756 48 968

galai

Model API for GALACTICA

Language:Jupyter NotebookApache-2.02675 44 71

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonMIT1201 21 87

unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language:PythonMIT1052 23 60

rank_bm25

A Collection of BM25 Algorithms in Python

Language:PythonApache-2.01007 10 31

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language:PythonApache-2.0987 17 61

OpenCLaP

Open Chinese Language Pre-trained Model Zoo

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language:PythonNOASSERTION665 15 17

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Language:PythonApache-2.0551 11 87

LegalPapers

Must-read Papers on Legal Intelligence

incoder

Generative model for code infilling and synthesis

Language:Python294 9 18

knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Language:PythonMIT269 4 11

dpr-scale

Scalable training for dense retrieval models.

Language:Python268 18 13

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Language:PythonApache-2.0233 7 19

TRIME

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Language:Python191 8 8

LegalPLMs

Source code and checkpoints for legal pre-trained language models.

Language:Python170 7 11

ACL-anthology-corpus

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Language:Jupyter Notebook167 8 3

MAVEN-dataset

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Language:PythonMIT151 8 17

rankgen

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Language:PythonApache-2.0136 5 12

attribute_charge

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Language:Python128 10 14

TopJudge

Language:PythonMIT95 11 7

CLAIM

MIT77 11 2

jec-qa

The respository of jec-qa.

Language:Python48 7 12

QAJudge

Language:PythonMIT23 7 3