Tianyu Gao (gaotianyu1350)

gaotianyu1350

Geek Repo

Company:@princeton-nlp

Home Page:https://gaotianyu.xyz/about/

Twitter:@gaotianyu1350

Github PK Tool:Github PK Tool


Organizations
princeton-nlp

Tianyu Gao's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:67758Issues:558Issues:711

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29390Issues:339Issues:268

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonLicense:MITStargazers:10478Issues:283Issues:1544

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9124Issues:111Issues:81

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6960Issues:74Issues:205

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6459Issues:112Issues:294

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5756Issues:48Issues:968

galai

Model API for GALACTICA

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2675Issues:44Issues:71

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonLicense:MITStargazers:1201Issues:21Issues:87

unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language:PythonLicense:MITStargazers:1052Issues:23Issues:60

rank_bm25

A Collection of BM25 Algorithms in Python

Language:PythonLicense:Apache-2.0Stargazers:1007Issues:10Issues:31

OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language:PythonLicense:Apache-2.0Stargazers:987Issues:17Issues:61

OpenCLaP

Open Chinese Language Pre-trained Model Zoo

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language:PythonLicense:NOASSERTIONStargazers:665Issues:15Issues:17

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Language:PythonLicense:Apache-2.0Stargazers:551Issues:11Issues:87

LegalPapers

Must-read Papers on Legal Intelligence

incoder

Generative model for code infilling and synthesis

knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Language:PythonLicense:MITStargazers:269Issues:4Issues:11

dpr-scale

Scalable training for dense retrieval models.

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Language:PythonLicense:Apache-2.0Stargazers:233Issues:7Issues:19

TRIME

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

LegalPLMs

Source code and checkpoints for legal pre-trained language models.

ACL-anthology-corpus

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Language:Jupyter NotebookStargazers:167Issues:8Issues:3

MAVEN-dataset

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Language:PythonLicense:MITStargazers:151Issues:8Issues:17

rankgen

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Language:PythonLicense:Apache-2.0Stargazers:136Issues:5Issues:12

attribute_charge

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Language:PythonLicense:MITStargazers:95Issues:11Issues:7

jec-qa

The respository of jec-qa.

Language:PythonLicense:MITStargazers:23Issues:7Issues:3