Sheng Shen's repositories
prag_generation
[NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301
one_layer_lottery_ticket
[EMNLP 2021] code for "Whatās Hidden in a One-layer Randomly Weighted Transformer?"
few-shot-learning
Few-shot Learning of GPT-3
google-drive-downloader
Minimal class to download shared files from Google Drive.
java-nlp-toolkit
My personal Java NLP toolkit that serves as an interface to various existing NLP libraries.
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
promptsource
Toolkit for collecting and applying templates of prompting instances
sincerass.github.io
Sheng (Arnold) Shen's homepage
transformers
š¤ Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs