Sachin Gururangan's repositories
haveibeenpwned
Python interface to Have I Been Pwned API
demix-data
Benchmark API for Multidomain Language Modeling
quality-filter
Code for "Whose language is high quality?" paper
ensemble-transformers
Ensembling Hugging Face transformers made easy
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
download_subreddits
Download Subreddit Data
ethics599p
Ethics 599P course website
kernelmachine.archived.github.io
A beautiful Jekyll theme for academics
kernelmachine.github.io.archive
personal site
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
logistic_regression
A logistic regression baseline for text classification
minimal
Minimal is a Jekyll theme for GitHub Pages
news-please
news-please - an integrated web crawler and information extractor for news that just works
nlp-corpora-backend
Staging grounds for nlp-corpora scripts and docs
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
shared-vec
Project word vectors into shared space with linear CCA
tpu_pretrain-1
LM Pretraining with PyTorch/TPU
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.