dmahan93's repositories
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
grimoire-exploration
Diving into LLMs like they're a grimoire
algorithm-distillation-from-conversations
Algorithm Distillation + Pretraining Language Models with Human Preferences + Chat
bitsandbytes
8-bit CUDA functions for PyTorch
codeclippy_postprocessing
https://github.com/huggingface/transformers/blob/main/examples/research_projects/codeparrot/scripts but edited to do just one thing
Compact-Transformers
[Preprint] Escaping the Big Data Paradigm with Compact Transformers, 2021
ELM
Evolution Through Large Models Implementation
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
pyIesorPhysics
Python version of IeSOR
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
emoggoth
Generate your favorite emoji shoggoth!
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
tclx
A repository for transformer critique learning and generation
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs