EleutherAI's repositories
github-downloader
Script for downloading GitHub.
stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
best-download
URL downloader supporting checkpointing and continuous checksumming.
pile_dedupe
Pile Deduplication Code
tagged-pile
Part-of-Speech Tagging for the Pile and RedPajama
LLM-Markov-Chains
Project github for LLM Markov Chains Project
minetest-baselines
Baseline agents for Minetest tasks.
llemma-sample-explorer
Sample explorer tool for the Llemma models.
minetest-interpretabilty-notebook
Jupyter notebook for the interpretablity section of the minetester blog post
Unpaired-Image-Generation
Project Repo for Unpaired Image Generation project
latent-video-diffusion
Latent video diffusion
common-llm-settings
Common LLM Settings App
prefix-free-tokenizer
A prefix free tokenizer
truncated-gaussian
Method-of-moments estimation and sampling for truncated multivariate Gaussian distributions
gaia
Hugging Face and Pyserini interoperability
Plenoxels_FreeNerf
implmentation of Plenoxels radiance fields without neural networks, with free nerf strategy