EleutherAI

Organization data from GitHub: https://github.com/EleutherAI

Location: The Internet

Home Page: www.eleuther.ai

GitHub: @EleutherAI

Twitter: @AIEleuther

EleutherAI's repositories

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 10104 | Issues: 38 | Issues: 1327

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Language: Python | License: Apache-2.0 | Stargazers: 7301 | Issues: 126 | Issues: 456

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2449 | Issues: 34 | Issues: 110

cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Language: Python | License: Apache-2.0 | Stargazers: 784 | Issues: 13 | Issues: 14

sparsify

Sparsify transformers with SAEs and transcoders

Language: Python | License: MIT | Stargazers: 503 | Issues: 4 | Issues: 25

concept-erasure

Erasing concepts from neural representations with provable guarantees

Language: Python | License: MIT | Stargazers: 228 | Issues: 8 | Issues: 6

elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Language: Python | License: MIT | Stargazers: 197 | Issues: 5 | Issues: 90

delphi

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Language: Python | License: Apache-2.0 | Stargazers: 166 | Issues: 1 | Issues: 33

DeeperSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 165 | Issues: 3 | Issues: 0

nanoGPT-mup

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 103 | Issues: 0 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 45 | Issues: 4 | Issues: 0

aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Language: Python | License: Apache-2.0 | Stargazers: 34 | Issues: 2 | Issues: 2

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 21 | Issues: 1 | Issues: 0

mdl

Minimum Description Length probing for neural network representations

Language: Python | License: MIT | Stargazers: 19 | Issues: 2 | Issues: 0

polyapprox

Closed-form polynomial approximations to neural networks

Language: Python | License: MIT | Stargazers: 11 | Issues: 1 | Issues: 0

transformer-reasoning

Experiments in transformer knowledge and reasoning

Language: Jupyter Notebook | License: MIT | Stargazers: 10 | Issues: 0 | Issues: 0

cupbearer

A library for mechanistic anomaly detection

Language: Jupyter Notebook | License: MIT | Stargazers: 6 | Issues: 0 | Issues: 0

tyche

Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6 | Issues: 2 | Issues: 0

Language: Python | License: MIT | Stargazers: 5 | Issues: 3 | Issues: 5

Language: Python | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 2 | Issues: 0

website

New website for EleutherAI, based on the Hugo static site generator

Language: HTML | Stargazers: 4 | Issues: 2 | Issues: 0

sae_overlap

Accompanying code for our research on SAE feature overlap when trained on different seeds.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3 | Issues: 1 | Issues: 0

aria-utils

MIDI tokenizers and pre-processing utils.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

rtopk

PyTorch wrapper for https://github.com/xiexi51/RTopK

Language: Cuda | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

open-r1

Fully open reproduction of DeepSeek-R1

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

POSER

Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0