Joseph Bloom's repositories

SAELens

Training Sparse Autoencoders on Language Models

Language: HTML, License: MIT, Stargazers: 224, Issues: 9, Issues: 72

DecisionTransformerInterpretability

Interpreting how transformers simulate agents performing RL tasks

Language: Jupyter Notebook, License: MIT, Stargazers: 58, Issues: 4, Issues: 72

alphabetical_probe

Experimental code that trains 26 linear probes to detect the presence of each letter of the alphabet in GPT-J token strings, given the tokens' embeddings. It also explores the resulting vector arithmetic and its impact on GPT-J's spelling abilities.

Language: Jupyter Notebook, Stargazers: 2, Issues: 1, Issues: 0
Language: Python, License: MIT, Stargazers: 2, Issues: 0, Issues: 0

toy_model_interpretability

I'd like to start playing around with toy models to better understand results in recent papers.

Language: Python, Stargazers: 1, Issues: 0, Issues: 0
Language: Python, License: MIT, Stargazers: 1, Issues: 1, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

ARENA_2.0

I'm teaching ARENA 2.0 and providing students with direction on careers and personal development.

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

ARENA_2.0-RLHF

Preparing content for the ARENA RLHF day.

Language: Jupyter Notebook, Stargazers: 0, Issues: 3, Issues: 0
Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: HTML, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

geom_median

Fast and differentiable geometric median, a multivariate analogue of the median. Install with `pip install geom-median`.

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

Module-1

Module 1 - Autodifferentiation

Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: HTML, License: CC-BY-4.0, Stargazers: 0, Issues: 0, Issues: 0

protein-inference

A Python package for protein inference in mass spectrometry data analysis.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

rust_cli_project

I'm teaching myself Rust.

Language: Rust, Stargazers: 0, Issues: 2, Issues: 0

rust_text_editor

Learning by doing with Rust. Following along with the Hecto tutorial: https://www.philippflenker.com/hecto/

Language: Rust, Stargazers: 0, Issues: 2, Issues: 0

sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0