firstuserhere's repositories

Language:HTMLStargazers:3Issues:0Issues:0

gpt4Vadvanced

Testing GPT-4 Vision on Advanced examination questions (2023) across physics, chemistry, and mathematics

Language:JavaScriptStargazers:3Issues:0Issues:0

metalearning

This is a repository and github pages website deployment for my work on the mechanistic analysis of out-of-context meta-learning in LLMs

Language:SCSSLicense:MITStargazers:2Issues:0Issues:0

awesome-mech-interp

An awesome curated list of resources dedicated to Mechanistic interpretability

basic-scripts

a bunch of basic scripts hacked together but working and are maybe useful for me

Language:Jupyter NotebookStargazers:1Issues:1Issues:0

firstuserhere.github.io

This is my website

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0

multimodal-mechinterp

Basic mech interp analysis for some multimodal models

outofcontextnotes

This repository holds my notes and thoughts (always WIP) while doing work on the "out of context meta learning" project.

Language:RubyLicense:MITStargazers:1Issues:0Issues:0

replications

My attempts at replicating results of papers

Stargazers:1Issues:0Issues:0
Language:HTMLStargazers:1Issues:0Issues:0
Stargazers:0Issues:1Issues:0

aisc_oocl_experiments

experiments trying to elicit out of context learning when training a transformer on a simple task

Stargazers:0Issues:0Issues:0

ComPromptMized

ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications

Stargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Improved-worldmodels

Critiques of the pre-print, suggestions for improvement, and counterfactual examples testing

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

miras-sudoku-solution

Fork of a possible solution for testing

Stargazers:0Issues:0Issues:0

nanogenmo

National Novel Generation Month, 2023 edition.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SPARta

LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces

Stargazers:0Issues:0Issues:0

transformer-debugger

My fork of the original transformer Debugger library by openAI

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

transformerperspectives

Looking at data through the perspective of different components of a transformer model

Language:RubyLicense:MITStargazers:0Issues:0Issues:0

visualize-SAE

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

License:MITStargazers:0Issues:0Issues:0

ViT-Prisma

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Whisper-mechinterp

Mechanistic Interpretability for Whisper

Stargazers:0Issues:0Issues:0