firstuserhere's repositories

gpt4Vadvanced

Testing GPT-4 Vision on Advanced examination questions (2023) across physics, chemistry, and mathematics

Language:JavaScriptStargazers:3Issues:1Issues:0

metalearning

This is a repository and github pages website deployment for my work on the mechanistic analysis of out-of-context meta-learning in LLMs

Language:SCSSLicense:MITStargazers:2Issues:0Issues:0

awesome-mech-interp

An awesome curated list of resources dedicated to Mechanistic interpretability

basic-scripts

a bunch of basic scripts hacked together but working and are maybe useful for me

Language:Jupyter NotebookStargazers:1Issues:1Issues:0

firstuserhere.github.io

This is my website

Language:HTMLLicense:MITStargazers:1Issues:1Issues:0

multimodal-mechinterp

Basic mech interp analysis for some multimodal models

neurips-workshops-2024

I wanted to have the neurips workshops organized neatly so created this page

Stargazers:1Issues:0Issues:0

outofcontextnotes

This repository holds my notes and thoughts (always WIP) while doing work on the "out of context meta learning" project.

Language:RubyLicense:MITStargazers:1Issues:0Issues:0
Language:HTMLStargazers:1Issues:0Issues:0
Stargazers:0Issues:1Issues:0

aisc_oocl_experiments

experiments trying to elicit out of context learning when training a transformer on a simple task

Language:PythonStargazers:0Issues:0Issues:0

ComPromptMized

ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications

Stargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Improved-worldmodels

Critiques of the pre-print, suggestions for improvement, and counterfactual examples testing

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

miras-sudoku-solution

Fork of a possible solution for testing

Stargazers:0Issues:0Issues:0

nanogenmo

National Novel Generation Month, 2023 edition.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SPARta

LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces

Stargazers:0Issues:0Issues:0

transformer-debugger

My fork of the original transformer Debugger library by openAI

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

transformerperspectives

Looking at data through the perspective of different components of a transformer model

Language:RubyLicense:MITStargazers:0Issues:1Issues:0

visualize-SAE

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

ViT-Prisma

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Whisper-mechinterp

Mechanistic Interpretability for Whisper

Stargazers:0Issues:1Issues:0