Joel Rorseth's starred repositories
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
TransformerLens
A library for mechanistic interpretability of GPT-style language models
Treasure-of-Transformers
💁 Awesome Treasure of Transformers Models for Natural Language Processing: papers, videos, blogs, and official repos, along with Colab notebooks. 🛫☑️
landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
FlexNeuART
Flexible classic and NeurAl Retrieval Toolkit
belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."
CNN-Units-in-NLP
✂️ Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs