Asa Cooper Stickland's repositories
llama3-jailbreak
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
situational-awareness-evals
Measuring the situational awareness of language models
knowledge-erasure
Removing knowledge from language models
probing_llama
Some probing experiments
memit
Mass-editing thousands of facts into a transformer memory (MEMIT)
inverse-scaling-eval-pipeline
Basic pipeline for running different-sized GPT models and plotting their results and calibration
transparent
Exposing transformer language model internals.
FlatMinimaInterpretability
Figuring out connections between loss landscapes and interpretability.
gpt3-arithmetic
Scratchpad/Chain-of-Thought Prompts
hf-sharpness
Simple implementation of flat minima methods (SAM, fisher penalty) for Huggingface trainer.
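As a rough illustration of the SAM (sharpness-aware minimization) step this repo implements for the Huggingface trainer, here is a minimal standalone sketch on a toy function; the function, learning rate, and `rho` are illustrative assumptions, not values from the repo:

```python
import numpy as np

def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    """One sharpness-aware minimization step (toy sketch).

    SAM first perturbs the weights toward the worst-case direction
    (the normalized gradient, scaled by rho), then applies the
    ordinary gradient update using the gradient at the perturbed point.
    """
    g = grad_fn(w)
    # Ascent step to the (approximate) worst-case point in an L2 ball.
    eps = rho * g / (np.linalg.norm(g) + 1e-12)
    # Descent step using the gradient evaluated at the perturbed weights.
    g_adv = grad_fn(w + eps)
    return w - lr * g_adv

# Toy usage: minimize f(w) = w^2 (gradient 2w) from w = 3.
w = np.array([3.0])
for _ in range(100):
    w = sam_step(w, lambda x: 2 * x)
```

In the Huggingface-trainer setting the two gradient evaluations would be two forward/backward passes per batch; the sketch above only shows the two-step update rule itself.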
lm-sandbox
Create scaling-law examples and web demos using the OpenAI GPT-3 API or Huggingface-compatible LMs with just a few lines of Python.
adapter-transformers
Huggingface Transformers + Adapters = ❤️
ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Bert-n-Pals
Pytorch implementation of Bert and Pals: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning (https://arxiv.org/abs/1902.02671)
pytorch-pretrained-BERT
A PyTorch implementation of Google AI's BERT model provided with Google's pre-trained models, examples and utilities.
darts
Differentiable architecture search for convolutional and recurrent networks
mf-amazon
Matrix factorization with Amazon data
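To illustrate the technique this repo applies, here is a minimal matrix-factorization sketch using SGD on observed entries; the rank, learning rate, and regularization are illustrative assumptions, not values from the repo:

```python
import numpy as np

def factorize(R, k=2, lr=0.02, reg=0.01, epochs=2000, seed=0):
    """Factor a ratings matrix R (NaN = missing) into U @ V.T via SGD."""
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = 0.1 * rng.standard_normal((n_users, k))
    V = 0.1 * rng.standard_normal((n_items, k))
    # Only train on the observed (non-NaN) entries.
    obs = [(i, j) for i in range(n_users) for j in range(n_items)
           if not np.isnan(R[i, j])]
    for _ in range(epochs):
        for i, j in obs:
            err = R[i, j] - U[i] @ V[j]
            # Gradient step on the squared error with L2 regularization.
            U[i] += lr * (err * V[j] - reg * U[i])
            V[j] += lr * (err * U[i] - reg * V[j])
    return U, V

# Toy usage: a rank-1 matrix with one entry held out as missing.
R = np.outer([1.0, 2.0, 3.0], [1.0, 2.0])
R[0, 1] = np.nan
U, V = factorize(R)
```

The held-out entry is then predicted by `U[0] @ V[1]`, the usual collaborative-filtering completion step.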
Ideas
Master document and a short description of SGHMC (stochastic gradient Hamiltonian Monte Carlo) experiments
Optimizer_Visualisers
Visualise Adam and other optimisers
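The Adam update rule being visualised can be sketched as follows; this is the standard bias-corrected form on a toy 1-D problem, with hyperparameters chosen for illustration rather than taken from the repo:

```python
import numpy as np

def adam_trajectory(grad_fn, x0, lr=0.1, beta1=0.9, beta2=0.999,
                    eps=1e-8, steps=200):
    """Run Adam on a scalar objective and return the iterate trajectory."""
    x, m, v = x0, 0.0, 0.0
    traj = [x]
    for t in range(1, steps + 1):
        g = grad_fn(x)
        m = beta1 * m + (1 - beta1) * g        # first-moment (momentum) EMA
        v = beta2 * v + (1 - beta2) * g * g    # second-moment EMA
        m_hat = m / (1 - beta1 ** t)           # bias correction
        v_hat = v / (1 - beta2 ** t)
        x = x - lr * m_hat / (np.sqrt(v_hat) + eps)
        traj.append(x)
    return traj

# Toy usage: minimize f(x) = x^2 (gradient 2x) starting from x = 3.
traj = adam_trajectory(lambda x: 2 * x, 3.0)
```

Plotting `traj` over a contour of the objective is the usual way such a visualiser shows the optimizer's path.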
Spike_And_Slab
Implementation of Gibbs sampling for spike and slab priors
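As a sketch of the technique this repo implements, here is a Gibbs sampler for a spike-and-slab prior in the simplest normal-means setting (y_i = beta_i + noise, beta_i either exactly zero or drawn from a Gaussian slab); the model, variances, and prior inclusion probability are illustrative assumptions, not taken from the repo:

```python
import numpy as np

def spike_slab_gibbs(y, n_iter=500, sigma2=1.0, tau2=10.0, pi=0.5, seed=0):
    """Gibbs sampling for a spike-and-slab normal-means model.

    Model: y_i ~ N(beta_i, sigma2), beta_i = z_i * b_i,
           b_i ~ N(0, tau2), z_i ~ Bernoulli(pi).
    Returns the posterior inclusion probability of each coordinate.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    n = len(y)
    z_counts = np.zeros(n)
    for _ in range(n_iter):
        # Sample z_i with beta_i integrated out: compare the marginal
        # likelihoods N(y; 0, sigma2 + tau2) (slab) vs N(y; 0, sigma2) (spike).
        log_p1 = (np.log(pi) - 0.5 * np.log(2 * np.pi * (sigma2 + tau2))
                  - y ** 2 / (2 * (sigma2 + tau2)))
        log_p0 = (np.log(1 - pi) - 0.5 * np.log(2 * np.pi * sigma2)
                  - y ** 2 / (2 * sigma2))
        p1 = 1.0 / (1.0 + np.exp(log_p0 - log_p1))
        z = (rng.random(n) < p1).astype(int)
        # Sample beta_i | z_i, y_i: conjugate Gaussian posterior if included.
        post_var = sigma2 * tau2 / (sigma2 + tau2)
        post_mean = y * tau2 / (sigma2 + tau2)
        beta = np.where(z == 1,
                        rng.normal(post_mean, np.sqrt(post_var)), 0.0)
        z_counts += z
    return z_counts / n_iter

# Toy usage: small observations should get low inclusion probability,
# large ones high.
probs = spike_slab_gibbs([0.1, -0.2, 5.0, 6.0])
```

Because the coordinates are independent here, integrating beta out when sampling z makes the chain mix immediately; in a regression setting the conditionals would couple across coordinates.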
Spatial_Transformer
Added the Adam optimiser and ReLU units to LeNet; now trying to make the Spatial Transformer layer work
LBM2
Initial commit