James Campbell's repositories
llama-lying
Code for our paper "Localizing Lying in Llama"
cornell-ml-kaggle-winner
My winning submission (1st out of 155 participants) to Cornell's ML Kaggle competition
iti_capstone
Analyzing truth representations in LLMs across different kinds of truth and intervening on their hidden states to make LLMs more truthful
Language:Jupyter Notebook000
NLP-brain-biased-robustness
CS 6740 term project: "CereBERTo: Improving Distributional Robustness with Brain-Like Language Representations"
representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
Language:Jupyter NotebookMIT000
MCTSr
A quick implementation of "Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B"
Language:Python000
Language:Python000
TransformerLens
TransformerLens
Language:PythonMIT000