Kosti's repositories
setfit-integrated-gradients
Hacking SetFit so that it works with integrated gradients.
mlx-examples
Examples in the MLX framework
mygithubpage
My website, served with jekyll.
alignment-handbook
Robust recipes to align language models with human and AI preferences
chess_llm_interpretability
Evaluating an LLM trained on chess PGN strings using techniques from the Othello World Models paper.
diff_history
[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
diffuse-distributions
Forcing Diffuse Distributions out of Language Models
LLM-LieDetector
Code for the paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
mamba.py
An efficient Mamba implementation in PyTorch and MLX.
mlx
MLX: An array framework for Apple silicon
pdf-renamer-server
A python tool to automatically rename the pdf files of scientific publications by looking up the publication metadata on the web.
posteriors
Uncertainty quantification with PyTorch
starting-kit
Starting kit for the NeurIPS 2023 unlearning challenge
streamlit-chess
Bi-directional component to play chess in streamlit apps
time_interpret
Unified Model Interpretability Library for Time Series
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.