Kosti's repositories
machine-learning-zettelkasten
Insert backlinks to markdown documents by using sklearn and cosine similarity.
mlx-examples
Examples in the MLX framework
Linear-Token-Predictor
This is a reproduction of the model used in Malach, E., 2023. Auto-regressive next-token predictors are universal learners. arXiv preprint arXiv:2309.06979, Section 4.1.
Things-AI
An experiment in having LLM support for the Things 3 todo app.
alignment-handbook
Robust recipes to align language models with human and AI preferences
diff_history
[arXiv preprint 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
diffuse-distributions
Forcing Diffuse Distributions out of Language Models
DSPy-Text2SQL
DSPY on action with OpenSource LLMs.
entropix
Entropy Based Sampling and Parallel CoT Decoding
LESS
Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning
llm-autoeval
Automatically evaluate your LLMs in Google Colab
LLMRank
PageRank for LLMs
outlines
Structured Text Generation
pdf-renamer-server
A python tool to automatically rename the pdf files of scientific publications by looking up the publication metadata on the web.
posteriors
Uncertainty quantification with PyTorch
rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
SAE-based-representation-engineering
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
thermox
Exact OU processes with JAX
TransformerLens
A library for mechanistic interpretability of GPT-style language models
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.