tomekkorbak

Tomek Korbak's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

1526700

gpt-repository-loader

Convert code repos into an LLM prompt-friendly format. Mostly built by GPT-4.

Language:PythonMIT248700

autoanki

Automatically create Anki cards from text using language models

Language:PythonMIT1000

inspect_ai

Inspect: A framework for large language model evaluations

Language:PythonMIT40500

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonMIT1788800

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause130000

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonMIT1191600

posteriors

Uncertainty quantification with PyTorch

Language:PythonApache-2.027800

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonApache-2.0154300

machine-learning-list

A curriculum for learning about foundation models, from scratch to the frontier

79200

min-max-gpt

Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training

Language:Python10100

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonApache-2.014700

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookApache-2.085300

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter Notebook1041000

CPM

Language:Python100

transformer-debugger

Language:PythonMIT397200

allms

A versatile and powerful library designed to streamline the process of querying LLMs

Language:PythonApache-2.06700

devinterp

Tools for studying developmental interpretability in neural networks.

Language:Python5200

pit-38-usd-schwab-calculator

Language:PythonMIT100

pybefit

Probabilistic inference for models of behaviour

Language:PythonMIT900

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

85800