sashank06

Sashank Santhanam's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT60790 514 3267

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT34196 354 300

engineering-blogs

A curated list of engineering blogs

Language:Ruby29746 1020 90

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookNOASSERTION20592 236 50

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonMIT19333 255 71

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonApache-2.014684 112 155

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonNOASSERTION7940 78 492

metaseq

Repo for external large-scale work

Language:PythonMIT6422 109 292

mlx-examples

Examples in the MLX framework

Language:PythonMIT5463 59 389

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonApache-2.04822 79 74

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04166 40 163

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookMIT3786 70 4

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookApache-2.03673 61 94

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language:PythonMIT2954 63 315

sdsl-lite

Succinct Data Structure Library 2.0

Language:C++NOASSERTION2182 118 199

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonApache-2.02123 26 54

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonMIT913 16 40

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonApache-2.0836 17 62

prize

A prize for finding tasks that cause large language models to show inverse scaling

CC-BY-4.0585 27 7

diffuzers

a web ui & api for 🤗 diffusers

Language:PythonApache-2.0578 6 25

alexa-teacher-models

Language:PythonApache-2.0362 36 7

SEAL

Search Engines with Autoregressive Language models

Language:PythonNOASSERTION272 7 13

Converse

Language:PythonBSD-3-Clause129 10 1

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language:PythonMIT127 5 1

Berkeley-Crossword-Solver

ACL 2022

Language:PythonMIT122 5 7

GEM-metrics

Automatic metrics for GEM tasks

Language:PythonMIT59 3 61

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language:PythonMIT53 2 3

FaithDial

Language:PythonMIT48 7 2

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language:Python14 20

raph

RAPH - Reinforcement Agent Playing netHack

Language:PythonMIT3 20