sashank06

Sashank Santhanam's starred repositories

llama.cpp

LLM inference in C/C++

Language: C++ | MIT | 62278 522 3441

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | MIT | 35090 353 305

engineering-blogs

A curated list of engineering blogs

Language: Ruby | 30379 1026 90

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | NOASSERTION | 23270 265 62

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | MIT | 19537 255 72

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language: Python | Apache-2.0 | 14705 112 155

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | NOASSERTION | 8089 79 499

metaseq

Repo for external large-scale work

Language: Python | MIT | 6440 111 292

mlx-examples

Examples in the MLX framework

Language: Python | MIT | 5583 59 412

CodeGen

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language: Python | Apache-2.0 | 4856 79 74

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | Apache-2.0 | 4246 42 173

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language: Jupyter Notebook | MIT | 3993 73 4

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language: Jupyter Notebook | Apache-2.0 | 3724 61 97

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language: Python | MIT | 3026 65 318

sdsl-lite

Succinct Data Structure Library 2.0

Language: C++ | NOASSERTION | 2190 119 199

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | Apache-2.0 | 2136 26 54

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language: Python | MIT | 917 15 40

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language: Python | Apache-2.0 | 859 18 67

prize

A prize for finding tasks that cause large language models to show inverse scaling

CC-BY-4.0 | 586 27 7

diffuzers

a web ui & api for 🤗 diffusers

Language: Python | Apache-2.0 | 583 6 25

alexa-teacher-models

Language: Python | Apache-2.0 | 362 36 7

SEAL

Search Engines with Autoregressive Language models

Language: Python | NOASSERTION | 273 7 13

Converse

Language: Python | BSD-3-Clause | 129 10 1

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language: Python | MIT | 127 5 1

Berkeley-Crossword-Solver

ACL 2022

Language: Python | MIT | 123 5 7

GEM-metrics

Automatic metrics for GEM tasks

Language: Python | MIT | 60 3 61

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language: Python | MIT | 54 2 3

FaithDial

Language: Python | MIT | 48 7 2

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language: Python | 14 20

raph

RAPH - Reinforcement Agent Playing netHack

Language: Python | MIT | 3 20