Sashank Santhanam (sashank06)

Location: Charlotte, NC

Home Page: https://sashank06.github.io

Sashank Santhanam's starred repositories

llama.cpp

LLM inference in C/C++

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 34196 | Issues: 354 | Issues: 300

engineering-blogs

A curated list of engineering blogs

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 20592 | Issues: 236 | Issues: 50

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | License: MIT | Stargazers: 19333 | Issues: 255 | Issues: 71

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language: Python | License: Apache-2.0 | Stargazers: 14684 | Issues: 112 | Issues: 155

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 7940 | Issues: 78 | Issues: 492

metaseq

Repo for external large-scale work

Language: Python | License: MIT | Stargazers: 6422 | Issues: 109 | Issues: 292

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | Stargazers: 5463 | Issues: 59 | Issues: 389

CodeGen

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language: Python | License: Apache-2.0 | Stargazers: 4822 | Issues: 79 | Issues: 74

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4166 | Issues: 40 | Issues: 163

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language: Jupyter Notebook | License: MIT | Stargazers: 3786 | Issues: 70 | Issues: 4

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3673 | Issues: 61 | Issues: 94

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language: Python | License: MIT | Stargazers: 2954 | Issues: 63 | Issues: 315

sdsl-lite

Succinct Data Structure Library 2.0

Language: C++ | License: NOASSERTION | Stargazers: 2182 | Issues: 118 | Issues: 199

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | Stargazers: 2123 | Issues: 26 | Issues: 54

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language: Python | License: MIT | Stargazers: 913 | Issues: 16 | Issues: 40

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language: Python | License: Apache-2.0 | Stargazers: 836 | Issues: 17 | Issues: 62

prize

A prize for finding tasks that cause large language models to show inverse scaling

diffuzers

a web ui & api for 🤗 diffusers

Language: Python | License: Apache-2.0 | Stargazers: 578 | Issues: 6 | Issues: 25

SEAL

Search Engines with Autoregressive Language models

Language: Python | License: NOASSERTION | Stargazers: 272 | Issues: 7 | Issues: 13
Language: Python | License: BSD-3-Clause | Stargazers: 129 | Issues: 10 | Issues: 1

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language: Python | License: MIT | Stargazers: 127 | Issues: 5 | Issues: 1

GEM-metrics

Automatic metrics for GEM tasks

Language: Python | License: MIT | Stargazers: 59 | Issues: 3 | Issues: 61

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language: Python | License: MIT | Stargazers: 53 | Issues: 2 | Issues: 3
Language: Python | License: MIT | Stargazers: 48 | Issues: 7 | Issues: 2

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language: Python | Stargazers: 14 | Issues: 2 | Issues: 0

raph

RAPH - Reinforcement Agent Playing netHack

Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0