Sashank Santhanam (sashank06)

sashank06

Geek Repo

Location:Charlotte, NC

Home Page:https://sashank06.github.io

Github PK Tool:Github PK Tool

Sashank Santhanam's starred repositories

llama.cpp

LLM inference in C/C++

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35090Issues:353Issues:305

engineering-blogs

A curated list of engineering blogs

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:23270Issues:265Issues:62

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:19537Issues:255Issues:72

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14705Issues:112Issues:155

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8089Issues:79Issues:499

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6440Issues:111Issues:292

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5583Issues:59Issues:412

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4856Issues:79Issues:74

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4246Issues:42Issues:173

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookLicense:MITStargazers:3993Issues:73Issues:4

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3724Issues:61Issues:97

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language:PythonLicense:MITStargazers:3026Issues:65Issues:318

sdsl-lite

Succinct Data Structure Library 2.0

Language:C++License:NOASSERTIONStargazers:2190Issues:119Issues:199

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2136Issues:26Issues:54

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:917Issues:15Issues:40

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:859Issues:18Issues:67

prize

A prize for finding tasks that cause large language models to show inverse scaling

diffuzers

a web ui & api for 🤗 diffusers

Language:PythonLicense:Apache-2.0Stargazers:583Issues:6Issues:25

SEAL

Search Engines with Autoregressive Language models

Language:PythonLicense:NOASSERTIONStargazers:273Issues:7Issues:13
Language:PythonLicense:BSD-3-ClauseStargazers:129Issues:10Issues:1

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language:PythonLicense:MITStargazers:127Issues:5Issues:1

GEM-metrics

Automatic metrics for GEM tasks

Language:PythonLicense:MITStargazers:60Issues:3Issues:61

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language:PythonLicense:MITStargazers:54Issues:2Issues:3
Language:PythonLicense:MITStargazers:48Issues:7Issues:2

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language:PythonStargazers:14Issues:2Issues:0

raph

RAPH - Reinforcement Agent Playing netHack

Language:PythonLicense:MITStargazers:3Issues:2Issues:0