Sashank Santhanam (sashank06)

sashank06

Geek Repo

Location:Charlotte, NC

Home Page:https://sashank06.github.io

Github PK Tool:Github PK Tool

Sashank Santhanam's starred repositories

llama.cpp

LLM inference in C/C++

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:32484Issues:346Issues:294

engineering-blogs

A curated list of engineering blogs

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:19078Issues:255Issues:70

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:16954Issues:204Issues:39

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14663Issues:111Issues:155

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7740Issues:75Issues:481

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6404Issues:108Issues:292

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5209Issues:58Issues:362

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4798Issues:79Issues:74

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4079Issues:41Issues:153

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3614Issues:60Issues:91

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookLicense:MITStargazers:3447Issues:68Issues:3

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language:PythonLicense:MITStargazers:2890Issues:63Issues:311

sdsl-lite

Succinct Data Structure Library 2.0

Language:C++License:NOASSERTIONStargazers:2178Issues:118Issues:199

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2101Issues:26Issues:54

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:899Issues:16Issues:40

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:803Issues:17Issues:58

prize

A prize for finding tasks that cause large language models to show inverse scaling

diffuzers

a web ui & api for 🤗 diffusers

Language:PythonLicense:Apache-2.0Stargazers:573Issues:6Issues:25

SEAL

Search Engines with Autoregressive Language models

Language:PythonLicense:NOASSERTIONStargazers:272Issues:6Issues:13
Language:PythonLicense:BSD-3-ClauseStargazers:128Issues:10Issues:1

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language:PythonLicense:MITStargazers:127Issues:5Issues:1

GEM-metrics

Automatic metrics for GEM tasks

Language:PythonLicense:MITStargazers:59Issues:3Issues:61
Language:PythonLicense:MITStargazers:47Issues:6Issues:1

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language:PythonLicense:MITStargazers:45Issues:2Issues:3

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language:PythonStargazers:13Issues:0Issues:0

raph

RAPH - Reinforcement Agent Playing netHack

Language:PythonLicense:MITStargazers:3Issues:2Issues:0