Sashank Santhanam (sashank06)

sashank06

Geek Repo

Location:Charlotte, NC

Home Page:https://sashank06.github.io

Github PK Tool:Github PK Tool

Sashank Santhanam's starred repositories

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:13913Issues:0Issues:0

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:757Issues:0Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:3925Issues:0Issues:0

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:4863Issues:0Issues:0

engineering-blogs

A curated list of engineering blogs

Language:RubyStargazers:29066Issues:0Issues:0

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:882Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:55784Issues:0Issues:0

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language:PythonLicense:MITStargazers:2798Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2081Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:18768Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7523Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:31601Issues:0Issues:0

diffuzers

a web ui & api for 🤗 diffusers

Language:PythonLicense:Apache-2.0Stargazers:569Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:363Issues:0Issues:0

prize

A prize for finding tasks that cause large language models to show inverse scaling

License:CC-BY-4.0Stargazers:581Issues:0Issues:0

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14634Issues:0Issues:0

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language:PythonStargazers:13Issues:0Issues:0
Language:PythonLicense:MITStargazers:120Issues:0Issues:0

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6382Issues:0Issues:0
Language:PythonLicense:MITStargazers:47Issues:0Issues:0

sdsl-lite

Succinct Data Structure Library 2.0

Language:C++License:NOASSERTIONStargazers:2174Issues:0Issues:0

SEAL

Search Engines with Autoregressive Language models

Language:PythonLicense:NOASSERTIONStargazers:269Issues:0Issues:0

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3538Issues:0Issues:0

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language:PythonLicense:MITStargazers:127Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:128Issues:0Issues:0

GEM-metrics

Automatic metrics for GEM tasks

Language:PythonLicense:MITStargazers:58Issues:0Issues:0

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4761Issues:0Issues:0

raph

RAPH - Reinforcement Agent Playing netHack

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookLicense:MITStargazers:3266Issues:0Issues:0