Sashank Santhanam (sashank06)

sashank06

Geek Repo

Location:Charlotte, NC

Home Page:https://sashank06.github.io

Github PK Tool:Github PK Tool

Sashank Santhanam's starred repositories

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:20033Issues:0Issues:0

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:832Issues:0Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4153Issues:0Issues:0

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:5445Issues:0Issues:0

engineering-blogs

A curated list of engineering blogs

Language:RubyStargazers:29720Issues:0Issues:0

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language:PythonLicense:MITStargazers:913Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:60552Issues:0Issues:0

esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Language:PythonLicense:MITStargazers:2942Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2124Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:19290Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7915Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:34019Issues:0Issues:0

diffuzers

a web ui & api for 🤗 diffusers

Language:PythonLicense:Apache-2.0Stargazers:578Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:362Issues:0Issues:0

prize

A prize for finding tasks that cause large language models to show inverse scaling

License:CC-BY-4.0Stargazers:585Issues:0Issues:0

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language:PythonLicense:Apache-2.0Stargazers:14681Issues:0Issues:0

duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Language:PythonStargazers:14Issues:0Issues:0
Language:PythonLicense:MITStargazers:122Issues:0Issues:0

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6419Issues:0Issues:0
Language:PythonLicense:MITStargazers:48Issues:0Issues:0

sdsl-lite

Succinct Data Structure Library 2.0

Language:C++License:NOASSERTIONStargazers:2181Issues:0Issues:0

SEAL

Search Engines with Autoregressive Language models

Language:PythonLicense:NOASSERTIONStargazers:272Issues:0Issues:0

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3666Issues:0Issues:0

grafog

Graph Data Augmentation Library for PyTorch Geometric

Language:PythonLicense:MITStargazers:127Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:128Issues:0Issues:0

GEM-metrics

Automatic metrics for GEM tasks

Language:PythonLicense:MITStargazers:59Issues:0Issues:0

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4818Issues:0Issues:0

raph

RAPH - Reinforcement Agent Playing netHack

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookLicense:MITStargazers:3692Issues:0Issues:0