Aaryan Singhal's starred repositories

llama.cpp

LLM inference in C/C++

Bend

A massively parallel, high-level programming language

Language: Rust | License: Apache-2.0 | Stargazers: 16916 | Issues: 92 | Issues: 203

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | Stargazers: 6740 | Issues: 65 | Issues: 22

gemma.cpp

Lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 5823 | Issues: 38 | Issues: 77

fairscale

PyTorch extensions for high performance and large scale training.

Language: Python | License: NOASSERTION | Stargazers: 3077 | Issues: 45 | Issues: 358

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1722 | Issues: 17 | Issues: 25

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | Stargazers: 1413 | Issues: 25 | Issues: 20

consistency-policy

[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation

Language: Python | License: MIT | Stargazers: 82 | Issues: 2 | Issues: 3

readings

This is a list of readings for CS348K.

nanoGPT-TK

The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!

Language: Makefile | License: MIT | Stargazers: 44 | Issues: 3 | Issues: 0

madrona_escape_room_pixact

Co-optimizing the design and execution of ML architectures that enable end-to-end (raw pixel inputs, agent action outputs) RL algorithms to quickly learn effective game-playing policies on the Madrona game engine

Language: C++ | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0