Aaryan0404

followers

following

stars

Palo Alto

https://www.aaryan-singhal.com/

Aaryan Singhal's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT62575 525 3459

Bend

A massively parallel, high-level programming language

Language:RustApache-2.016916 92 203

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilog6740 65 22

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++Apache-2.05823 38 77

fairscale

PyTorch extensions for high performance and large scale training.

Language:PythonNOASSERTION3077 45 358

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonApache-2.01722 17 25

ThunderKittens

Tile primitives for speedy kernels

Language:CudaMIT1413 25 20

consistency-policy

[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation

Language:PythonMIT82 2 3

readings

This is a list of readings for CS348K.

nanoGPT-TK

The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!

Language:MakefileMIT44 30

madrona_escape_room_pixact

co-optimizing the design and execution of ml architectures that enables end-to-end (raw pixel inputs, agent action outputs) rl algorithms to quickly learn effective game-playing policies on the madrona game engine

Language:C++MIT200