eqy's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 34892 | Issues: 354 | Issues: 304

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python | License: MIT | Stargazers: 25194 | Issues: 264 | Issues: 670

pytorch_geometric

Graph Neural Network Library for PyTorch

Language: Python | License: MIT | Stargazers: 20637 | Issues: 252 | Issues: 3471

discord.py

An API wrapper for Discord written in Python.

Language: Python | License: MIT | Stargazers: 14536 | Issues: 262 | Issues: 2913

volkswagen

🙈 Volkswagen detects when your tests are being run in a CI server, and makes them pass.

Language: JavaScript | License: MIT | Stargazers: 14106 | Issues: 95 | Issues: 33

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 12009 | Issues: 135 | Issues: 196

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9082 | Issues: 108 | Issues: 80

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | License: Apache-2.0 | Stargazers: 8830 | Issues: 86 | Issues: 689

nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

Language: C | License: NOASSERTION | Stargazers: 7763 | Issues: 78 | Issues: 232

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | License: MIT | Stargazers: 5410 | Issues: 27 | Issues: 28

min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Language: Python | License: MIT | Stargazers: 3483 | Issues: 25 | Issues: 76

KataGo

GTP engine and self-play learning in Go

Language: C++ | License: NOASSERTION | Stargazers: 3364 | Issues: 77 | Issues: 776

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1657 | Issues: 37 | Issues: 269

Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Language: Python | License: MIT | Stargazers: 916 | Issues: 15 | Issues: 40

hidet

An open-source, efficient deep learning framework/compiler, written in Python.

Language: Python | License: Apache-2.0 | Stargazers: 632 | Issues: 17 | Issues: 81

nvbench

CUDA Kernel Benchmarking Library

Language: Cuda | License: Apache-2.0 | Stargazers: 450 | Issues: 18 | Issues: 89

Awesome-GPU

Awesome resources for GPUs

License: BSD-3-Clause | Stargazers: 441 | Issues: 23 | Issues: 0

paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Language: Python | License: Apache-2.0 | Stargazers: 435 | Issues: 17 | Issues: 19

Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Language: C++ | License: NOASSERTION | Stargazers: 240 | Issues: 19 | Issues: 557

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language: Python | License: NOASSERTION | Stargazers: 228 | Issues: 5 | Issues: 15

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: C++ | License: NOASSERTION | Stargazers: 26 | Issues: 5 | Issues: 682

mntm

modular neural turing machines

License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Cuda | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

OSDP-public

Composable + Tunable = Optimal

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

gen

graph generation and analysis stuff

License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

pin

A Pin

License: Unlicense | Stargazers: 1 | Issues: 1 | Issues: 0

torchrec

PyTorch domain library for recommendation systems

Language: Python | License: BSD-3-Clause | Stargazers: 1 | Issues: 1 | Issues: 0