eqy's starred repositories
pytorch_geometric
Graph Neural Network Library for PyTorch
discord.py
An API wrapper for Discord written in Python.
volkswagen
🙈 Volkswagen detects when your tests are being run in a CI server, and makes them pass.
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
GPU-Puzzles
Solve puzzles. Learn CUDA.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Awesome-GPU
Awesome resources for GPUs
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
OSDP-public
Composable + Tunable = Optimal