humahuma's repositories
awesome
😎 Awesome lists about all kinds of interesting topics
cccl
CUDA Core Compute Libraries
FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
gdev
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
hidet
An open-source efficient deep learning framework/compiler, written in python.
iree-turbine
IREE's PyTorch Frontend, based on Torch Dynamo.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
ROCK-Kernel-Driver
AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver
workshops
This is a repository for all workshop related materials.
xla
Enabling PyTorch on Google TPU
torch-ccl
oneCCL Bindings for Pytorch*
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
xv6-riscv
Xv6 for RISC-V