There are 215 repositories under cuda topic.
A high-throughput and memory-efficient inference and serving engine for LLMs
Build and run Docker containers leveraging NVIDIA GPUs
Instant neural graphics primitives: lightning fast NeRF and more
Modular ZK(Zero Knowledge) backend accelerated by GPU
Go package for computer vision using OpenCV 4 and beyond. Includes support for DNN, CUDA, and OpenCV Contrib.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
SGLang is a fast serving framework for large language models and vision language models.
A PyTorch Library for Accelerating 3D Deep Learning Research
Lightning fast C++/CUDA neural network framework
Fast inference engine for Transformer models