AMD ROCm™ Software's repositories
rocBLAS-Examples
Examples illustrating usage of the rocBLAS library
rocm-spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
distributed
A distributed task scheduler for Dask
OpenLLM
Operating LLMs in production
nccl-rccl-parser
Tool to run rccl-tests/nccl-tests based on from an application
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
shap
A game theoretic approach to explain the output of any machine learning model.
rocgputreeshap
ROCm support for GPUTreeShap
gloo
Collective communications library with various primitives for multi-machine training.
tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
rocGemmDriver
rocGemmDriver
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
ROCm-OpenCL-Runtime
ROCm OpenOpenCL Runtime
rocm-recipes
Recipes for rocm
frugally-deep
Header-only library for using Keras (TensorFlow) models in C++.
ClassyVision
An end-to-end PyTorch framework for image and video classification