terrychenism / AITemplate_OSS

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

terrychenism/AITemplate_OSS Issues

No issues in this repository yet.