zjing14's repositories

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:C++Stargazers:1Issues:0Issues:0
Language:C++Stargazers:1Issues:0Issues:0

DeepBench

Benchmarking Deep Learning operations on different hardware

Language:C++License:Apache-2.0Stargazers:1Issues:0Issues:0