Leyuan Wang's starred repositories
matxscript
A high-performance, extensible Python AOT compiler.
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
relay-bench
A repository containing examples and benchmarks for Relay.
ICCV19-GluonCV
Tutorial Materials for ICCV19
custom_op_benchmark
naive graph attention kernels
PlotNeuralNet
Latex code for making neural networks diagrams