Yang's repositories
scalehls
A scalable High-Level Synthesis framework on MLIR
buddy-benchmark
Benchmark Framework for Buddy Projects
YBTorchCompiler
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Democratizing-CGRA-Research
Lots of useful and basic knowledge for the CGRA research.
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
WingTsun
This is a repo for the development of WingTsun Lang!
taichi
Productive & portable high-performance programming in Python.
tvm_mlir_learn
tvm learn
bfsh
Being a full-stack hacker, RISCV, LLVM, and more.
cutlass
CUDA Templates for Linear Algebra Subroutines
vision
Datasets, Transforms and Models specific to Computer Vision
config-files
A collection of my config files.
nimble
Lightweight and Parallel Deep Learning Framework
ppl.nn
A primitive library for neural network
SMore-Graph
This is a repo for developing a NN compiler in graph-level optimization
TensorCoreExperiment
This is a repo for my TensorCore experiment
TransformerAccExperiments
These are all my experimental code about the acceleration of Transformer in DETR
inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
ToolsSeminar-CS
Seminar on selected tools in Computer Science