High Performance Cross-Stack Optimization (HiPO)'s repositories
dlcompiler-comparison
The quantitative performance comparison among DL compilers on CNN models.
DynVec-artifact
Evaluation scripts for DynVec
MSC-stencil-compiler
The code of paper "Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core" Processors.
swCholesky
Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture
Atrec
AtRec: Accelerating Recommendation Model Training on CPUs
Language:C++000
CTStencil
The code of paper "Adapting Combined Tiling to Stencil Optimizations on Sunway Processor".
Language:C000
rvv_conv2d
Conv2d benchmark for RISC-V Vector Extension
intelligent-unroll
Implementation of DynVec
rodinia_3.1_SW
rodinia_3.1 benchmark for SW
Language:C000