Ruobing Han's repositories
NN_transform
Trans different platform's network to International Representation(IR)
mmdetection
Open MMLab Detection Toolbox and Benchmark
CompilerGym
Reinforcement learning environments for compiler and program optimization tasks
CuPBoP-JIT
A framework that support executing unmodified CUDA source code on non-NVIDIA devices.
fasterstreamline
Streamline Covert Channel Attack (presented in ASPLOS'21)
gpu-rodinia
Rodinia benchmark
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
pytorch-cifar
95.16% on CIFAR10 with PyTorch
releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
vector_addition_cuda
A simple CUDA vector addition program