Sheng Qin's starred repositories
CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
tvm_mlir_learn
compiler learning resources collect.
CPlusPlusThings
C++那些事
Paddle-Lite
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
openvino_notebooks
📚 Jupyter notebook tutorials for OpenVINO™
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
ColossalAI
Making large AI models cheaper, faster and more accessible
CS149-parallel-computing
Learning materials for Stanford CS149 : Parallel Computing
deepx_core
deepx_core是一个专注于张量计算/深度学习的基础库
CollapseNav.Net.Tool-Doc
Doc of CollapseNav.Net.Tool
erkaman.github.io
The source code of my website.
Benchmark_SpGEMM_using_CSR
CSR-based SpGEMM on nVidia and AMD GPUs
C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.