HanBing Guo's starred repositories
geektime-books
:books: 极客时间电子书
Cpp-Templates-2ed
C++11/14/17/20 templates and generic programming, the most complex and difficult technical details of C++, indispensable in building infrastructure libraries.
mlir-tutorial
MLIR For Beginners tutorial
trtllm-llama
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ComputerArchitectureAndCppBooks
📚 计算机体系结构与C++书籍收集(持续更新)
code_generator
Simple and straightforward code generator for creating program code. At the moment offers support for C++, Java and HTML5 for generating reports.
tvm_mlir_learn
compiler learning resources collect.
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
Point-Cloud-Processing-example
点云库PCL从入门到精通 书中配套案例