topic | 主题 | 备注 |
---|---|---|
overview | 概述 | |
layout | 内存布局 | |
compute_graph_optimize | 计算图优化 | |
dynamic_shape | 动态shape | |
plugin | 插件 | |
calibration | 标定 | |
asp | 稀疏 | |
qat | 量化感知训练 | |
trtexec | 辅助工具 | |
runtime | 运行时 | |
inferflow | 模型调度 | |
mps | MPS | |
deploy | 基于onnx部署流程, trt 工具使用 | |
py-tensorrt | python tensorrt封装 | 解析 tensorrt __init__ |
cookbook | 食谱 | |
developer_guide | 开发者指导 |
tensorrt各版本迁移说明
https://docs.nvidia.com/deeplearning/tensorrt/migration-guide/index.html
https://docs.nvidia.com/deeplearning/tensorrt/archives/
https://developer.nvidia.com/search?page=1&sort=relevance&term=
https://github.com/HeKun-NVIDIA/TensorRT-Developer_Guide_in_Chinese/tree/main
https://developer.nvidia.com/zh-cn/blog/nvidia-gpu-fp8-training-inference/