q yao's repositories

mmdetection-to-tensorrt

convert mmdetection model to tensorrt, support fp16, int8, batch input, dynamic shape etc.

Language:PythonLicense:Apache-2.0Stargazers:598Issues:13Issues:109

torch2trt_dynamic

A pytorch to tensorrt convert with dynamic shape support

Language:PythonLicense:MITStargazers:267Issues:5Issues:32

amirstan_plugin

Useful tensorrt plugin. For pytorch and mmdetection model conversion.

Language:C++License:MITStargazers:165Issues:4Issues:28

TorchMPSCustomOpsDemo

A demo about how to add custom MPS ops in PyTorch.

Language:C++License:Apache-2.0Stargazers:9Issues:2Issues:0

mmcv

OpenMMLab Computer Vision Foundation

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

coremltools

Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepEP

DeepEP: an efficient expert-parallel communication library

Language:CudaLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

effective-debugging-zh

effective debugging 中文翻译

Stargazers:0Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

grimoire.github.io

My github pages website

Language:ShellLicense:CC-BY-SA-4.0Stargazers:0Issues:1Issues:4
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MegEngine

MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mmrotate

OpenMMLab Rotated Object Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mmyolo

OpenMMLab YOLO series toolbox and benchmark

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx

Open standard for machine learning interoperability

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

ppl.nn

A primitive library for neural network

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pybind11

Seamless operability between C++11 and Python

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

SimpleNES

An NES emulator in C++

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:1Issues:7

the-art-of-debugging

The Art of Debugging

Language:CLicense:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:0Issues:0

Triton-distributed

Distributed Triton for Parallel Systems

License:MITStargazers:0Issues:0Issues:0