zhangxs's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
tvm_mlir_learn
A collection of compiler learning resources.
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code, covering TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
py-motmetrics
📊 Benchmark multiple object trackers (MOT) in Python
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
FaceStudio
Put Your Face Everywhere in Seconds.
TensorRT_Inference_Demo
A repo that uses TensorRT to deploy well-trained models. Supports RTDETR, YOLO-NAS, YOLOV5, YOLOV6, YOLOV7, YOLOV8, YOLOX.
yolox-pytorch
An easy PyTorch re-implementation of "YOLOX: Exceeding YOLO Series in 2021"
Effective-Fusion-Factor
Effective Fusion Factor in FPN for Tiny Object Detection (WACV 2021)
u-mixformer
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
TrackDiffusion
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).