royinx's repositories
CUDA_Resize
resize image in (CUDA, python, cupy)
VideoProcessingFramework
Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions
yolov7_cupy_trt
yolov7 + cupy + trt + resize module + padding
llm_triton
LLM in Triton , Hugging Face -> Pytorch -> ONNX -> TensorRT -> Triton
simple_api_template
simple API , including Flask and FastAPI
cutlass
CUDA Templates for Linear Algebra Subroutines
CV-CUDA
CV-CUDA™ is an open-source, graphics processing unit (GPU)-accelerated library for cloud-scale image processing and computer vision.
Nsight-Systems-Docker-Image
Nsight Systems in Docker
NeRF
Instant neural graphics primitives: lightning fast NeRF and more
kernel_tuner
Kernel Tuner
triton_ensemble_model_demo
triton server ensemble model demo
jetson_triton
Jetpack4.5 Triton server
LFD-A-Light-and-Fast-Detector
LFD is a big update upon LFFD. Generally, LFD is a multi-class object detector characterized by lightweight, low inference latency and superior precision. It is for real-world appilcations.
LFFD-A-Light-and-Fast-Face-Detector-for-Edge-Devices
A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......
trt_yolov3
tensorRT_yolov3
MSBD5001-kaggle
MSBD5001-kaggle
simple_flask_demo
blank flask template for quick start and testing
BilinearImageResize
Bilinear Image Resize with openmp/cuda
clothes_seg_trt
clothes segmentation
java_mini_build
build java mini container image
TensorRT_Deployment
Model (ONNX, Pytorch) to TensorRT inference server
color_extract
color clustering