Zhouzhou's starred repositories
interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
jetson_stats
📊 Simple package for monitoring and control your NVIDIA Jetson [Orin, Xavier, Nano, TX] series
CUDALibrarySamples
CUDA Library Samples
CUDA-Learn-Notes
🎉 Modern CUDA Learn Notes with PyTorch: fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Learn-CUDA-Programming
Learn CUDA Programming, published by Packt
rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependencies
jetson_dla_tutorial
A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson
ncnn-models
awesome AI models with NCNN, and how they were converted ✨✨✨
Parallel-Computing-Cuda-C
CUDA Learning guide
cuDLA-samples
YOLOv5 on Orin DLA
Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F
Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.
CodeFormer-ncnn
ncnn version of CodeFormer
llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
arm64-linux-debugging-disassembling-reversing
Source Code for 'Foundations of ARM64 Linux Debugging, Disassembling, and Reversing' by Dmitry Vostokov