Wayne's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
LookOnceToHear
A novel human-interaction method for real-time speech extraction on headphones.
calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
Multispectral-Pedestrian-Detection-Resource
A list of resouces for multispectral pedestrian detection,including the datasets, methods, annotations and tools.
lama-with-refiner
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
BitDistiller
[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
Fixed-Floating-Point-Adder-Multiplier
16-bit Adder Multiplier hardware on Digilent Basys 3
InspireFace
InspireFace is a cross-platform face recognition SDK developed in C/C++, supporting multiple operating systems and various backend types for inference, such as CPU, GPU, and NPU.
IWAENC-2024-Informed-FastICA
Matlab implementations of algorithms and scripts of simulations presented in Informed FastICA: Semi-Blind Minimum Variance Distortionless Beamformer