Wayne's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8276Issues:87Issues:1812

lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7874Issues:84Issues:253

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:6952Issues:68Issues:22

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6374Issues:54Issues:146

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4709Issues:44Issues:123

stable-diffusion.cpp

Stable Diffusion and Flux in pure C/C++

Language:C++License:MITStargazers:3290Issues:53Issues:242

ZeroOmega

Manage and switch between multiple proxies quickly & easily.

Language:CoffeeScriptLicense:GPL-3.0Stargazers:1412Issues:15Issues:40

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

LookOnceToHear

A novel human-interaction method for real-time speech extraction on headphones.

Language:PythonLicense:NOASSERTIONStargazers:535Issues:10Issues:2

calculate-flops.pytorch

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Language:PythonLicense:MITStargazers:499Issues:4Issues:35

T-MAC

Low-bit LLM inference on CPU with lookup table

Language:C++License:MITStargazers:453Issues:7Issues:30

MI-GAN

[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices

Language:PythonLicense:MITStargazers:451Issues:8Issues:15

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language:PythonLicense:MITStargazers:356Issues:16Issues:57

ara

The PULP Ara is a 64-bit Vector Unit, compatible with the RISC-V Vector Extension Version 1.0, working as a coprocessor to CORE-V's CVA6 core

Language:CLicense:NOASSERTIONStargazers:353Issues:22Issues:180

q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Language:PythonLicense:MITStargazers:315Issues:17Issues:39

LED

[ICCV 2023] Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denoising && [Arxiv 2023] Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

Language:PythonLicense:NOASSERTIONStargazers:309Issues:8Issues:40

I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Language:PythonLicense:MITStargazers:222Issues:4Issues:31

Multispectral-Pedestrian-Detection-Resource

A list of resouces for multispectral pedestrian detection,including the datasets, methods, annotations and tools.

I-ViT

[ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference

Language:PythonLicense:Apache-2.0Stargazers:147Issues:3Issues:12

lama-with-refiner

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:123Issues:3Issues:0

BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Language:PythonLicense:MITStargazers:71Issues:3Issues:7

Fixed-Floating-Point-Adder-Multiplier

16-bit Adder Multiplier hardware on Digilent Basys 3

Language:VerilogLicense:NOASSERTIONStargazers:62Issues:5Issues:0

InspireFace

InspireFace is a cross-platform face recognition SDK developed in C/C++, supporting multiple operating systems and various backend types for inference, such as CPU, GPU, and NPU.

FEATHER

A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching

Language:C++Stargazers:26Issues:1Issues:0

CSAPP

《深入理解计算机系统 第三版》家庭作业

Language:CStargazers:23Issues:2Issues:0
Language:CLicense:Apache-2.0Stargazers:17Issues:1Issues:1

IWAENC-2024-Informed-FastICA

Matlab implementations of algorithms and scripts of simulations presented in Informed FastICA: Semi-Blind Minimum Variance Distortionless Beamformer

Stargazers:2Issues:0Issues:0