Zhuang Zhuang's repositories
Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks
This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"
Group-Net-image-classification
Structured Binary Neural Networks for Image Recognition
Group-Net-semantic-segmentation
Structured Binary Neural Networks for Image Recognition
Fast-Training-of-Triplet-based-Deep-Binary-Embedding-Networks
Fast-Training-of-Triplet-based-Deep-Binary-Embedding-Networks
Parallel-Attention-A-Unified-Framework-for-Visual-Object-Discovery-through-Dialogs-and-Queries
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Attend-in-Groups-a-Weakly-supervised-Deep-Learning-Framework-for-Learning-from-Web-Data
Attend in Groups: a Weakly-supervised Deep Learning Framework for Learning from Web Data
model-quantization
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
Visual-Tracking-via-Discriminative-Sparse-Similarity-Map
Visual Tracking via Discriminative Sparse Similarity Map
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
autogluon
AutoGluon: AutoML Toolkit for Deep Learning
Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
cshizhe.github.io
Shizhe's homepage https://cshizhe.github.io/
fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
flashinfer
FlashInfer: Kernel Library for LLM Serving
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
incubator-tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
jekyll
:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby
jina
☁️ Build multimodal AI applications with cloud-native stack
MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
SAQ
This is the official PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".
T-Stitch
Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.