bohanzhuang

followers

following

stars

Monash University

https://bohanzhuang.github.io/

Zhuang Zhuang's repositories

Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks

This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"

Language:Python20 4 1

Group-Net-image-classification

Structured Binary Neural Networks for Image Recognition

Language:Python16 3 2

Group-Net-semantic-segmentation

Structured Binary Neural Networks for Image Recognition

Language:Python16 1 2

Fast-Training-of-Triplet-based-Deep-Binary-Embedding-Networks

Fast-Training-of-Triplet-based-Deep-Binary-Embedding-Networks

Language:C++4 20

Parallel-Attention-A-Unified-Framework-for-Visual-Object-Discovery-through-Dialogs-and-Queries

Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries

Language:Python4 2 1

Attend-in-Groups-a-Weakly-supervised-Deep-Learning-Framework-for-Learning-from-Web-Data

Attend in Groups: a Weakly-supervised Deep Learning Framework for Learning from Web Data

Language:Python2 10

bohanzhuang.github.io

Language:JavaScriptMIT2 10

DoReFa-Net-implementation

Language:Python2 10

model-quantization

Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)

Language:PythonNOASSERTION200

role-kd

Role-Wise Data Augmentation for Knowledge Distillation

Language:Python200

GDFQ

official implementation of Generative Low-bitwidth Data Free Quantization(GDFQ)

Language:Python100

Visual-Tracking-via-Discriminative-Sparse-Similarity-Map

Visual Tracking via Discriminative Sparse Similarity Map

Language:MATLAB1 10

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonApache-2.0000

autogluon

AutoGluon: AutoML Toolkit for Deep Learning

Language:PythonApache-2.0000

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

000

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonMIT000

cshizhe.github.io

Shizhe's homepage https://cshizhe.github.io/

Language:JavaScript000

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

MIT000

flashinfer

FlashInfer: Kernel Library for LLM Serving

Apache-2.0000

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs

Language:Jupyter NotebookApache-2.0000

incubator-tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonApache-2.0000

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:C++Apache-2.0000

jekyll

:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby

Language:RubyMIT000

jina

☁️ Build multimodal AI applications with cloud-native stack

Language:PythonApache-2.0000

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Apache-2.0000

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonApache-2.0000

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language:PythonMIT000

SAQ

This is the official PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".

Language:PythonApache-2.0000

T-Stitch

Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"

NOASSERTION000

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Apache-2.0000