Repositories under the quantized-neural-networks topic:
A toolkit to optimize ML models (Keras/TensorFlow) for deployment, including quantization and pruning.
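The pruning half of such a toolkit boils down to zeroing out the smallest-magnitude weights. A minimal NumPy sketch of magnitude pruning (an illustration of the idea, not the TFMOT API; `magnitude_prune` is a hypothetical helper):

```python
import numpy as np

def magnitude_prune(w: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction `sparsity` of weights."""
    k = int(sparsity * w.size)
    if k == 0:
        return w.copy()
    # k-th smallest absolute value becomes the pruning threshold
    thresh = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return np.where(np.abs(w) <= thresh, 0.0, w)

w = np.array([0.05, -0.8, 0.3, -0.02])
pruned = magnitude_prune(w, 0.5)  # zeroes the two smallest: 0.05 and -0.02
```

Real toolkits apply this gradually during fine-tuning rather than in one shot, so the network can recover accuracy as sparsity increases.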
Generates a quantization parameter file for int8 inference with the ncnn framework.
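The core of such a calibration tool is computing a per-tensor scale that maps float activations onto the int8 range. A minimal sketch using simple min-max calibration (production tools, including ncnn's, typically use more careful calibration such as KL-divergence matching; `int8_scale` and `quantize_int8` are hypothetical names):

```python
import numpy as np

def int8_scale(x: np.ndarray) -> float:
    """Per-tensor scale mapping float values into [-127, 127]."""
    max_abs = float(np.max(np.abs(x)))
    return max_abs / 127.0 if max_abs > 0 else 1.0

def quantize_int8(x: np.ndarray, scale: float) -> np.ndarray:
    """Round to the int8 grid defined by `scale` and saturate."""
    return np.clip(np.round(x / scale), -127, 127).astype(np.int8)

x = np.array([0.5, -1.27, 0.03])
s = int8_scale(x)        # ~0.01
q = quantize_int8(x, s)  # [50, -127, 3]
```

The parameter file then only needs to store one such scale per layer (or per channel); inference multiplies the int8 results back by the scale.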
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
Mobilenet v1 trained on Imagenet for STM32 using extended CMSIS-NN with INT-Q quantization support
Implementations of some recent quantization techniques in PyTorch.
Slides with modifications for a course at Tsinghua University.
This repository contains source code to binarize any real-value word embeddings into binary vectors.
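The simplest way to binarize real-valued embeddings is to threshold each dimension. A minimal sketch using a per-dimension mean threshold (the repository's actual method may be learned, e.g. autoencoder-based; `binarize_embeddings` is a hypothetical name):

```python
import numpy as np

def binarize_embeddings(emb: np.ndarray) -> np.ndarray:
    """Map each dimension to a bit: 1 if above that dimension's mean, else 0."""
    thresholds = emb.mean(axis=0)  # one threshold per embedding dimension
    return (emb > thresholds).astype(np.uint8)

emb = np.array([[0.2, -0.5],
                [0.8,  0.5],
                [-0.1, 0.3]])
bits = binarize_embeddings(emb)  # [[0, 0], [1, 1], [0, 1]]
```

Binary vectors shrink storage by roughly 32x and make similarity search a cheap Hamming-distance computation.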
This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory constraints of the target device.
Binary neural networks developed by Huawei Noah's Ark Lab
Low-precision (quantized) YOLOv5.
Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.
Code implementation of our AISTATS'21 paper "Mirror Descent View for Neural Network Quantization"
Mobilenet v1 (3,160,160, alpha=0.25, and 3,192,192, alpha=0.5) on STM32H7 using X-CUBE-AI v4.1.0
Efficient Neural Architecture Search coupled with quantized CNNs to search for resource-efficient and accurate architectures.
Modeling stuck-at faults for RRAM inference on popular neural networks after quantization
Code implementation of our AAAI'22 paper "Improved Gradient-Based Adversarial Attacks for Quantized Networks"
Quantized training using Keras
Exercises on HW acceleration of quantized neural networks for the course Integrated Systems Architecture at PoliTo
A Python-based utility to convert a grayscale image into Verilog code.
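A utility like this essentially emits a Verilog ROM whose contents are the pixel values. A minimal sketch generating a case-statement ROM for 8-bit grayscale pixels (the repository's actual output format may differ; `image_to_verilog_rom` is a hypothetical name):

```python
def image_to_verilog_rom(pixels, name="image_rom"):
    """Emit a Verilog case-statement ROM holding 8-bit grayscale pixels."""
    addr_bits = max(1, (len(pixels) - 1).bit_length())
    lines = [
        f"module {name}(input [{addr_bits - 1}:0] addr, output reg [7:0] data);",
        "  always @(*) case (addr)",
    ]
    for i, p in enumerate(pixels):
        lines.append(f"    {addr_bits}'d{i}: data = 8'd{p};")
    lines.append("    default: data = 8'd0;")
    lines.append("  endcase")
    lines.append("endmodule")
    return "\n".join(lines)

print(image_to_verilog_rom([0, 128, 255]))
```

For real images one would flatten the pixel array row-major and read it back with a row/column-to-address computation in hardware.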
An implementation of the VQ-VAE algorithm on the MNIST and CIFAR-10 datasets, using both MSE and NLL losses.
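The defining step of VQ-VAE is snapping each encoder output to its nearest codebook vector. A minimal NumPy sketch of that quantization step alone (encoder, decoder, and loss terms omitted; `vector_quantize` is a hypothetical name):

```python
import numpy as np

def vector_quantize(z: np.ndarray, codebook: np.ndarray):
    """Assign each row of z to its nearest codebook entry (squared L2)."""
    d = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    idx = d.argmin(axis=1)
    return codebook[idx], idx

codebook = np.array([[0.0, 0.0], [1.0, 1.0]])
z = np.array([[0.1, -0.2], [0.9, 1.2]])
zq, idx = vector_quantize(z, codebook)  # idx = [0, 1]
```

In training, the non-differentiable argmin is bypassed with a straight-through estimator, and the codebook is pulled toward the encoder outputs by a commitment loss.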
Training neural nets with quantized weights at arbitrarily specified bit-depths.
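Arbitrary bit-depth training usually means fake quantization: weights are snapped to a uniform k-bit grid in the forward pass while gradients flow through unchanged (a straight-through estimator). A minimal sketch of the forward-pass quantizer (one plausible scheme, not necessarily this repository's; `fake_quantize` is a hypothetical name):

```python
import numpy as np

def fake_quantize(w: np.ndarray, bits: int) -> np.ndarray:
    """Symmetric uniform quantization of weights to `bits` bits (bits >= 2),
    returned in float so it can sit inside a training loop."""
    levels = 2 ** (bits - 1) - 1  # e.g. 127 for 8 bits, 1 for 2 bits
    max_abs = np.max(np.abs(w))
    scale = max_abs / levels if max_abs > 0 else 1.0
    return np.clip(np.round(w / scale), -levels, levels) * scale

w = np.array([0.9, -0.45, 0.1])
w4 = fake_quantize(w, 4)  # snapped to a 15-level grid, scale = 0.9 / 7
```

At 2 bits this collapses the weights to at most three values (ternary), which is why very low bit-depths typically need retraining to recover accuracy.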
Artifact for SC21: APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores.
Checks the effect of quantization on ResNet architectures.
[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP