Beast code in Giters

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Language:PythonMIT000

MLIR-TVM

000

NN-CUDA-Example

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Apache-2.0000

OpenVINO-Custom-Layers

Tutorial for Using Custom Layers with OpenVINO (Intel Deep Learning Toolkit)

Language:C++000

pytorch-cifar

95.47% on CIFAR10 with PyTorch

Language:PythonMIT000

pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

MIT000

roofline

Roofline prototype for Arm

Language:C++NOASSERTION000

shark-samples

Apache-2.0000

tensorflow

An Open Source Machine Learning Framework for Everyone

Apache-2.0000

tensorflow-predictor-cpp

tensorflow prediction using c++ api

Language:Python000

Tensorflow-TensorRT

This repository is for my YT video series about optimizing a Tensorflow deep learning model using TensorRT. We demonstrate optimizing LeNet-like model and YOLOv3 model, and get 3.7x and 1.5x faster for the former and the latter, respectively, compared to the original models.

000

jxhekang

hekang's repositories

ai_inference_tools

algorithm-pattern

caffe

caffe-windows

cmake-examples

CodingInterviewChinese2

Deep-Compression-AlexNet

DeepLearning-500-questions

DL_tensorflow

Edge-AI-Platform-Tutorials

Efficient-Neural-Network-Bilibili

gputil

Hands-On-GPU-Accelerated-Computer-Vision-with-OpenCV-and-CUDA

how-to-optimize-gemm

micronet

MLIR-TVM

NN-CUDA-Example

OpenVINO-Custom-Layers

pytorch-cifar

pytorch-distributed

roofline

shark-samples

tensorflow

tensorflow-predictor-cpp

Tensorflow-TensorRT

TensorRT-Program

test0620

torch-mlir

UVM_template

yolov5_cpp_openvino