dzy (666DZY666)

666DZY666

Geek Repo

Company:Peking University

Location:Beijing

Github PK Tool:Github PK Tool

dzy's repositories

micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Language:PythonLicense:MITStargazers:2200Issues:40Issues:109

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

Torch-Pruning

[CVPR-2023] Towards Any Structural Pruning; LLMs / Diffusion / YOLOv8 / CNNs / Transformers

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

BRECQ

Pytorch implementation of BRECQ, ICLR 2021

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CenterNet

Object detection, 3D detection, and pose estimation using center point detection:

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

how-to-learn-deep-learning-framework

how to learn PyTorch and OneFlow

License:Apache-2.0Stargazers:0Issues:0Issues:0

IntraQ

Pytorch implementation of our paper accepted by CVPR 2022 -- IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization

Language:PythonStargazers:0Issues:0Issues:0

KuiperInfer

带你从零实现一个高性能的深度学习推理库,支持Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

Language:C++License:MITStargazers:0Issues:0Issues:0

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MQBench

Model Quantization Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

nndeploy

nndeploy is a cross-platform, high-performing, and straightforward AI model deployment framework. We strive to deliver a consistent and user-friendly experience across various inference framework in complex deployment environments and focus on performance. nndeploy一款跨平台、高性能、简单易用的模型端到端部署框架。我们致力于屏蔽不同推理框架的差异,提供一致且用户友好的编程体验,同时专注于部署全流程的性能。

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnx-modifier

A tool to modify onnx models in a visualization fashion, based on Netron and flask.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0
License:BSD-3-Clause-ClearStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Python-100-Days

Python - 100天从新手到大师

Language:PythonStargazers:0Issues:0Issues:0

python-patterns

A collection of design patterns/idioms in Python

Language:PythonStargazers:0Issues:0Issues:0

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Quantformer

This is the official pytorch implementation for the paper: *Quantformer: Learning Extremely Low-precision Vision Transformers*.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆20 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:BSD-3-Clause-ClearStargazers:0Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

tvm_learn

tvm learn

Language:PythonStargazers:0Issues:0Issues:0

vision

Datasets, Transforms and Models specific to Computer Vision

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0