zeroine

zeroine

Geek Repo

Github PK Tool:Github PK Tool

zeroine's repositories

Language:C++Stargazers:19Issues:0Issues:0

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

MQBench

Model Quantization Benchmark

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0