ftian1

Tian, Feng's starred repositories

neural-speed

An innovative library for efficient LLM inference via low-bit quantization

Language:C++Apache-2.029300

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonApache-2.0203600

py-faster-rcnn

Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version

Language:PythonNOASSERTION806600

CNN-compression-performance

A python script that automatise the training of a CNN, compress it through tensorflow (or ristretto) plugin, and compares the performance of the two networks

Language:Python2800

caffe

Ristretto: Caffe-based approximation of convolutional neural networks.

Language:C++NOASSERTION29200