There are 0 repository under fp8 topic.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
An innovative library for efficient LLM inference via low-bit quantization
Spike, a RISC-V ISA Simulator with added 8-bit vector floating point support