NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Home Page:https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/index.html

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cuda 12.1 dont work

yangguoming opened this issue · comments

I have run the test code which given on the git, and the cuda version also meets the installation requirements, how do I fixed with it? Do I need reinstall cuda?

Support for FP8 in Ada GPUs was added in CUDA 12.1 update 1 (https://developer.nvidia.com/cuda-12-1-1-download-archive). If reinstalling full CUDA toolkit is an issue, then you should be able to just use the updated cuBLAS build from there.