alecco / llm_fp8

llm fp8

Implement an LLM using NVIDIA FP8 support.

Inspired by Peng, H., et al., "FP8-LM: Training FP8 Large Language Models" (arXiv:2310.18313v2), and Karpathy's llm.c.
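As an illustration of the general idea behind FP8 training, here is a minimal CPU-side sketch of per-tensor scaling into the E4M3 format (4 exponent bits, 3 mantissa bits, largest finite value 448). It is not code from this repository: the names `round_to_e4m3`, `QuantizedTensor`, and `quantize_per_tensor` are hypothetical, and the FP8 values are simulated in `float` rather than produced by NVIDIA hardware types.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Largest finite magnitude representable in FP8 E4M3.
constexpr float kE4M3Max = 448.0f;

// Round a float to the nearest value representable in E4M3, saturating at
// +/-448. The result is kept in a float, so this only simulates the format.
float round_to_e4m3(float x) {
    if (x == 0.0f || std::isnan(x)) return x;
    float a = std::fmin(std::fabs(x), kE4M3Max);  // saturate instead of overflowing
    int e = 0;
    std::frexp(a, &e);                     // a = m * 2^e with m in [0.5, 1)
    int exp = std::max(e - 1, -6);         // -6 is the smallest normal exponent
    float step = std::ldexp(1.0f, exp - 3);  // value spacing with 3 mantissa bits
    float q = std::nearbyint(a / step) * step;  // round to nearest (ties to even by default)
    return std::copysign(q, x);
}

// Hypothetical container: simulated FP8 values plus the scale used to make them.
struct QuantizedTensor {
    std::vector<float> values;
    float scale;
};

// Per-tensor scaling: map the largest magnitude in x onto the E4M3 maximum,
// then round every scaled element to the E4M3 grid.
QuantizedTensor quantize_per_tensor(const std::vector<float>& x) {
    float amax = 0.0f;
    for (float v : x) amax = std::max(amax, std::fabs(v));
    float scale = (amax > 0.0f) ? kE4M3Max / amax : 1.0f;
    QuantizedTensor out{{}, scale};
    out.values.reserve(x.size());
    for (float v : x) out.values.push_back(round_to_e4m3(v * scale));
    return out;
}
```

To recover an approximation of the original tensor, divide each stored value by `scale`. Scaling by the tensor's maximum magnitude is one common choice; it keeps the largest element exactly at the format's limit so that the narrow FP8 range is used as fully as possible.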

TODO

  • Load tokens

About

License: GNU Affero General Public License v3.0


Languages

  • C++ 80.8%
  • Cuda 13.6%
  • CMake 5.0%
  • Makefile 0.6%