usyd-fsalab / fp6_llm

Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
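For context on what an x-bit float like FP6 looks like, here is a minimal decode sketch. It assumes the common 1-3-2 sign/exponent/mantissa (E3M2) split with bias 3 and no inf/NaN encodings; the repo's kernels may use a different layout or bias, so treat this as illustrative only.

```python
def decode_fp6_e3m2(bits: int) -> float:
    """Decode a 6-bit value, assuming a 1-3-2 (sign/exp/mantissa) layout."""
    sign = -1.0 if (bits >> 5) & 0x1 else 1.0
    exp = (bits >> 2) & 0x7       # 3 exponent bits
    man = bits & 0x3              # 2 mantissa bits
    bias = 3                      # 2**(3-1) - 1
    if exp == 0:                  # subnormal: no implicit leading 1
        return sign * (man / 4.0) * 2.0 ** (1 - bias)
    return sign * (1.0 + man / 4.0) * 2.0 ** (exp - bias)

# Under these assumptions the representable magnitudes span 0.0625 to 28.0.
```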

Can we get FP4?

catid opened this issue

FP6 doesn't seem to be a useful size. The best models we can run are 70B, and only 4-bit models will fit in ~40-48 GB of VRAM.
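A quick back-of-the-envelope check of the weights-only footprint (ignoring KV cache and activations) illustrates the point; the 70B parameter count and 40-48 GB card sizes are taken from the comment above:

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Weights-only memory footprint in GiB (ignores KV cache/activations)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30

for bits in (4, 5, 6, 16):
    print(f"70B @ FP{bits}: {weight_gib(70, bits):.1f} GiB")
# 70B @ FP4:  ~32.6 GiB -> fits in a 40-48 GB card with headroom
# 70B @ FP5:  ~40.7 GiB -> tight
# 70B @ FP6:  ~48.9 GiB -> does not fit
# 70B @ FP16: ~130.4 GiB
```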

We will support FP5 soon. Yes, I will also try to support FP4.