How to quantize stories15M.bin

Question

forcekeng opened this issue 7 months ago · comments

Hi, I want know how to quantize stories15M.bin or stories42M.bin. I try to use python export.py, it shows no params.json.

electronics app dev · Answer 1 · Sun Jan 28 2024 05:07:45 GMT+0800 (China Standard Time)

if you have a checkpoint file .pt

$ python export.py stories15M_q8.bin --version 2 --checkpoint out/ckpt.pt