soodoshll / bsp

Batched Speculative Inference of LLM

Note: do not expect accurate results when running in fp16.
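
The repository does not say why fp16 is inaccurate. One plausible contributor, assumed here and not taken from the project, is that half-precision addition is sensitive to the order of operations, so batched and unbatched code paths can produce slightly different numbers. The sketch below illustrates that effect using only the Python standard library; `to_fp16` and `fp16_sum` are hypothetical helpers, not part of bsp.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def fp16_sum(values):
    """Sum left to right, rounding to fp16 after every addition."""
    total = 0.0
    for v in values:
        total = to_fp16(total + to_fp16(v))
    return total


# Same three numbers, two different orders (illustrative values only).
large_first = fp16_sum([2048.0, 1.0, 1.0])
small_first = fp16_sum([1.0, 1.0, 2048.0])

print(large_first, small_first)
```

From 2048 upward fp16 can only represent every second integer, so adding 1 to 2048 rounds back to 2048, while adding 2 does not. The two orders therefore disagree even though the inputs are identical.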

Languages

Python 97.9%, Shell 2.1%