[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Home Page: https://infini-ai-lab.github.io/TriForce/