[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Home Page:https://arxiv.org/abs/2402.02057
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool