hao-ai-lab / LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Home Page: https://arxiv.org/abs/2402.02057

hao-ai-lab/LookaheadDecoding Issues