Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool