Organization data from Github https://github.com/FasterDecoding
Think deeper, decode faster
Location:United States of America
GitHub:@FasterDecoding
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
REST: Retrieval-Based Speculative Decoding, NAACL 2024