Source code for our paper "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models".
Home Page: https://arxiv.org/abs/2404.09529