siyan-zhao / prepacking

Source code for our paper "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models".

Home Page: https://arxiv.org/abs/2404.09529

siyan-zhao/prepacking Issues