[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
Home Page: https://arxiv.org/abs/2305.11627
qxpBlog opened this issue 5 months ago
Which file did you run to obtain the latency figures reported in the paper?
@VainF @eltociear @horseee