horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

https://arxiv.org/abs/2305.11627

延迟评估

qxpBlog opened this issue 5 months ago · comments

Xinpeng Qin commented 5 months ago

您在论文中提到的延迟数据具体是运行那个文件得到的：

Xinpeng Qin commented 5 months ago

@VainF @eltociear @horseee