horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

https://arxiv.org/abs/2305.11627

Is this method implementable on multi-GPUs?

LeonCheng0129 opened this issue 2 months ago · comments

i_heard_you_looking commented 2 months ago