Is this method implementable on multi-GPUs?
LeonCheng0129 opened this issue · comments
i_heard_you_looking commented
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
LeonCheng0129 opened this issue · comments