[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
Home Page:https://arxiv.org/abs/2305.11627
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
coderchem opened this issue 5 months ago · comments
I cut 25% of all the layers, but the cut shape is not I wanne, I hope the shape is [N,N] ,but [N,M] ,the M=N*0.25. it's difficult to load.