Code for the NeurIPS 2023 paper: "ZipLM: Inference-Aware Structured Pruning of Language Models".
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool