jongwooko/distillm Issues
Dataset link seems invalid now
Closed 2
The metric of GPT-4 Eval
Closed 1
How to download the models after sft
Closed 4
OPTS+=" --kd-ratio 1.0"
Closed 2
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)