Question: Llama2-70B - Calibration and Quantization

Question

rgandikota opened this issue 5 months ago · comments

For the Orca dataset, the preprocessing step generates a calibration pkl file.
Can this file be used as the calibration dataset, or should we use the cnn-daily dataset that other language benchmarks use?
The current README file (https://github.com/mlcommons/inference/blob/e39003a9c4c89a2215db0ca57ad7a57b16f9a785/language/llama2-70b/README.md) doesn't give any pointers on calibration or quantization. We are submitting for the first time and would appreciate any help.