princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

About model format conversion

fzxxg opened this issue · comments

I didn't reproduce the training with your code; can I still use your evaluation script? I noticed that your model format needs to be converted. If I don't convert it, the evaluation still runs successfully, but what impact does skipping the conversion have? This is the command I ran for the evaluation script:
python evaluation.py \
    --model_name_or_path bert-base-uncased-sts \
    --pooler cls_before_pooler \
    --task_set sts/transfer \
    --mode test

I only saved the BERT parameters in the model.
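For context, here is a minimal sketch of what the two pooling choices behind --pooler mean at inference time; the model name and example sentence are placeholders, not taken from this issue. "cls" reads BERT's pooler_output (the [CLS] state passed through the trained Linear+Tanh pooler head), while "cls_before_pooler" takes the raw [CLS] hidden state directly.

# Minimal pooling sketch; "bert-base-uncased" and the sentence are placeholders.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

batch = tokenizer(["A man is playing guitar."], return_tensors="pt")
with torch.no_grad():
    outputs = model(**batch)

emb_cls_before_pooler = outputs.last_hidden_state[:, 0]  # raw [CLS] embedding
emb_cls = outputs.pooler_output                          # [CLS] after the pooler head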

I believe the output format is safetensors. There shouldn't be any actual impact.

Hi @fzxxg, if you use our training code and don't convert the checkpoint, it won't perform as well as the paper reports; however, if you are evaluating other models, it should be fine (as long as the inference for those models is strictly the same as how they are supposed to be used).
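For readers hitting the same question, the following is an illustrative sketch of the kind of state-dict conversion involved; it is not the repository's conversion script, and the "bert." prefix and file paths are assumptions for illustration. A checkpoint saved by a training wrapper typically stores the encoder weights under a prefix, and loading it with a plain AutoModel.from_pretrained() can leave mismatched keys randomly initialized (with only a warning), which is one way an unconverted checkpoint can end up scoring below the paper's numbers.

# Illustrative sketch only (not the repository's converter): strip an assumed
# "bert." wrapper prefix so the weights match a plain BertModel's parameter names.
# Paths and the prefix are hypothetical placeholders; check your checkpoint's keys.
import torch

state_dict = torch.load("result/my-run/pytorch_model.bin", map_location="cpu")

prefix = "bert."  # assumed wrapper prefix
converted = {}
for key, value in state_dict.items():
    if key.startswith(prefix):
        converted[key[len(prefix):]] = value  # keep encoder weights, drop prefix
    # other heads (e.g. a contrastive MLP) are intentionally dropped in this sketch

torch.save(converted, "result/my-run/pytorch_model_converted.bin")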

Thank you for your reply. I will close this issue.