Experiment with using a RepSet of the 196k dataset for EvolInstruct 1k
walking-octopus opened this issue
The new WizardLM 13B v1.1 was fine-tuned with a 1k instruct dataset, similar to the LIMA paper.
I wonder if making the 1k dataset more representative of the initial 100k distribution can boost performance on some tasks.
Google published an interesting paper, Extracting representative subset from extensive text data for training pre-trained language models, in which they applied the method to the Colossal Clean Crawled Corpus (C4) to test whether LLMs pretrained on fewer tokens drawn from a representative subset performed better, and it did.
Perhaps this can be of use for diverse instruction alignment too?
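For concreteness, here is a rough sketch of the kind of selection I have in mind: embed each instruction, cluster the embeddings, and keep the example closest to each cluster centroid so the 1k subset roughly covers the full distribution. The embedding model, clustering method, and file path below are placeholder assumptions, not what the paper or the WizardLM team actually use.

```python
import json

from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.metrics import pairwise_distances_argmin_min


def select_representative_subset(instructions, k=1000, seed=0):
    """Return k instructions that roughly cover the embedding distribution."""
    # Assumed embedding model; any reasonable sentence encoder would do.
    model = SentenceTransformer("all-MiniLM-L6-v2")
    embeddings = model.encode(instructions, normalize_embeddings=True)

    # Cluster the full dataset into k groups.
    kmeans = KMeans(n_clusters=k, random_state=seed).fit(embeddings)

    # For each centroid, keep the single closest instruction as its representative.
    closest_idx, _ = pairwise_distances_argmin_min(kmeans.cluster_centers_, embeddings)
    return [instructions[i] for i in sorted(set(closest_idx))]


if __name__ == "__main__":
    # "evol_instruct.json" is a hypothetical path to the full EvolInstruct dump,
    # assumed to be a list of {"instruction": ..., "output": ...} records.
    with open("evol_instruct.json") as f:
        data = json.load(f)
    subset = select_representative_subset([row["instruction"] for row in data], k=1000)
    print(len(subset))
```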
Thank you for your suggestions. We will read this paper.