nlpxucan / WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

How to reproduce Evol-Instruct datasets?

imoneoi opened this issue

commented

I've seen your open-source Evol-Instruct generation scripts. Good job!

Additionally, can you provide instructions on how to reproduce the WizardLM dataset and WizardCoder dataset using the scripts provided?

You can just change the input file path in WizardLM/Evol-Instruct/main.py to the file you want to evolve, then run "python main.py". The default file is the Alpaca data.
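
For anyone unsure what "pointing the script at another file" amounts to, here is a minimal sketch of that workflow. It is not the code in main.py: `evolve_file` and `placeholder_evolve` are hypothetical names, the output layout is an assumption, and the only thing taken as given is that Alpaca-style data is a JSON list of records with "instruction" and optional "input" fields.

```python
import json
from pathlib import Path
from typing import Callable


def evolve_file(seed_path: str, out_path: str, evolve: Callable[[str], str]) -> int:
    """Read Alpaca-style records, evolve each instruction, and write the result.

    Hypothetical helper for illustration; returns the number of records written.
    """
    records = json.loads(Path(seed_path).read_text(encoding="utf-8"))
    evolved = []
    for rec in records:
        # Alpaca records keep the task in "instruction" and optional context in "input".
        prompt = rec["instruction"]
        if rec.get("input"):
            prompt += "\n" + rec["input"]
        evolved.append({"instruction": evolve(prompt)})
    Path(out_path).write_text(
        json.dumps(evolved, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return len(evolved)


def placeholder_evolve(instruction: str) -> str:
    # Stand-in for an LLM call; a real run would send a rewriting prompt to a model.
    return instruction + " Explain your reasoning step by step."
```

Swapping the seed file is then just a matter of passing a different `seed_path`, which is roughly what editing the path in the script does.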

commented

What is the seed file for the WizardLM and WizardCoder datasets?

commented

BTW, the scripts seem to be missing the error checker and iterative evolution described in the paper. Are these parts necessary?
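
To make the question concrete, here is a rough sketch of what those two missing pieces could look like when wrapped around a single-step evolve function. Everything here is an assumption for illustration: `looks_failed`, `evolve_rounds`, the marker strings, and the fall-back-to-parent policy are hypothetical and are not the criteria or code the authors used.

```python
from typing import Callable, List


def looks_failed(original: str, evolved: str) -> bool:
    """Cheap, illustrative checks for a failed evolution (assumed heuristics)."""
    text = evolved.strip()
    if not text:
        return True  # the model returned nothing usable
    if text.lower() == original.strip().lower():
        return True  # nothing changed
    # Template text leaking from the rewriting prompt into the result.
    leaked = ("given prompt", "rewritten prompt")
    return any(marker in text.lower() for marker in leaked)


def evolve_rounds(
    seeds: List[str], evolve: Callable[[str], str], rounds: int
) -> List[List[str]]:
    """Run several evolution rounds, feeding each round's pool into the next.

    Returns one pool per round (index 0 is the seed pool). A failed evolution
    keeps its parent instruction so the pool size stays constant.
    """
    pools = [list(seeds)]
    for _ in range(rounds):
        next_pool = []
        for parent in pools[-1]:
            child = evolve(parent)
            next_pool.append(parent if looks_failed(parent, child) else child)
        pools.append(next_pool)
    return pools
```

With a wrapper like this, the existing one-shot script would act as the `evolve` callable, and the final dataset could be taken from the last pool or merged across all rounds.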

commented

Any updates?

> BTW, the scripts seem to be missing the error checker and iterative evolution described in the paper. Are these parts necessary?

+1