This repo contains script to clean up data and These are two files you can run to do finetuning.
- Get your data into a CSV, ideally two columns. One will contain the prompt and one will contain the expected response.
- Specify your system prompt in the
SYSTEM_PROMPT
variable inpreparing-data.py
. - Run
preparing-data.py
to transform data into JSONL format with a system prompt of your choice. - Make sure you've exported the env variable
OPENAI_API_KEY
with your API key. - Specify the name of the model then run
finetuning-data.csv
to actually trigger a fine-tuning job through OpenAI.
- Validation script and step