spencerhhubert / comb

for combing through prompt, target instruction data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

comb

an application for combing through prompt, target instruction data for fine-tuning LLMs

if you already have prompt and target pairs scraped/gathered, but they need to be edited, formatted, and such, this is the right tool for you.

comb demo screenshot

how to use

although this is a server application, it's really only meant to be run locally for quickly iterating over your own data.

you'll need two folders in /data, /data/raw and /data/out. out can be empty. raw will contain .JSON files, each with a single prompt, target (the keys "prompt and "target" need to exist) inside them.

the program will iterate over those files, and write the edited versions to /data/out with the same file names.

chmod +x utils/docker/*
./utils/docker/build.sh
./utils/docker/run.sh

and go to http://localhost:3000/

About

for combing through prompt, target instruction data

License:MIT License


Languages

Language:JavaScript 69.2%Language:HTML 25.4%Language:Dockerfile 3.3%Language:Shell 1.3%Language:CSS 0.7%