captions-to-vqa

We will be running generate-qa.sh, which uses torchrun.

Replace --ckpt_dir and --tokenizer_path with your llama-3 checkpoint and tokenizer paths.
If you are not using shard output_28118.pkl, then replace this filename in generate_qa.py.
Select image keys in keys.py. The image keys must within the shard you previously selected. This is how we select images to generate QA pairs from.
Run bash generate-qa.sh. This will generate an output folder with subdirectories named after the prompt version. Output files are named output_{key}_{prompt_version}.

About

Language:Jupyter Notebook 99.6%Language:Python 0.3%Language:Shell 0.0%