Full training example with custom dataset

Question

unrue opened this issue 7 months ago

Do you have a full OWL-ViT training example that uses a custom dataset, starting from scratch? I can't work out what to do from the manual, which only gives this command:


`python -m scenic.projects.owl_vit.main \
  --alsologtostderr=true \
  --workdir=/tmp/training \
  --config=scenic/projects/owl_vit/configs/clip_b32_finetune.py`

I have a dataset with 30k images. Do I need a text description for each image instead of a simple label? Do I need to convert the dataset into some other format? Do I need to train a CLIP model first? Could someone explain all the steps for training on a custom dataset? Thanks.

WKaiH123 · Answer 1 · Mon Jan 08 2024 16:27:20 GMT+0800 (China Standard Time)

@unrue I have the same confusion. Do you know the process for building a custom dataset and fine-tuning on it? My task is fine-tuning for object detection. Do I need to provide the target text and target box coordinates for each image? Thanks!
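
To make the "target text and target coordinates" question concrete, here is a minimal sketch of what a per-image record could look like when a dataset only has simple class labels and COCO-style pixel boxes. Everything in it is an assumption for illustration: the `DetectionExample` record, the helper names, the "a photo of a {}" prompt template, and the normalized (cx, cy, w, h) box layout are hypothetical and are not taken from the Scenic code. The format the `clip_b32_finetune.py` config actually expects has to be checked against the input pipeline in the repository.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple

Box = Tuple[float, float, float, float]


@dataclass
class DetectionExample:
    """Hypothetical per-image record: one text query and one box per object."""
    image_path: str
    queries: List[str]
    boxes: List[Box]  # assumed layout: (cx, cy, w, h), normalized to [0, 1]


def label_to_query(label: str, template: str = "a photo of a {}") -> str:
    """Turn a plain class label into a text query using a prompt template."""
    return template.format(label)


def coco_box_to_normalized_cxcywh(
    box: Sequence[float], image_width: int, image_height: int
) -> Box:
    """Convert a COCO-style [x_min, y_min, width, height] pixel box to
    normalized (cx, cy, w, h)."""
    x_min, y_min, width, height = box
    return (
        (x_min + width / 2) / image_width,
        (y_min + height / 2) / image_height,
        width / image_width,
        height / image_height,
    )


def build_example(
    image_path: str,
    image_width: int,
    image_height: int,
    annotations: Sequence[dict],
    category_names: Dict[int, str],
) -> DetectionExample:
    """Pair every annotated object with a text query and a normalized box."""
    queries = [label_to_query(category_names[a["category_id"]]) for a in annotations]
    boxes = [
        coco_box_to_normalized_cxcywh(a["bbox"], image_width, image_height)
        for a in annotations
    ]
    return DetectionExample(image_path=image_path, queries=queries, boxes=boxes)


if __name__ == "__main__":
    example = build_example(
        image_path="images/0001.jpg",  # hypothetical path
        image_width=640,
        image_height=480,
        annotations=[{"category_id": 1, "bbox": [160, 120, 320, 240]}],
        category_names={1: "cat"},
    )
    print(example)
```

Under these assumptions a simple label per object is enough, because the text side is generated from the class name rather than written by hand for each image. Whether that matches what the training config reads is exactly the open question in this thread.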