microsoft / RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to finetune with my own dataset?

yupu891101 opened this issue · comments

Hi!!!
This is the task I am working on. The picture is of a flawed panel, and I hope to detect the flawed area through RegionCLIP. How should I proceed with fine-tuning?

Additionally, does the format of the JSON file for the dataset need to fully follow the format of the COCO dataset? I am not very clear about the requirements for the format.

Could you provide a clear explanation of the process and data preparation needed for fine-tuning?

7A3D829002_7C3DAEW6AA_PAT1
defect