using my own custom dataset

Question

nikky4D opened this issue 2 years ago

Nkiruka Uzuegbunam commented 2 years ago

I would like to finetune on my own dataset. Do you have recommendations on how I can create my own dataset for this?

Nkiruka Uzuegbunam · Answer 1 · Wed Feb 23 2022 16:40:43 GMT+0800 (China Standard Time)

I have a question about pretraining. I want to pretrain only on my own dataset. Can I modify pretrain.json so that it specifies only the path to my dataset? What else should I change to run pretraining?

Muhammad Maaz · Answer 2 · Fri Feb 25 2022 07:42:24 GMT+0800 (China Standard Time)

Hi @nikky4D,

Thank you for your interest in our work. We use the same pretraining setup as MDETR. Specifically, we trained on approximately 1.3M image-caption pairs from GQA, COCO, and Flickr.

To train on your custom dataset, you will need to convert it to COCO format, with captions and a tokens_positive field that defines the alignment between caption text and bounding boxes. The required format of tokens_positive is explained in a separate issue, and the standard data loader used is available for reference.
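As a rough illustration of the conversion described above, here is a minimal Python sketch. It assumes the MDETR-style convention in which each image entry carries a caption and each box annotation carries tokens_positive as a list of [start, end] character offsets into that caption. The helper names (phrase_span, build_coco_with_captions), the input layout of samples, and the single placeholder category are hypothetical and not taken from the repository, so please check the result against the data loader you actually use.

```python
import json


def phrase_span(caption, phrase):
    """Return [start, end] character offsets of the first match of phrase in caption."""
    start = caption.find(phrase)
    if start < 0:
        raise ValueError(f"phrase {phrase!r} not found in caption {caption!r}")
    return [start, start + len(phrase)]


def build_coco_with_captions(samples):
    """Build a COCO-style dict with a caption per image and tokens_positive per box.

    `samples` is a hypothetical input layout: a list of dicts with keys
    file_name, width, height, caption, and objects, where objects is a list
    of (phrase, [x, y, w, h]) pairs.
    """
    images, annotations = [], []
    ann_id = 0
    for img_id, sample in enumerate(samples):
        images.append({
            "id": img_id,
            "file_name": sample["file_name"],
            "width": sample["width"],
            "height": sample["height"],
            "caption": sample["caption"],
        })
        for phrase, (x, y, w, h) in sample["objects"]:
            annotations.append({
                "id": ann_id,
                "image_id": img_id,
                "category_id": 1,  # placeholder category (assumption)
                "bbox": [x, y, w, h],  # COCO boxes are [x, y, width, height]
                "area": w * h,
                "iscrowd": 0,
                # Character spans of the caption text that refer to this box.
                "tokens_positive": [phrase_span(sample["caption"], phrase)],
            })
            ann_id += 1
    return {
        "images": images,
        "annotations": annotations,
        "categories": [{"id": 1, "name": "object"}],
    }


if __name__ == "__main__":
    samples = [{
        "file_name": "0001.jpg",
        "width": 640,
        "height": 480,
        "caption": "a dog chasing a red ball",
        "objects": [("dog", [50, 60, 200, 150]), ("red ball", [300, 200, 40, 40])],
    }]
    print(json.dumps(build_coco_with_captions(samples), indent=2))
```

In this sketch, a box whose phrase appears more than once in the caption only receives the first match. If your annotations refer to several spans per box, you would append additional [start, end] pairs to the same tokens_positive list.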

In addition, you can evaluate MDef-DETR on your dataset without any pretraining or fine-tuning. Please refer to this issue for details.

I hope this information will be helpful.