chenhaoxing / DiffusionInst

This repo contains the code for the paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).

Typo: dirname `utli` -> `util`

vadimkantorov opened this issue

Also, it would be nice to add pointers to the model component sources (especially the decoder) to the README, since they are not discussed much in the paper.

E.g. could you please comment on the inference path `ddim_sample` and the `preds, outputs_class, outputs_coord, outputs_kernel, mask_feat = self.model_predictions(backbone_feats, images_whwh, img, time_cond, self_cond, clip_x_start=clip_denoised)` call, which seems to be the decoder call? Counterintuitively, it seems that the noisy boxes are stored in the `img` variable, right? And the dynamic mask kernels are produced in a deterministic way, right?
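For concreteness, here is how I currently read that inference path, written as a rough sketch. Only the `model_predictions` call signature is taken from the code above; the `pred_x_start`, `alphas_cumprod`, and `num_proposals` names below are my guesses, not necessarily the repo's actual attributes:

```python
import torch

@torch.no_grad()
def ddim_sample_sketch(model, backbone_feats, images_whwh, num_proposals=500, steps=4):
    """Rough reading of the DiffusionDet-style DDIM loop; not the repo's exact code."""
    device = images_whwh.device
    batch = images_whwh.shape[0]
    # Counterintuitively, `img` holds noisy *boxes* (cx, cy, w, h), not an image.
    img = torch.randn(batch, num_proposals, 4, device=device)

    times = torch.linspace(-1, 999, steps + 1).long().flip(0)   # 999 -> ... -> -1
    time_pairs = list(zip(times[:-1].tolist(), times[1:].tolist()))

    x_start = None
    for time, time_next in time_pairs:
        time_cond = torch.full((batch,), time, device=device, dtype=torch.long)
        # The decoder call: denoises the boxes and, in the same pass, predicts
        # classes and the dynamic mask kernels from the box-pooled features.
        preds, outputs_class, outputs_coord, outputs_kernel, mask_feat = \
            model.model_predictions(backbone_feats, images_whwh, img,
                                    time_cond, x_start, clip_x_start=True)
        x_start = preds.pred_x_start            # denoised boxes at this step

        if time_next < 0:
            img = x_start
            continue
        # DDIM update: re-noise the denoised boxes for the next (smaller) timestep.
        alpha = model.alphas_cumprod[time]
        alpha_next = model.alphas_cumprod[time_next]
        eps = (img - alpha.sqrt() * x_start) / (1. - alpha).sqrt()
        img = alpha_next.sqrt() * x_start + (1. - alpha_next).sqrt() * eps

    return outputs_class[-1], outputs_coord[-1], outputs_kernel[-1], mask_feat
```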

Thanks!

@vadimkantorov
Thanks for your advice!
The dirname has been changed.
The decoder follows the code of DiffusionDet, including its variable and function names.
Finally, the mask kernel filters are currently generated from the bounding boxes. We have fixed the training and inference equations in the latest arXiv version. Also see #1.
We are still working on directly denoising the filters and will clean up and revise the code in the future.
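For readers unfamiliar with dynamic mask heads, here is a hypothetical sketch of what "kernels generated from boxes" can look like (CondInst-style dynamic convolution; the layer and variable names below are illustrative, not the actual DiffusionInst code):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicMaskHeadSketch(nn.Module):
    """Illustrative sketch: per-box features -> per-instance 1x1 conv kernels
    applied to a shared mask feature map. Not the repo's actual head."""
    def __init__(self, feat_dim=256, mask_dim=8):
        super().__init__()
        self.mask_dim = mask_dim
        # Predict a small conv kernel (weights + bias) per box, deterministically.
        self.kernel_fc = nn.Linear(feat_dim, mask_dim + 1)

    def forward(self, box_feats, mask_feat):
        # box_feats: (N, feat_dim) features pooled inside each predicted box
        # mask_feat: (mask_dim, H, W) shared mask-branch output for one image
        params = self.kernel_fc(box_feats)                # (N, mask_dim + 1)
        weights = params[:, :self.mask_dim]               # (N, mask_dim)
        bias = params[:, self.mask_dim]                    # (N,)
        # 1x1 dynamic convolution: one kernel per instance.
        logits = F.conv2d(mask_feat.unsqueeze(0),
                          weights.view(-1, self.mask_dim, 1, 1),
                          bias=bias)
        return logits.squeeze(0).sigmoid()                 # (N, H, W) instance masks
```

In such a setup, the kernels are a deterministic function of the box features; only the boxes themselves go through the diffusion/denoising process.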

It's also a bit disappointing that, according to your results in the README, mask AP barely improves when going from 1 step to 4 steps :(

Yes, it is. That is why we are trying different denoising strategies and mask representations in further research.