[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
bradduy opened this issue 6 months ago · comments
Hi authors,
In DINO, the backbones used include ViT. So why does MaskDINO not use ViT as a backbone? Is there a particular reason?
Thank you.