SamsungLabs / imvoxelnet

[WACV2022] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

My guess.

luanliu11 opened this issue · comments

I would like to know, please, if you think the idea of adding a branch for simple depth prediction after 2d feature extraction, and then generating point cloud information from the depth as a proposal when projecting 2d features from the original network to the 3d voxels is feasible?

I think, this idea is called pseudo-lidar and is very well explored on outdoor datasets e.g. KITTI. You can also find some works with this keyword on SUN RGB-D, but I don't have much expertise here.