V2AI / Det3D

World's first general purpose 3D object detection codebase.

Home Page: https://arxiv.org/abs/1908.09492

Multi modal fusion

Byte247 opened this issue

Is there any example of how to do a multi-modal approach in Det3D, for instance how to adjust the data loading pipeline? I am mainly using the nuScenes dataset for now.

Never mind, I found a way to access the camera images by extending nuscenes.py.

So that other people struggle less in the future: you need to add your custom entry, such as "camera_images", to the list of keys in det3d/torchie/parallel/collate.py, e.g.:
if key in ["voxels", "num_points", "num_gt", "voxel_labels", "num_voxels",
           "cyv_voxels", "cyv_num_points", "cyv_num_voxels", "camera_images"]:

Otherwise the values for that key will be converted back to NumPy arrays during collation, even if they were torch.Tensors before.
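
For anyone who wants to see the effect in isolation, here is a minimal sketch of the behaviour described above. It is not Det3D's actual collate code: collate_sketch and TENSOR_KEYS are hypothetical names made up for illustration, and the assumption is that listed keys are concatenated along the first axis and returned as tensors while every other key falls through to a NumPy branch.

```python
import collections

import numpy as np
import torch

# Hypothetical key list for this sketch; "camera_images" is the custom entry.
TENSOR_KEYS = ["voxels", "num_points", "num_voxels", "camera_images"]


def to_numpy(value):
    """Return a NumPy view/copy of a CPU tensor or array-like value."""
    if isinstance(value, torch.Tensor):
        return value.detach().cpu().numpy()
    return np.asarray(value)


def collate_sketch(batch_list, tensor_keys=TENSOR_KEYS):
    """Merge a list of per-sample dicts into a single batch dict."""
    merged = collections.defaultdict(list)
    for example in batch_list:
        for key, value in example.items():
            merged[key].append(to_numpy(value))

    batch = {}
    for key, elems in merged.items():
        if key in tensor_keys:
            # Listed keys: concatenate along axis 0 and return a torch.Tensor.
            batch[key] = torch.from_numpy(np.concatenate(elems, axis=0))
        else:
            # Unlisted keys: stacked, but left as a NumPy array.
            batch[key] = np.stack(elems, axis=0)
    return batch


# Two samples, each with one 3x4x4 "image" and one unlisted entry.
samples = [
    {"camera_images": torch.zeros(1, 3, 4, 4), "extra": torch.zeros(2)}
    for _ in range(2)
]
batch = collate_sketch(samples)
print(type(batch["camera_images"]), type(batch["extra"]))
```

In this sketch "camera_images" comes out as a torch.Tensor because it is in the list, while "extra" ends up as a numpy.ndarray even though it went in as a tensor. Note that concatenating along axis 0 merges the leading dimension across samples, so if you need a separate batch dimension for the images you would have to add it in the dataset or reshape afterwards.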