caiyu97 / MVCNN

Multi-view dual attention network for 3D object recognition (Neural Computing and Applications, 2021)

PyTorch code for MVDAN

A PyTorch implementation of the Multi-view Dual Attention Network for 3D Object Recognition (MVDAN).

The paper converts the 3D object recognition problem into a multi-view 2D image classification problem: each 3D object is represented by multiple images taken from different viewpoints.
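To make the multi-view formulation concrete, here is a minimal sketch of how per-view feature vectors can be merged into one descriptor per object. It uses element-wise max pooling over the view axis, a simple baseline from the MVCNN family, and is not the dual attention mechanism of MVDAN. The function name `pool_views`, the feature dimension, and the random features (stand-ins for CNN outputs) are assumptions made for illustration.

```python
import numpy as np


def pool_views(view_features, num_views):
    """Merge per-view feature vectors into one descriptor per object.

    view_features has shape (num_objects * num_views, feature_dim), with
    the views of each object stored contiguously.
    """
    total, feature_dim = view_features.shape
    if total % num_views != 0:
        raise ValueError("row count must be a multiple of num_views")
    # Group rows so that axis 1 indexes the views of a single object.
    grouped = view_features.reshape(total // num_views, num_views, feature_dim)
    # Element-wise maximum over the view axis (max view pooling).
    return grouped.max(axis=1)


# Random stand-ins for CNN features: 2 objects, 12 views each, 8 dimensions.
rng = np.random.default_rng(0)
features = rng.normal(size=(2 * 12, 8))
descriptors = pool_views(features, num_views=12)
print(descriptors.shape)
```

A classifier would then operate on one descriptor per object rather than on individual views.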

Dependencies

  • Python 3.6
  • PyTorch 1.2.0
  • numpy

Dataset

  • ModelNet CAD data is available from Princeton
  • ModelNet40 12-view PNG images can be downloaded from modelnet40_images_new_12x (1.6 GB)
  • You can also create 3-view and 6-view PNG image sets by keeping only a subset of the 12 views
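One way to derive the smaller view sets is to keep evenly spaced views from each object's 12 images. The sketch below assumes the 12 views are sorted in capture order and that even spacing is acceptable; the helper `subsample_views` and the file names are hypothetical and are not taken from the dataset or the repository.

```python
def subsample_views(view_paths, target_views):
    """Keep evenly spaced views from a list ordered by capture position."""
    total = len(view_paths)
    if target_views <= 0 or total % target_views != 0:
        raise ValueError("target_views must evenly divide the number of views")
    stride = total // target_views
    # Take every stride-th view, starting from the first one.
    return view_paths[::stride]


# Hypothetical file names for the 12 views of a single object.
views = [f"chair_0001_v{i:03d}.png" for i in range(1, 13)]
six_views = subsample_views(views, 6)
three_views = subsample_views(views, 3)
print(six_views)
print(three_views)
```

With 12 input views, this keeps every second view for the 6-view set and every fourth view for the 3-view set.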

Train the model

python train.py -name MVDAN -num_models 1000 -weight_decay 0.0001 -num_views 12 -cnn_name resnet50
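For readers unfamiliar with single-dash long flags, the following sketch shows how a command of this shape could be parsed with `argparse`. It is not the repository's `train.py`: the argument types and the defaults (copied from the command above) are assumptions, and the script may define further options.

```python
import argparse


def build_parser():
    """Hypothetical parser that mirrors the flags in the training command."""
    parser = argparse.ArgumentParser(description="Illustrative flag parser")
    # Types and defaults are assumptions inferred from the example command.
    parser.add_argument("-name", type=str, default="MVDAN")
    parser.add_argument("-num_models", type=int, default=1000)
    parser.add_argument("-weight_decay", type=float, default=0.0001)
    parser.add_argument("-num_views", type=int, default=12)
    parser.add_argument("-cnn_name", type=str, default="resnet50")
    return parser


command = "-name MVDAN -num_models 1000 -weight_decay 0.0001 -num_views 12 -cnn_name resnet50"
args = build_parser().parse_args(command.split())
print(vars(args))
```

Changing `-num_views` to 3 or 6 would be the natural way to train on the reduced view sets, assuming the training script reads the value as shown here.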

About

License:MIT License

