FannyChao / SalGAN360

Saliency prediction on 360° image with SalGAN

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SalGAN360: Visual Saliency Detection on 360° images with GAN

Abstract

Understanding visual attention of observers on 360° images gains interest along with the booming trend of Virtual Reality applications. Extending existing saliency prediction methods from traditional 2D images to 360° images is not a direct approach due to the lack of a sufficient large 360° image saliency database. In this paper, we propose to extend the SalGAN, a 2D saliency model based on the generative adversarial network, to SalGAN360 by fine tuning the SalGAN with our new loss function to predict both global and local saliency maps. Our experiments show that the SalGAN360 outperforms the tested state-of-the-art methods.

Visual Results

qualitative results

Requirements

Pretrained models

Usage

Replace 01-data_preprocessing.py, 02-train.py, 03-predict.py, model_salgan.py, dataRepresentation.py, model.py and utils.py in SalGAN.

  • Test: To predict saliency maps, run salgan360.m after specifying the path to images and the path to the output saliency maps
  • Train:
      1. Run preprocessing_trainingdata.m to transfer 360° images into multiple viewports.
      1. Run 01-data_preprocessing.py to make pickle files.
      1. Run THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32,lib.cnmem=1,optimizer_including=cudnn python 02-train.py to fine tune salgan model.

Citing

   @INPROCEEDINGS{8551543,
       author = {F. Chao and L. Zhang and W. Hamidouche and O. Deforges},
       booktitle = {2018 IEEE International Conference on Multimedia Expo Workshops (ICMEW)},
       title = {Salgan360: Visual Saliency Prediction On 360 Degree Images With Generative Adversarial Networks},
       year = {2018},
       month = {July},}

About

Saliency prediction on 360° image with SalGAN

License:MIT License


Languages

Language:Python 66.2%Language:MATLAB 33.8%