rajeevyasarla / 3SD

3SD: Self-Supervised Saliency Detection With No Labels

3SD

Self-Supervised Saliency Detection method

Paper link

We present a conceptually simple self-supervised method for saliency detection. Our method generates and uses pseudo-ground truth (pseudo-GT) labels for training. The generated pseudo-GT labels do not require any human annotations (e.g., pixel-wise labels or weak labels like scribbles). Recent works show that features extracted from classification tasks provide important saliency cues, such as the structure and semantic information of salient objects in the image. Our method, called 3SD, exploits this idea by adding a branch for a self-supervised classification task in parallel with salient object detection, to obtain class activation maps (CAMs). These CAMs, along with the edges of the input image, are used to generate the pseudo-GT saliency maps that train our 3SD network. Specifically, we propose contrastive learning-based training on multiple image patches for the classification task. We show that multi-patch classification with a contrastive loss improves the quality of the CAMs compared to naive classification on the entire image. Experiments on six benchmark datasets demonstrate that, without any labels, our 3SD method outperforms all existing weakly supervised and unsupervised methods, and its performance is on par with fully supervised methods.
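As an illustration of the class activation maps mentioned above, here is a minimal sketch of the standard CAM computation: a weighted sum of feature maps using one class's classifier weights, followed by rescaling to [0, 1]. It is not taken from the 3SD code. The function names `class_activation_map` and `normalize` are hypothetical, plain Python lists stand in for tensors, and the way 3SD combines CAMs with edge maps is not covered here.

```python
from typing import List

Matrix = List[List[float]]


def class_activation_map(features: List[Matrix], class_weights: List[float]) -> Matrix:
    """Weighted sum of K feature maps (each H x W) using one class's K weights."""
    if len(features) != len(class_weights):
        raise ValueError("need exactly one weight per feature channel")
    height, width = len(features[0]), len(features[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(features, class_weights):
        for y in range(height):
            for x in range(width):
                cam[y][x] += weight * fmap[y][x]
    return cam


def normalize(cam: Matrix) -> Matrix:
    """Clamp negative activations to zero, then rescale so the peak is 1."""
    clipped = [[max(value, 0.0) for value in row] for row in cam]
    peak = max(max(row) for row in clipped)
    if peak == 0.0:
        return clipped  # nothing activated; return the all-zero map
    return [[value / peak for value in row] for row in clipped]


# Toy input (assumed values): two 2x2 feature channels.
features = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 2.0], [0.0, 0.0]],
]
cam = normalize(class_activation_map(features, [2.0, -1.0]))
```

In this toy case the second channel has a negative weight, so its location is suppressed and only the top-left pixel stays active after normalization.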

Prerequisites:

  1. Linux
  2. Python 2 or 3
  3. PyTorch version >= 1.0
  4. CPU or NVIDIA GPU with CUDA and cuDNN (CUDA 9.0)

Dataset

  1. Download the DUTS dataset (http://saliencydetection.net/duts/)
  2. Download the test datasets (https://drive.google.com/open?id=11rPRBzqxdRz0zHYax995uvzQsZmTR4A7)
  3. Edge maps are obtained using a pretrained RCF model. Thanks to Zhang et al. for providing the edge maps for the DUTS dataset. edge maps download

BN

Run the following commands to train and test (training code will be released soon).

For training:
python basenet_train.py
For testing:
python u2net_test_pseudo_dino_final.py

Note: we used step 3 when reporting the numbers in the paper and presentation. The corresponding pretrained model can be found in the folder "saved_models/trans_syn_u2net".

To train and test 3SD in a self-supervised way:

  1. Command for training
    python 3SD_train.py
  2. Command for testing
    python u2net_test_pseudo_dino_final.py
  3. Download pretrained models for self-supervised 3SD: Dropbox

3SD results are available at Google Drive

Evaluation

To compute the metrics, run the following command:

python compute_metrics.py
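The contents of compute_metrics.py are not reproduced here. As a rough illustration, the sketch below computes two metrics commonly reported for saliency detection, mean absolute error (MAE) and F-measure, on flattened saliency maps with values in [0, 1]. The function names, the fixed threshold of 0.5, and beta squared = 0.3 (a common convention in the saliency literature) are assumptions, not settings confirmed from this repository.

```python
from typing import Sequence


def mean_absolute_error(pred: Sequence[float], gt: Sequence[float]) -> float:
    """Average absolute difference between prediction and ground truth."""
    if len(pred) != len(gt) or len(pred) == 0:
        raise ValueError("pred and gt must be non-empty and the same length")
    return sum(abs(p - g) for p, g in zip(pred, gt)) / len(pred)


def f_measure(
    pred: Sequence[float],
    gt: Sequence[float],
    threshold: float = 0.5,  # assumed binarization threshold
    beta_sq: float = 0.3,    # assumed weighting, common in saliency papers
) -> float:
    """Weighted harmonic mean of precision and recall after thresholding."""
    if len(pred) != len(gt) or len(pred) == 0:
        raise ValueError("pred and gt must be non-empty and the same length")
    binary = [p >= threshold for p in pred]
    truth = [g >= 0.5 for g in gt]
    true_pos = sum(1 for b, t in zip(binary, truth) if b and t)
    if true_pos == 0:
        return 0.0  # also covers empty predictions or empty ground truth
    precision = true_pos / sum(binary)
    recall = true_pos / sum(truth)
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Toy flattened maps (assumed values, four pixels each).
pred = [0.9, 0.6, 0.2, 0.1]
gt = [1.0, 0.0, 1.0, 0.0]
mae = mean_absolute_error(pred, gt)
fm = f_measure(pred, gt)
```

Lower MAE and higher F-measure indicate better agreement between the predicted map and the ground truth.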

Acknowledgements

Thanks to the authors of U2Net and DINO for sharing their code. Most of the code is borrowed from the U2Net and DINO methods.

https://github.com/xuebinqin/U-2-Net
https://github.com/facebookresearch/dino
https://github.com/lucidrains/vit-pytorch/tree/main/vit_pytorch

License:MIT License