attention multimodal rgbd-saliency-detection rgbd-salient-object-detection rgbd-sod rgbt-saliency-detection rgbt-salient-object-detection rgbt-sod saliency-detection salient-object-detection sod transformer

UniSOD

This repository provides the source code and results for the paper entilted "Unified-modal Salient Object Detection via Adaptive Prompt Learning".

arXiv version: https://arxiv.org/abs/2311.16835.

Thank you for your attention.

Citing our work

If you think our work is helpful, please cite

@article{wang2023unified,
  title={Unified-modal Salient Object Detection via Adaptive Prompt Learning},
  author={Wang, Kunpeng and Li, Chenglong and Tu, Zhengzheng and Luo, Bin},
  journal={arXiv preprint arXiv:2311.16835},
  year={2023}
}

Overview

Framework

Baseline SOD framework

RGB SOD Performance

RGB-D SOD Performance

RGB-T SOD Performance

Predictions

The predicted RGB, RGB-D, and RGB-T saliency maps can be found here. [baidu pan fetch code: vpvt]

Pretrained Models

The pretrained parameters of our models can be found here. [baidu pan fetch code: o8yx]

Usage

Requirement

Download the datasets for training and testing from here. [baidu pan fetch code: 2sfr]
Download the pretrained parameters of the backbone from here. [baidu pan fetch code: mad3]
Organize dataset directories for pre-training and fine-tuning.
Create directories for the experiment and parameter files.
Please use conda to install torch (1.12.0) and torchvision (0.13.0).
Install other packages: pip install -r requirements.txt.
Set your path of all datasets in ./options.py.

Pre-train

python -m torch.distributed.launch --nproc_per_node=2 --master_port=2024 train_parallel.py

Fine-tuning

python -m torch.distributed.launch --nproc_per_node=2 --master_port=2024 train_parallel_multi.py

Test

python test_produce_maps.py

Acknowledgement

The implement of this project is based on the following link.

Contact

If you have any questions, please contact us (kp.wang@foxmail.com).

About

Unified-modal Salient Object Detection via Adaptive Prompt Learning

attention multimodal rgbd-saliency-detection rgbd-salient-object-detection rgbd-sod rgbt-saliency-detection rgbt-salient-object-detection rgbt-sod saliency-detection salient-object-detection sod transformer

Languages

Language:Python 100.0%