Visual Attention Consistency

This repository is for the following paper:

@InProceedings{Guo_2019_CVPR,
author = {Guo, Hao and Zheng, Kang and Fan, Xiaochuan and Yu, Hongkai and Wang, Song},
title = {Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}

Note:

This is just a preliminary version for early access. I will complete it as soon as I can.

To run this code, you need to specify the WIDER_DATA_DIR (WIDER dataset "Image"), WIDER_ANNO_DIR (WIDER dataset annotations) in "configs.py"and the argument of model_dir (path to save checkpoints). Then, run the command (with PyTorch installed): python main.py.

Select the checkpoint with the best mAP.

You can also evaluate the predictions with code at: https://github.com/zhufengx/SRN_multilabel, which would produce a slightly better (~0.1%) performance.

Datasets

WIDER Attribute Dataset: http://mmlab.ie.cuhk.edu.hk/projects/WIDERAttribute.html
PA-100K: (will be supported later)
MS-COCO: (will be supported later)

About

Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification

Languages

Language:Python 100.0%