HXZhong1997 / impr-RT-VITAL

Improved Real-time Visual Tracking via Adversarial Learning, icassp 2020

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

impr-RT-VITAL: Improved Real-Time Visual Tracking via Adversarial Learning

Created by Haoxiang Zhong.

Introduction

Improved RT-VITAL is deep learning based tracking algorithm. And our work is greatly inspired by RT-MDNet as well as VITAL.
The pipeline of this tracker is based on RT-MDNet.

Visual Results of Our Tracker

System Requirements

This code is tested on 64 bit Linux (Ubuntu 16.04 LTS).

Prerequisites

  1. Python 2.7 (other versions of Python2 may also work)
  2. PyTorch (>= 0.2.1)
  3. For GPU support, a GPU (~2GB memory for test) and CUDA toolkit.
  4. Training Dataset (ImageNet-Vid) if needed.
  5. Testing Dataset (e.g. OTB100, GOT-10k, ...)

Online Tracking

Pretrained Model

Our work only modifies the tracker during online tracking. Pretrained model is provided by RT-MDNet at: RT-MDNet-ImageNet-pretrained.

Other crucial models or files from our work is provided here: Baidu Yun, extraction code:en3j.

Of all the models rt-mdnet.pth and G_sample_list_2.mat are crucial, other files can be generated during running the code.
If you want to save your time, g_model0.003.pth may also be downloaded.
Downloading feat_*.npy will save your time during feature extraction while pretraining net G.

Please put all the files and models in ./models/

Test

python2 Run.py  #Run the trakcer, and it will save a pickle file at ./result
cd ./result
python save_txt.py   #Decode the pickle file into txt files

We do not recommend test our tracker on VOT benchmark, because these data are used for model learning during initialization.

Learning RT-MDNet

Please refer to RT-MDNet for more details on traning.

Citation

If you're using this code for a publication, please cite our paper and RT-MDNet

@INPROCEEDINGS{impr-RT-VITAL,  
author={H. {Zhong} and X. {Yan} and Y. {Jiang} and S. {Xia}},
booktitle={ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},   
title={Improved Real-Time Visual Tracking via Adversarial Learning},
year={2020},  
volume={},  
number={},  
pages={1853-1857},}
@InProceedings{rtmdnet,
author = {Jung, Ilchae and Son, Jeany and Baek, Mooyeol and Han, Bohyung},
title = {Real-Time MDNet},
booktitle = {European Conference on Computer Vision (ECCV)},
month = {Sept},
year = {2018}
}

About

Improved Real-time Visual Tracking via Adversarial Learning, icassp 2020


Languages

Language:Python 75.6%Language:Cuda 15.6%Language:C 7.8%Language:C++ 0.8%Language:Makefile 0.2%