wangxiao5791509 / VisEvent_SOT_Benchmark

[IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.

Home Page:https://sites.google.com/view/viseventtrack/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The First Large-scale Benchmark Dataset for Reliable Object Tracking by fusing RGB and Event Cameras


ProjectarXivBaselinesDemoVideoTutorial

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows[J] arXiv preprint arXiv:2108.05015, 2021, Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, Yaowei Wang, Yonghong Tian, Feng Wu

News:

  • 🔥 [2024.03.12] A New Long-term RGB-Event based Visual Object Tracking Benchmark Dataset (termed FELT) is available at [Paper] [Code] [DemoVideo]

  • 🔥 [2023.09.27] A High Definition (HD) Event based Visual Object Tracking Benchmark Dataset (termed EventVOT) is available at [arXiv] [Github]

  • 🔥 [2023.09.20] VisEvent is accepted by IEEE Transactions on Cybernetics [IEEE]

  • 🔥 [2022.11.27] Due to some aedat4 files are missing, you can use part of this dataset according to this list: [aedat4HARDVS_list]

  • 🔥 [2022.11.23] A new Color Frame + Event Stream based Tracking dataset COESOT is available at [arXiv] [GitHub]

  • 🔥 [2022.10.19] Event camera (DVS, Spike) based Papers Published on Top International Conference [ Event_Camera_in_Top_Conference]

  • [2022.07.14] Update the VOT2019-RGB-Event dataset used in our paper [BaiduYun].

  • [2022.02.10] Update paper list for event camera based tracking [Event_Tracking_Paper_List].

  • [2021.10.13] Update the links for onedrive.

Introduction

Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency. In practice, the visible cameras can better perceive texture details and slow motion, while event cameras can be free from motion blurs and have a larger dynamic range which enables them to work well under fast motion and low illumination. Therefore, the two sensors can cooperate with each other to achieve more reliable object tracking. In this work, we propose a large-scale Visible-Event benchmark (termed VisEvent) due to the lack of a realistic and scaled dataset for this task. Our dataset consists of 820 video pairs captured under low illumination, high speed, and background clutter scenarios, and it is divided into a training and a testing subset, each of which contains 500 and 320 videos, respectively. Based on VisEvent, we transform the event flows into event images and construct more than 30 baseline methods by extending current single-modality trackers into dual-modality versions. More importantly, we further build a simple but effective tracking algorithm by proposing a cross-modality transformer, to achieve more effective feature fusion between visible and event data. Extensive experiments on the proposed VisEvent dataset, and two simulated datasets (i.e., OTB-DVS and VOT-DVS), validated the effectiveness of our model.

Demo Video

A demo video for the VisEvent can be found by cliking the image below:

VisEvent_DemoVideo

VisEvent_examples

Video Tutorial

The tutorial for this paper can be found by cliking the image below:

VisEvent_DemoVideo

How to Download VisEvent Dataset?

  • BaiduYun (about 216 GB):
Link:https://pan.baidu.com/s/1VhdORXT4OvG8TUESfDZHfw 
Password:AHUE 

Links for VOT2019-RGB-Event dataset (36.3 GB) used in our paper

  • BaiduYun:
Link:https://pan.baidu.com/s/1cS79d1dJFD8mF0AwuGG5Og   Password: AHUT 
  • Googledrive: Click [here]

Baseline Methods

The source code of baseline trackers by fusing dual-modalities can be found at: [RGB-DVS-SOT-Baselines].

VisEvent_Baselines

How to load the aedat4 file?

We provide a python script (read_aedat4.py) to load the aedat4 file. You can download one aedat4 file to feel the data style: [dvSave-2021_12_21_16_32_19.aedat4]

Here is an example:

1). Install the required toolkit [dv-gui]. Please use different scripts for various Ubuntu OS:

Ubuntu 20.04:

sudo add-apt-repository ppa:inivation-ppa/inivation
sudo apt-get update
sudo apt-get install dv-gui

Ubuntu 18.04:

sudo add-apt-repository ppa:ubuntu-toolchain-r/test
sudo add-apt-repository ppa:inivation-ppa/inivation-bionic
sudo apt-get update
sudo apt-get install dv-gui

Ubuntu 16.04:

sudo add-apt-repository ppa:ubuntu-toolchain-r/test 
sudo add-apt-repository ppa:lkoppel/opencv 
sudo add-apt-repository ppa:janisozaur/cmake-update 
sudo add-apt-repository ppa:inivation-ppa/inivation-xenial 
sudo apt-get update 
sudo apt-get install dv-gui

Other softwares:

pip install dv
pip install opencv-python numpy pillow -i https://pypi.tuna.tsinghua.edu.cn/simple 

[ref] https://gitlab.com/inivation/dv/dv-python

2). Open your terminal and run the script:

python read_aedat4.py

Evaluation ToolKit

Only matlab version is available.

1. Download this github:

git clone https://github.com/wangxiao5791509/VisEvent_SOT_Benchmark

2. Download the tracking results of our benchmark: [GoogleDrive (185MB)]

unzip tracking_results_VisEvent_SOT_benchmark.zip, and put it into the folder "tracking_results". 

unzip the "annos.zip" in the folder "annos"

3. Open your matlab, and run the script "Evaluate_VisEvent_SOT_benchmark.m". Wait and check the final evaluated figures

More Related Materials

License

This project is under the MIT license. See [license] for details.

📃 BibTex:

If you find this work useful for your research, please cite the following papers:

@article{wang2021viseventbenchmark,
  title={VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows},
  author={Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, Yaowei Wang, Yonghong Tian, Feng Wu},
  journal={arXiv:2108.05015},
  year={2021}
}

If you have any questions about this work, please submit an issue or contact me via Email: wangxiaocvpr@foxmail.com, xiaowang@ahu.edu.cn, or Wechat: wangxiao5791509. Thanks for your attention!

About

[IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.

https://sites.google.com/view/viseventtrack/

License:MIT License


Languages

Language:Python 53.6%Language:MATLAB 46.4%