MemX-Research / DogsView

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DogsView

This repository contains the DogsView dataset described in the paper "Unveiling Causal Attention in Dogs’ Eyes with Smart Eyewear".

If you use this dataset in your work, please cite our paper:

@article{10.1145/3569490,
  author = {Zhao, Yingying and Li, Ning and Pan, Wentao and Wang, Yujiang and Dong, Mingzhi and Ding, Xianghua (Sharon) and Lv, Qin and Dick, Robert P. and Li, Dongsheng and Yang, Fan and Lu, Tun and Gu, Ning and Shang, Li},
  title = {Unveiling Causal Attention in Dogs' Eyes with Smart Eyewear},
  year = {2023},
  issue_date = {December 2022},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  volume = {6},
  number = {4},
  url = {https://doi.org/10.1145/3569490},
  doi = {10.1145/3569490},
  journal = {Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.},
  month = {jan},
  articleno = {199},
  numpages = {33}
}

@article{chang2021memx,
  title={MemX: An Attention-Aware Smart Eyewear System for Personalized Moment Auto-capture},
  author={Chang, Yuhu and Zhao, Yingying and Dong, Mingzhi and Wang, Yujiang and Lu, Yutian and Lv, Qin and Dick, Robert P and Lu, Tun and Gu, Ning and Shang, Li},
  journal={Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies},
  volume={5},
  number={2},
  pages={1--23},
  year={2021},
  publisher={ACM New York, NY, USA}
}

Dataset

The DogsView dataset was collected in various open-world scenarios from five dog subjects. All videos, including eye/scene videos, have a spatial resolution of 1920x1080 and a sampling rate of 30fps. Currently, we have 63 pairs of eye/scene video clips. A detailed description of the DogsVies dataset is given in Section 4 of the paper.

The dataset can be downloaded here.

Data Format

The DogsView dataset has 63 folders. Each folder is named "scenario_x_dog_y_z", where x is the number of open-world scenarios, y is the number of dog subjects, and z is the repeated times for dog y in scenario x. Each folder contains three videos and two CSV files described as follows:

eye video.mp4: This video contains eye videos recorded under infrared recording conditions.

world video.mp4: This video contains scene videos from dog subjects' filed-of-view.

gazed world video.mp4: This video is the scene video marked with a dog subject's gaze point positions.

gaze_positions.csv: This CSV file contains the eye-tracking results.

It contains the following attributes:

world_index: The frame number of the scene video.

norm_pos_x and norm_pos_y: The gaze position in the space of the scene frame. They are floating point numbers with origin point (0,0) at the bottom left and ending point (1,1) at the top right.

auto_annotation.csv: This CSV file contains the annotations, which the CANINE model automatically obtains for each attentive session in a scene video.

attentive session start frame: The starting frame number in an attention session.

attentive session end frame: The ending frame number in an attention session.

dog behavior: The dog subject's behavior in an attention session.

human action: The human subject's action (or called communicative cues) that causes the dog's responsive behaviors.

About