MSGazeNet

This is the official PyTorch (version 1.9.0) implementation of our work "Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning", accepted in IEEE Transactions on Artificial Intelligence.

Paper: https://ieeexplore.ieee.org/document/10438413

This repository contains the source code for our paper, which uses the following datasets:

  • MPIIGaze: This dataset provides eye images with the corresponding head pose and gaze annotations from 15 subjects. It was collected in an unconstrained manner over the course of several months. The standard evaluation protocol is leave-one-subject-out (LOSO); the protocols for all three datasets are sketched after this list.
  • Eyediap: This dataset was collected in a laboratory environment from 16 subjects, each of whom took part in three different sessions. The standard evaluation protocol is 5-fold validation. In this work, we use the VGA videos from the continuous/discrete screen target sessions (CS/DS).
  • UTMultiview: This dataset was collected from 50 subjects in a laboratory setup using 8 cameras, which produced multiview eye image samples; the corresponding gaze labels were also recorded. The standard evaluation protocol is 3-fold validation.
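
A minimal sketch of these evaluation protocols, with illustrative subject indices and scikit-learn as an assumed helper (the repository's own split logic may differ):

    from sklearn.model_selection import KFold

    # MPIIGaze: leave-one-subject-out over 15 subjects
    subjects = list(range(15))
    for test_subject in subjects:
        train_subjects = [s for s in subjects if s != test_subject]
        ...  # train on train_subjects, evaluate on test_subject

    # Eyediap: 5-fold cross-validation over 16 subjects
    # (UTMultiview: 3-fold follows the same pattern with n_splits=3)
    kfold = KFold(n_splits=5, shuffle=True, random_state=0)
    for fold, (train_idx, test_idx) in enumerate(kfold.split(list(range(16)))):
        ...  # train/evaluate per fold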

Prerequisites

Please follow the steps below to train MSGazeNet:

  1. Create a virtual environment with required libraries

    To create a virtual environment via python:

    python -m venv <environment name>
    source <environment name>/bin/activate   # on Linux/macOS

    To create a virtual environment via anaconda:

    conda create -n <environment name>
    conda activate <environment name>

    Install the requirements:

    pip install -r requirements.txt
    
  2. Download all the datasets and preprocess them following Zhang et al. [1]
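
    The preprocessing of Zhang et al. [1] warps each frame so that a virtual camera looks straight at the eye at a fixed distance. Below is a minimal sketch of that normalization, assuming an undistorted frame, a 3D eye centre in camera coordinates, a head rotation matrix, and the camera intrinsics; names and constants are illustrative, not this repository's preprocessing code:

    import cv2
    import numpy as np

    def normalize_eye(frame, eye_center, head_rot, cam_mat,
                      focal_norm=960.0, dist_norm=600.0, size=(60, 36)):
        # Intrinsics of the virtual (normalized) camera
        cam_norm = np.array([[focal_norm, 0.0, size[0] / 2],
                             [0.0, focal_norm, size[1] / 2],
                             [0.0, 0.0, 1.0]])
        dist = np.linalg.norm(eye_center)         # true eye-to-camera distance
        forward = eye_center / dist               # z-axis looks straight at the eye
        down = np.cross(forward, head_rot[:, 0])  # use head x-axis to cancel roll
        down /= np.linalg.norm(down)
        right = np.cross(down, forward)
        right /= np.linalg.norm(right)
        rot = np.vstack([right, down, forward])   # rotation into the virtual camera
        scale = np.diag([1.0, 1.0, dist_norm / dist])
        # Single perspective warp from the original view to the normalized view
        warp = cam_norm @ scale @ rot @ np.linalg.inv(cam_mat)
        return cv2.warpPerspective(frame, warp, size)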

  3. Place all the datasets into the 'Data' directory according to the following structure:

    Data
    ├───eyediap
    │   ├───Image
    │   └───Label
    ├───mpiigaze
    │   ├───Image
    │   └───Label
    └───utmultiview
        ├───Image
        └───Label
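
    A quick, illustrative sanity check that the layout is in place (the helper and root path are assumptions, not part of the repository):

    from pathlib import Path

    def check_layout(root="Data"):
        # Every dataset folder should contain Image/ and Label/ subfolders
        for dataset in ("eyediap", "mpiigaze", "utmultiview"):
            for sub in ("Image", "Label"):
                path = Path(root) / dataset / sub
                print(f"{path}: {'ok' if path.is_dir() else 'MISSING'}")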
    
  4. Train the Anatomical Eye Region Isolation (AERI) network

    python AERI/train_aeri.py
    

    This trains the AERI network, which is later used within the framework for gaze estimation. The trained weights are stored in the 'weights/aeri_weights/' folder, which is created when this script is executed.
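
    The sketch below shows one way the pretrained weights could then be reloaded and frozen for the gaze estimation stage; the import path, class name, and checkpoint filename are hypothetical:

    import torch
    from AERI.model import AERINet  # hypothetical import path and class name

    aeri = AERINet()
    state = torch.load("weights/aeri_weights/aeri_best.pt",  # illustrative name
                       map_location="cpu")
    aeri.load_state_dict(state)
    aeri.eval()
    for p in aeri.parameters():  # freeze AERI while training the gaze network
        p.requires_grad = False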

  5. Train the gaze estimation network using the pretrained weights of the AERI network

    For LOSO experiment on MPIIGaze:

    python gaze_estimation/mpii_loso.py
    

    For 5-fold experiment on Eyediap:

    python gaze_estimation/eyediap_5fold.py
    

    For 3-fold experiment on UTMultiview:

    python gaze_estimation/utm_3fold.py
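
    Performance on these benchmarks is conventionally reported as the mean angular error between the predicted and ground-truth 3D gaze vectors; a minimal, illustrative helper (not taken from this repository):

    import numpy as np

    def mean_angular_error_deg(pred, gt):
        # Mean angle (degrees) between rows of predicted/ground-truth gaze vectors
        pred = pred / np.linalg.norm(pred, axis=-1, keepdims=True)
        gt = gt / np.linalg.norm(gt, axis=-1, keepdims=True)
        cos = np.clip(np.sum(pred * gt, axis=-1), -1.0, 1.0)
        return np.degrees(np.arccos(cos)).mean()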
    

Citation

@ARTICLE{10438413,
  author={Mahmud, Zunayed and Hungler, Paul and Etemad, Ali},
  journal={IEEE Transactions on Artificial Intelligence}, 
  title={Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning}, 
  year={2024},
  volume={},
  number={},
  pages={1-15},
  keywords={Estimation;Synthetic data;Head;Iris;Feature extraction;Training;Lighting;Gaze estimation;eye region segmentation;multistream network;deep neural network;domain randomization;transfer learning},
  doi={10.1109/TAI.2024.3366174}}

Contact

Please email your questions or concerns to zunayed.mahmud@queensu.ca.

References

[1] X. Zhang, Y. Sugano, and A. Bulling, “Revisiting data normalization for appearance-based gaze estimation,” in Proceedings of the 2018 ACM Symposium on Eye Tracking Research & Applications, 2018, pp. 1–9.

License

MIT License

