revanurambareesh/instantaneous_transformer

deep-learning computer-vision machine-learning research transformers convolutional-neural-networks human-computer-interaction physiology pytorch instantaneous-transformer

Instantaneous Physiological Estimation using Video Transformers (2022)

Ambareesh Revanur, Ananyananda Dasari, Conrad Tucker, Laszlo Jeni

Download Paper, Download V4V Dataset

Summary

Video-based physiological signal estimation has been limited primarily to predicting episodic scores in windowed intervals. While these intermittent values are useful, they provide an incomplete picture of patients' physiological status and may lead to late detection of critical conditions. We propose a video Transformer for estimating instantaneous heart rate and respiration rate from face videos. Physiological signals are typically confounded by alignment errors in space and time. To overcome this, we formulated the loss in the frequency domain.

Code

In this work, we followed the below folder structure,

Instantaneous_Transformer
  |- code/inst_transformer
  |     |- dataloader.py
  |     |- globals.py
  |     |- main.py
  |     |- ...
  |
  |- data/
  |     |- Training/
  |     |- Validation/
  |     | ...
  |- weights/
  |- summaries/

Navigate into code/inst_transformer and run python main.py --args. See below for list of arguments.

python main.py --name vid_tra_1 --seqlen 100 --gpu 0 --phys HR
python main.py --name vid_tra_2 --seqlen 1000 --gpu 0 --phys RR --batch_size 4 --numlayer 1

The loss plots should be saved in summaries folder.

Training curves

The loss plots will look like below:

Citation

If you found this code repository useful, please consider citing the following works.

@article{revanur2022transformers,
  title={Instantaneous Physiological Estimation using Video Transformers},
  author={Revanur, Ambareesh and Dasari, Ananyananda and Tucker, Conrad and Jeni, L{\'a}szl{\'o} A},
  journal={Multimodal AI in Healthcare: A Paradigm Shift in Health Intelligence, Studies in Computational Intelligence},
  year={2022}
}

@inproceedings{revanur2021first,
  title={The first vision for vitals (v4v) challenge for non-contact video-based physiological estimation},
  author={Revanur, Ambareesh and Li, Zhihua and Ciftci, Umur A and Yin, Lijun and Jeni, L{\'a}szl{\'o} A},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={2760--2767},
  year={2021}
}

Contact

Please use the issues tab for any queries or contact ambareesh (dot) r (at) gmail (dot) com

Credits

We thank authors of RemotePPG and imresize for open-sourcing their code.

About

Official repo of Instantaneous Transformers for Video based Physiology estimation (Accepted to AAAI workshop and Springer Studies in Computational Intelligence)

deep-learning computer-vision machine-learning research transformers convolutional-neural-networks human-computer-interaction physiology pytorch instantaneous-transformer

Languages

Language:Python 100.0%