GuangmingZhu / Gestureness

Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Gestureness

(Please read the unpublished Manuscript.pdf for details)

Prerequisites

  1. Tensorflow-0.11
  2. Tensorlayer (commit ba30379f1b86f930d6e86e1c8db49cbd2d9aa314)
    git clone https://github.com/zsdonghao/tensorlayer.git
    git checkout ba30379f1b86f930d6e86e1c8db49cbd2d9aa314

The original Tensorlayer does not support the convolutional LSTM, so the tensorlayer/layers.py needs to be replaced with tensorlayer-layers.py.

How to use the code

Prepare the data

  1. Convert each video files into images using extract_frames.sh in the dataset_splits/video2image.tar.gz. Before running extract_frames.sh, you should change the ROOTDIR in extract_frames.sh, so that IsoGD_phase_1 and IsoGD_phase_2 do exist under $ROOTDIR.
  2. Replace the path "/ssd/dataset" in the files under "dataset_splits" with the path "$ROOTDIR"
  3. run check_files.py to make sure all necessary image files do exist

Training Stage

  1. Use training_*.py to finetune the networks for different modalities. Please change os.environ['CUDA_VISIBLE_DEVICES'] according to your workstation.
  2. You can run the three training_*.py on three TITAN X GPUs simultaneously.

Citation

Please cite the following paper if you feel this repository useful.
http://ieeexplore.ieee.org/abstract/document/7880648/ http://openaccess.thecvf.com/content_ICCV_2017_workshops/w44/html/Zhang_Learning_Spatiotemporal_Features_ICCV_2017_paper.html

@article{ZhuTMM2018,
  title={Continuous Gesture Segmentation and Recognition using 3DCNN and Convolutional LSTM},
  author={Liang Zhang and Guangming Zhu and Peiyi Shen and Juan Song and Syed Afaq Shah and Mohammed Bennamoun},
  journal={IEEE Transactions on Multimedia},
  year={2018}
}
@article{ZhuICCV2017,
  title={Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM for Gesture Recognition},
  author={Liang Zhang and Guangming Zhu and Peiyi Shen and Juan Song and Syed Afaq Shah and Mohammed Bennamoun},
  journal={ICCV},
  year={2017}
}
@article{Zhu2017MultimodalGR,
  title={Multimodal Gesture Recognition Using 3-D Convolution and Convolutional LSTM},
  author={Guangming Zhu and Liang Zhang and Peiyi Shen and Juan Song},
  journal={IEEE Access},
  year={2017},
  volume={5},
  pages={4517-4524}
}

Contact

For any question, please contact

  gmzhu@xidian.edu.cn

About

Learning Spatiotemporal Features using 3DCNN and Convolutional LSTM


Languages

Language:Python 100.0%Language:Shell 0.0%