semonemo / STAD

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

STAD: Stable Video Depth Estimation (ICIP 2021)

Alt text

Getting Started

This code has been developed under Anaconda(Python 3.6), Pytorch 1.6.0, Torchvision 0.7.0 and CUDA 10.1.

  1. Please install following environments:

    # create a new environment if needed
    conda create --name stad
    conda activate stad
    
    # install the the dependencies 
    pip install -r requirements.txt
    
  2. Download KITTI data

    (1) Download raw data

    (2) Download ground truth data

  3. Download pretrained weights

Train on KITTI

In the code folder, run

sh local_train_kitti.sh

You need to change dataset_path, exp_name and loss_type correctly.

Please set parameters as follows:

Models exp_name loss_type
Neural-RGBD ver0-nrgbd NLL
Neural-RGBD with scale-invariant loss ver0-nrgbd_silog silog
STAD-frame (Ours) ver1-per_frame_silog silog
STAD (Ours) ver4-aggr_silog silog

Alt text

Test on KITTI with given camera pose

In this case we assume the camera poses are given with the dataset.

In the code folder, run

sh local_test.sh

You need to change dataset_path and model_path correctly.

Contact

If you have any questions, please contact the author Hyunmin Lee<hyunmin057@gmail.com>.

Acknowledgement

Portions of the source code (e.g., training pipeline, argument parser, and logger) are from NVIDIA, Neural-RGBD

About

License:Other


Languages

Language:Python 99.3%Language:Shell 0.7%