l0stpenguin / DenseDepthResNet

Original DenseDepth work by Ibraheem Alhashim and Peter Wonka.

The original code is available at the Dense Depth Original Code repository. This version has been modified by me to:

  • Add support for different encoders (a rough sketch of the idea follows this list)
  • Test the effect of decreasing decoder length
  • Create cool visualizations by testing the model on self-collected images and videos
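
The encoder-swap idea from the first bullet can be illustrated with a short sketch. This is only an assumed pattern for selecting a pretrained Keras backbone by name; build_encoder and its arguments are illustrative, not this repo's actual API.

# Illustrative only: selecting a pretrained Keras encoder backbone by name.
# build_encoder and its arguments are hypothetical, not this repo's API.
from keras.applications import DenseNet169, DenseNet121, ResNet50

def build_encoder(name='densenet169', input_shape=(480, 640, 3)):
    """Return a pretrained backbone without its classification head."""
    backbones = {
        'densenet169': DenseNet169,   # default encoder
        'densenet121': DenseNet121,   # smaller DenseNet ("--dnetVersion small")
        'resnet50': ResNet50,         # "--resnet"
    }
    return backbones[name](include_top=False, weights='imagenet',
                           input_shape=input_shape)

encoder = build_encoder('resnet50')  # swap encoders by changing the name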

Requirements

  • This code was tested with Keras 2.2.4, TensorFlow 1.13, and CUDA 9.0 on a machine with an NVIDIA Titan V and 16 GB+ RAM, running Windows 10 or Ubuntu 16.
  • Other required packages: keras, pillow, matplotlib, scikit-learn, scikit-image, opencv-python, pydot, and GraphViz.

Data

  • NYU Depth V2 (50K) (4.1 GB): You don't need to extract the dataset; the code loads the entire zip file into memory during training.
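
As a rough sketch of that in-memory loading, the whole zip can be read once and indexed by file name. The archive name nyu_data.zip follows the original DenseDepth code and is an assumption here.

# Sketch of loading the dataset zip fully into memory without extracting it.
# The file name nyu_data.zip is an assumption based on the original code.
from io import BytesIO
from zipfile import ZipFile

def load_zip_to_memory(zip_path='nyu_data.zip'):
    """Map each file name inside the archive to its raw bytes."""
    with open(zip_path, 'rb') as f:
        archive = ZipFile(BytesIO(f.read()))
    return {name: archive.read(name) for name in archive.namelist()}

data = load_zip_to_memory()
print(len(data), 'files loaded into memory')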

Download a pretrained model from the original paper
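
Loading such a checkpoint in Keras typically looks like the sketch below; DenseDepth defines custom layers and losses, so the exact custom_objects entries depend on the repo's code and are left as an assumption here.

# Sketch of loading a downloaded pretrained checkpoint (e.g. nyu.h5).
# custom_objects must be filled with the repo's custom layers/losses;
# their names are not reproduced here.
from keras.models import load_model

custom_objects = {}  # e.g. the custom upsampling layer and loss, if required
model = load_model('nyu.h5', custom_objects=custom_objects, compile=False)
model.summary()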

Training with DenseNet 169 encoder

  • Train from scratch:
python train.py --data nyu --bs 5 --full 
  • Train from a previous checkpoint:
python train.py --data nyu --bs 5 --full --checkpoint ./models/1557344811-n10138-e20-bs5-lr0.0001-densedepth_nyu/weights.04-0.12.h5

Training with DenseNet 121 encoder

python train.py --data nyu --bs 5 --full --dnetVersion small

Training with ResNet50 encoder

python train.py --data nyu --bs 5 --name resnet50_nyu --full --resnet
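
The flags in the commands above suggest an argument parser roughly like the following; this is a reconstruction for reference, and the real defaults and help text in train.py may differ.

# Assumed reconstruction of train.py's CLI flags; defaults are illustrative.
import argparse

parser = argparse.ArgumentParser(description='DenseDepth training')
parser.add_argument('--data', default='nyu', help='training dataset')
parser.add_argument('--bs', type=int, default=5, help='batch size')
parser.add_argument('--full', action='store_true', help='train on the full 50K set')
parser.add_argument('--name', default='densedepth_nyu', help='run name used under ./models/')
parser.add_argument('--checkpoint', default='', help='path to a .h5 checkpoint to resume from')
parser.add_argument('--dnetVersion', default='large', help="'small' selects the DenseNet 121 encoder")
parser.add_argument('--resnet', action='store_true', help='use a ResNet50 encoder instead of DenseNet')
args = parser.parse_args()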

Evaluation

  • Download, but don't extract, the ground truth test data from here (1.4 GB). Then call evaluate.py with your model checkpoint:
python evaluate.py --model ./models/1557483797-n10138-e20-bs5-lr0.0001-densedepth_nyu/weights.06-0.12.h5
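
evaluate.py reports the standard monocular-depth metrics from the paper (threshold accuracies, absolute relative error, RMSE, log10). A NumPy sketch of those metrics, not the exact code in evaluate.py, is:

# Standard depth-estimation metrics; a reference sketch, not evaluate.py itself.
import numpy as np

def compute_errors(gt, pred):
    """gt and pred: positive depth arrays of the same shape (in metres)."""
    thresh = np.maximum(gt / pred, pred / gt)
    d1 = (thresh < 1.25).mean()
    d2 = (thresh < 1.25 ** 2).mean()
    d3 = (thresh < 1.25 ** 3).mean()
    abs_rel = np.mean(np.abs(gt - pred) / gt)
    rmse = np.sqrt(np.mean((gt - pred) ** 2))
    log10 = np.mean(np.abs(np.log10(gt) - np.log10(pred)))
    return d1, d2, d3, abs_rel, rmse, log10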

Visualizations on images and videos outside the NYU Depth V2 data

Visualization on images outside the test set - displays image + predicted depth map

Put the images in the my_examples folder:

python test.py --model ./models/1557483797-n10138-e20-bs5-lr0.0001-densedepth_nyu/weights.06-0.12.h5
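
Under the hood this amounts to predicting a depth map per image and plotting it next to the input. A minimal sketch, assuming a 640x480 input resolution and a checkpoint that loads with empty custom_objects (the real model may need the repo's custom layers):

# Sketch only: predict depth for each image in my_examples/ and show it
# beside the input. Paths, resolution and custom_objects are assumptions.
import glob
import numpy as np
from PIL import Image
import matplotlib.pyplot as plt
from keras.models import load_model

model = load_model('weights.h5', custom_objects={}, compile=False)

for path in sorted(glob.glob('my_examples/*')):
    img = np.asarray(Image.open(path).convert('RGB').resize((640, 480)),
                     dtype=np.float32) / 255.0
    depth = model.predict(img[None, ...])[0, :, :, 0]
    fig, (ax1, ax2) = plt.subplots(1, 2)
    ax1.imshow(img); ax1.set_title('input')
    ax2.imshow(depth, cmap='plasma'); ax2.set_title('predicted depth')
    plt.show()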

Visualization on videos outside the test set - displays image + predicted depth map

Pass your model checkpoint and the input video. The results are stored in the image_results folder:

python test_video.py --model ./models/1557395533-n10138-e20-bs5-lr0.0001-densedepth_nyu/weights.03-0.12.h5 --input test_video.MOV
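
The video path works frame by frame; a rough OpenCV sketch of the idea, with output naming, resolution, and colour mapping as assumptions rather than test_video.py's exact behaviour:

# Sketch only: per-frame depth prediction on a video, frames written to
# image_results/. File names, resolution and colour map are assumptions.
import os
import cv2
import numpy as np
from keras.models import load_model

model = load_model('weights.h5', custom_objects={}, compile=False)
os.makedirs('image_results', exist_ok=True)

cap = cv2.VideoCapture('test_video.MOV')
idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    rgb = cv2.cvtColor(cv2.resize(frame, (640, 480)), cv2.COLOR_BGR2RGB)
    depth = model.predict(rgb[None, ...].astype(np.float32) / 255.0)[0, :, :, 0]
    vis = cv2.applyColorMap((255 * depth / depth.max()).astype(np.uint8),
                            cv2.COLORMAP_JET)
    cv2.imwrite('image_results/frame_%05d.png' % idx, vis)
    idx += 1
cap.release()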

Reference

Corresponding paper to cite:

@article{Alhashim2018,
  author    = {Ibraheem Alhashim and Peter Wonka},
  title     = {High Quality Monocular Depth Estimation via Transfer Learning},
  journal   = {arXiv e-prints},
  volume    = {abs/1812.11941},
  year      = {2018},
  url       = {https://arxiv.org/abs/1812.11941},
  eid       = {arXiv:1812.11941},
  eprint    = {1812.11941}
}
