DCVSMNet: Double Cost Volume Stereo Matching Network

Mahmoud Tahmasebi* (mahmoud.tahmasebi@research.atu.ie), Saif Huq, Kevin Meehan, Marion McAfee

Paper

Performance on KITTI raw dataset

Performance on Jetson AGX Orin for low resolution input

SOTA results.

The results on SceneFlow

The results on KITTI dataset using RTX 3090.

Method	KITTI 2012 (3-noc)	KITTI 2012 (3-all)	KITTI 2015 (D1-bg)	KITTI 2015 (D1-fg)	KITTI 2015 (D1-all)	Runtime (ms)
CGI-Stereo	1.41 %	1.76 %	1.66 %	3.38 %	1.94 %	29
CoEx	1.55 %	1.93 %	1.79 %	3.82 %	2.13 %	33
BGNet+	1.62 %	2.03 %	1.81 %	4.09 %	2.19 %	35
Fast-ACVNet+	1.45 %	1.85 %	1.70 %	3.53 %	2.01 %	45
HITNet	1.41 %	1.89 %	1.74 %	3.20 %	1.98 %	54
DispNetC	4.11 %	4.65 %	2.21 %	6.16 %	4.43 %	60
AANet	1.91 %	2.42 %	1.99 %	5.39 %	2.55 %	62
JDCNet	1.64 %	2.11 %	1.91 %	4.47 %	2.33 %	80
DCVSMNet	1.30 %	1.67 %	1.60 %	3.33 %	1.89 %	67

The results on SceneFlow dataset based on the selected cost volumes.

Group-wise correlation	Norm correlation	Concatenation	Group-wise substraction	EPE[px]	D1-all[%]	Runtime (ms)
✓	✓			0.60	2.11	67
✓		✓		0.59	2.05	75
✓			✓	0.59	2.06	89
	✓	✓		0.72	2.59	60
	✓		✓	0.65	2.28	74
		✓	✓	0.69	2.38	81

How to use

Environment

NVIDIA RTX 3090
Python 3.11
Pytorch 2.0.0

Install

Create a virtual environment and activate it.

conda create -n DCVSMNet python=3.11
conda activate DCVSMNet

Dependencies

conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch -c nvidia
pip install opencv-python
pip install scikit-image
pip install tensorboard
pip install matplotlib 
pip install tqdm
pip install timm==0.5.4

Data Preparation

Train

Use the following command to train DCVSMNet on SceneFlow. First training,

python train_sceneflow.py --logdir ./checkpoints/sceneflow/first/

Second training,

python train_sceneflow.py --logdir ./checkpoints/sceneflow/second/ --loadckpt ./checkpoints/sceneflow/first/checkpoint_000059.ckpt

Use the following command to finetune DCVSMNet on KITTI using the pretrained model on SceneFlow,

python train_kitti.py --logdir ./checkpoints/kitti/ --loadckpt ./checkpoints/sceneflow/second/checkpoint_000059.ckpt

Evaluation on SceneFlow and KITTI

Pretrained Model

DCVSMNet

Generate disparity images of KITTI test set,

python save_disp.py

Citation

If you find this project helpful in your research, welcome to cite the paper.

@misc{tahmasebi2024dcvsmnet,
      title={DCVSMNet: Double Cost Volume Stereo Matching Network}, 
      author={Mahmoud Tahmasebi and Saif Huq and Kevin Meehan and Marion McAfee},
      year={2024},
      eprint={2402.16473},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgements

Thanks to open source works: CoEx, ACVNet, CGI-Stereo.

About

A fast and accurate stereo matching network with powerful generalization on real-world datasets

MIT License

Languages

Language:Python 100.0%