FineRecon

This software project accompanies the research paper, FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction.

FineRecon is a deep learning model for 3D reconstruction from posed RGB images.

Rui

Environment

r4090:

conda create --name finerecon-py310 python=3.10 pip
conda activate finerecon-py310

# if you have not set up the cuda version to use in .bashrc, you can do it here
export LD_LIBRARY_PATH="/usr/local/cuda-11.8/lib64:$LD_LIBRARY_PATH"
export CUDA_HOME="/usr/local/cuda-11.8"
export PATH="/usr/local/cuda-11.8/bin:$PATH"

mm2:

# if you have not set up the cuda version to use in .bashrc, you can do it here
export LD_LIBRARY_PATH="/usr/local/cuda-11.3/lib64:$LD_LIBRARY_PATH"
export CUDA_HOME="/usr/local/cuda-11.3"
export PATH="/usr/local/cuda-11.3/bin:$PATH"

conda create --name finerecon-py39 python=3.9 pip
conda activate finerecon-py39
pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
conda install clang llvm-openmp
pip install -r requirements_py39.txt

Extract ScanNet and GT TSDF for FineRecon

To preprocess ScanNet data -> /newfoundland/ScanNet/extracted:

python tools/extract_scannet.py # extract from .scan to image, depth and camera files
python tools/preprocess_scannet.py # dump to finerecon-compatible format
python tools/tmp_lns_extracted.py

To generate ground truth TSDF for ScanNet -> /data/finerecon_data/scannet_tsdf:

python generate_gt_tsdf.py --dataset-dir /newfoundland/ScanNet/extracted --output-dir /data/finerecon_data/scannet_tsdf # also set these to paths in config.yml

Extract keyframes

obsolete; reading from SimpleRecon files instead (see next section; the results should be the same though)

To extract keyframes json files for ScanNet using dvmvs -> /newfoundland/ScanNet/extracted/{split}_keyframes.json:

cd third-party/deep-video-mvs
python notebooks/extract_keyframes_scannet_finerecon.py

Extract depth maps for keyframes

To process keyframe selection by SimpleRecon -> /newfoundland/ScanNet/extracted_simplerecon/{split}_keyframes.json:

cd third-party/simplerecon
data_scripts/convert_keyframe_for_finerecon.py

To generate estimated depth maps for ScanNet using SimpleRecon:

First, extract another temporary copy of ScanNet -> /newfoundland/ScanNet/extracted_simplerecon, using https://github.com/Jerrypiglet/simplerecon/tree/main/data_scripts/scannet_wrangling_scripts

cd third-party/simplerecon/data_scripts/scannet_wrangling_scripts
conda activate simplerecon-py310
python reader.py --scans_folder /newfoundland/ScanNet/scans_test \
                 --output_path  /newfoundland/ScanNet/extracted_simplerecon/scans_test \
                 --scan_list_file splits/scannetv2_test.txt \
                 --num_workers 8 \
                 --export_poses \
                 --export_depth_images \
                 --export_color_images \
                 --export_intrinsics;

python reader.py --scans_folder /newfoundland/ScanNet/scans \
                 --output_path  /newfoundland/ScanNet/extracted_simplerecon/scans \
                 --scan_list_file splits/scannetv2_train.txt \
                 --num_workers 24 \
                 --export_poses \
                 --export_depth_images \
                 --export_color_images \
                 --export_intrinsics;

python reader.py --scans_folder /newfoundland/ScanNet/scans \
                 --output_path  /newfoundland/ScanNet/extracted_simplerecon/scans \
                 --scan_list_file splits/scannetv2_val.txt \
                 --num_workers 16 \
                 --export_poses \
                 --export_depth_images \
                 --export_color_images \
                 --export_intrinsics;

Then run inference:

CUDA_VISIBLE_DEVICES=0 python test.py --name HERO_MODEL_scannet \
            --output_base_path outputs \
            --config_file configs/models/hero_model.yaml \
            --load_weights_from_checkpoint weights/hero_model.ckpt \
            --data_config configs/data/scannet_default_test.yaml \
            --num_workers 8 \
            --fast_cost_volume \
            --cache_depths \
            --dump_depth_visualization \
            --run_fusion \
            --depth_fuser open3d \
            --fuse_color \
            --batch_size 2;

Run FineRecon

Preprocess data by:

python tmp_lns_extracted.py # create symbolic links to extracted data
python tmp_lns_tsdf.py # create symbolic links to tsdf data
python tmp_lns_depths.py # create png files from simplerecon prediction pickles

Then run inference:

python main.py --task predict --ckpt weights/RTS-DG-PB.ckpt

Setup

Dependencies

pip install \
  matplotlib \
  pillow \
  numpy \
  scikit-image \
  scipy \
  timm \
  torch==1.13 \
  torchvision \
  "tqdm>=4.65" \
  trimesh \
  pytorch_lightning==1.8 \
  pyyaml \
  opencv-python-headless \
  python-box \
  tensorboard

Config

cp example-config.yml config.yml

The paths in config.yml will need to be edited to point to the data directories.

Data

FineRecon requires an RGB-D scan dataset such as ScanNet, which can be downloaded and extracted using the scripts provided by the ScanNet authors.

The dataset structure expected by FineRecon is

/path/to/dataset/
    test.txt
    train.txt
    val.txt
    first_scan/
        color/
            0.jpg
            1.jpg
            2.jpg
            ...
        depth/
            0.png
            1.png
            2.png
            ...
        intrinsic_color.txt
        intrinsic_depth.txt
        pose.npy
    second_scan/
    ...
    last_scan/

The files test.txt, train.txt, and val.txt should each contain a newline-separated list of scan directory names (e.g. first_scan) describing the test, train, and validation splits respectively. Each pose.npy contains the camera poses (world-to-camera transformation matrices) as an array of size (N, 4, 4) in npy format, where any invalid poses are marked with the value Inf. The files intrinsic_color.txt and intrinsic_depth.txt should contain the (4, 4) color and depth intrinsic matrices, respectively. In config.yml, the value of dataset_dir should be set to /path/to/dataset.

To generate the ground truth TSDF run generate_gt_tsdf.py --dataset-dir /path/to/dataset --output-dir /path/to/gt_tsdf, and in config.yml set the value of tsdf_dir to /path/to/gt_tsdf.

To run training or inference with depth guidance, make sure depth_guidance.enabled is set to True in the config and set the value of depth_guidance.pred_depth_dir to /path/to/pred_depth, which should have the following structure:

/path/to/pred_depth/
    first_scan/
        depth/
            0.png
            1.png
            2.png
            ...
        intrinsic_depth.txt
    second_scan/
    ...
    last_scan/

It can be helpful to limit inference to only using a set of pre-defined keyframes, because it's faster (particulary with point back-projection enabled) and because depth estimates may not be available for all frames. To do this, set test_keyframes_file in the config to the location of a JSON file with the following structure:

{
  "first_scan": [i0, i1, i2, ...],
  ...
}

where i0, i1, etc. are the integer indices of the keyframes. The view selection strategy for keyframes is from DeepVideoMVS.

Training

python main.py

Inference

We provide pre-trained weights here: checkpoint.zip. These are weights for our main model using resolution-agnostic TSDF supervision, depth guidance, and point-backprojection.

python main.py --task predict --ckpt path/to/checkpoint.ckpt

For convenience, we also provide the inference results (meshes) of our main model on the ScanNet test set:

High-resolution [1 cm] (2.4 GB) → This is the resolution used in figures and metrics, unless otherwise stated.
Low-resolution [4 cm] (148 MB)

Evaluation

Evaluation code and data for 3D metrics can be found in TransformerFusion, and evaluation code for 2D metrics can be found in Atlas.

Citation

@article{stier2023finerecon,
  title={{FineRecon}: Depth-aware Feed-forward Network for Detailed 3D Reconstruction},
  author={Stier, Noah and Ranjan, Anurag and Colburn, Alex and Yan, Yajie and Yang, Liang and Ma, Fangchang and Angles, Baptiste},
  journal={arXiv preprint},
  year={2023}
}

Jerrypiglet / ml-finerecon