Evaluation-of-Frame-Field-Learning-using-Orthophotos

This repository attempts to reproduce the results of the original work by Girard et al.: https://github.com/Lydorn/Polygonization-by-Frame-Field-Learning. In this repository:

  • The environment setup has been simplified by removing hard-coded package versions.
  • Training and inference have been made runnable through code changes and debugging.
  • The model has been trained on a new dataset, the Large Scale Real World Dataset, with data from Lower Saxony, Germany.

ENVIRONMENT SETUP

To set up the environment, use the 'environment.yml' file. Run the following in the terminal to create and activate the environment, here named frame_field.

  • conda init bash
  • source ~/.bashrc
  • conda config --set channel_priority false
  • conda env create -f environment.yml -p ~/frame_field
  • conda activate ~/frame_field

Creating the environment only needs to be done once.

After the environment is created, in order to activate it the next time just use:

  • conda init bash
  • source ~/.bashrc
  • conda activate ~/frame_field

Change to the required directory:

  • cd Evaluation_of_Frame_Field_Learning_using_Orthophotos

P.S.: It takes over an hour to create the complete environment.


DATASET

  1. INRIA Aerial Dataset can be downloaded using the link from the original work: https://github.com/Lydorn/Polygonization-by-Frame-Field-Learning

  2. Large Scale Real World Dataset can be downloaded from the link below: https://tubcloud.tu-berlin.de/s/M6PobTMpaX6q7Ap

Download the zip file called 'data' and unzip it inside the workspace (as explained in the PRE-TRAINED MODEL section) to ensure complete and correct extraction of the contents. The zip folder already contains the required subfolders.

The path of the data directory must be added to 'data_dir_candidates' in 'configs/config.defaults.json'.
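
The name 'data_dir_candidates' suggests that the code tries each listed path and uses the first one that exists on disk, which is why the list can hold paths for several machines. A minimal sketch of that lookup (the helper name is illustrative, not the repository's actual function):

```python
import json
import os

def pick_data_dir(config_path):
    """Return the first entry of 'data_dir_candidates' that exists on disk."""
    with open(config_path) as f:
        config = json.load(f)
    for candidate in config["data_dir_candidates"]:
        candidate = os.path.expanduser(candidate)
        if os.path.isdir(candidate):
            return candidate
    raise FileNotFoundError("no entry of data_dir_candidates exists on this machine")
```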


TRAINING

If inference alone is not sufficient and training is to be performed, it can be started as follows:

  • python main.py --config configs/<name_of_config> --gpus 1
  • python main.py --config configs/private_dataset_polygonized.unet_resnet101_pretrained --gpus 1

Train on raw images from scratch:

  • The raw images are cropped into 725 × 725 patches and stored in a folder called 'processed', which is created during the initial training run. The tangent angles used as frame-field annotations are also calculated during this step.
  • Before loss calculation starts, all training images are patched in this way.
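
The tangent-angle annotation mentioned above can be pictured with a small sketch: for each polygon edge, the direction angle is folded modulo π, since a frame field does not distinguish an edge from its reverse. This is only an illustration of the idea, not the repository's implementation:

```python
import math

def edge_tangent_angles(vertices):
    """Tangent angle of each polygon edge, folded into [0, pi)."""
    angles = []
    n = len(vertices)
    for i in range(n):
        (x0, y0), (x1, y1) = vertices[i], vertices[(i + 1) % n]
        angles.append(math.atan2(y1 - y0, x1 - x0) % math.pi)
    return angles

# For an axis-aligned square the edge angles alternate between 0 and pi/2.
square = [(0, 0), (1, 0), (1, 1), (0, 1)]
```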

Note:

  1. numba=0.53.0 is used here; in case of an error such as AttributeError: module 'numba' has no attribute 'jitclass', the skan package needs to be edited due to incompatibility with newer numba versions:
  • Go to /home/frame_field/lib/python3.8/site-packages/skan/csr.py
  • On Line 21 of csr.py, change '@numba.jitclass(csr_spec)' to '@numba.experimental.jitclass(csr_spec)'
  2. RuntimeError: DataLoader worker (pid(s) 11355) exited unexpectedly
  • In configs/config.defaults.json, change num_workers to 0
  3. ValueError: Number of processes must be at least 1
  • In configs/config.defaults.json, change num_workers to 1
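
If editing csr.py by hand is inconvenient, the same one-line skan fix can be applied with a short script (the site-packages path must be adapted to your environment):

```python
from pathlib import Path

def patch_skan_csr(csr_path):
    """Move the deprecated numba.jitclass decorator to numba.experimental."""
    path = Path(csr_path)
    text = path.read_text()
    patched = text.replace("@numba.jitclass(csr_spec)",
                           "@numba.experimental.jitclass(csr_spec)")
    path.write_text(patched)
    return patched

# e.g. patch_skan_csr("/home/frame_field/lib/python3.8/site-packages/skan/csr.py")
```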

EVALUATION

For evaluation, num_workers in config.defaults.json should be at least 1.

  • python main.py --config configs/<name_of_config> --mode eval
  • python main.py --config configs/private_dataset_polygonized.unet_resnet101_pretrained --mode eval

First, the test images are patched. Evaluation requires a lot of disk space, so it might run into a 'disk quota exceeded' error. In that case, free up space if possible; otherwise move on to inference.


CONFIGURATIONS

There is one main configuration file for each of the datasets, namely:

  1. inria_dataset_polygonized.unet_resnet101_pretrained
  2. private_dataset_polygonized.unet_resnet101_pretrained

The parameters can be changed depending on the experiment one wants to perform. The other config files all feed into the two main files above. The following parameters can be changed in order to perform the experiments explained in the paper.

  1. CNN network backbone: in the respective main config file mentioned above, 'default_filepath' in the 'backbone_params' parameters can be changed to any of the backbone_params.json files provided. Also, 'encoder_depth' can be changed to vary the number of layers in the respective backbone.
  2. Regularization: in configs/backbone_params.json, the value of dropout_2d can be changed to tweak the dropout value.
  3. Hyperparameters: in configs/optim_params.json, the values of max_lr and base_lr can be changed for tweaking the learning rate.
  4. Frame field parameters: in configs/config.defaults.json, when the compute_crossfield parameter is set to false, the frame field is not computed and simple segmentation takes place.
  5. Segmentation parameters: in configs/config.defaults.json, in seg_params, when compute_edge is set to false, the exterior of the polygons is not considered during segmentation.
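
The relationship between the shared config files and the two main files behaves like a recursive dictionary merge, with the main file's values overriding the shared defaults. A sketch of that idea (the merge function below is illustrative; the repository has its own config loader):

```python
def deep_merge(defaults, overrides):
    """Recursively merge two config dicts; values in overrides win."""
    merged = dict(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged

base = {"optim_params": {"max_lr": 0.1, "base_lr": 0.001}, "compute_crossfield": True}
experiment = {"optim_params": {"max_lr": 0.05}}
```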

PRE-TRAINED MODEL

Once the data is in place, inference can be run on any image using the pre-trained models.

Download the zipped pre-trained models and unzip them inside the subfolder runs of the folder frame_field_learning.

The zip folder of the pre-trained models can be downloaded from here:

  1. Upload the zipped folder onto the jupyter notebook (/frame_field_learning/runs).
  2. Unzip it using the following:
  • Fix the zip file in case it is corrupted:

    • !zip -FF /home/inria_dataset_polygonized_unet_resnet101_pretrained_2022_05_10_10_05_30.zip -O private_dataset_polygonized_unet_resnet101_pretrained_2022_05_10_10_05_30.fixed.zip
  • Unzip the file and save it in the same location:

    • !unzip /home/Evaluation-of-Frame-Field-Learning-using-Orthophotos/frame_field_learning/runs/private_dataset_polygonized_unet_resnet101_pretrained_2022_05_10_10_05_30.fixed.zip
  3. Rename the folder, separating the name and datetime stamp with a '|' like so: private_dataset_polygonized.unet_resnet101_pretrained | 2022_05_10_10_05_30

This run can then be used as run_name during inference, without the datetime stamp, like so:

  • python main.py --in_filepath <path_to_image> --run_name private_dataset_polygonized.unet_resnet101_pretrained

INFERENCE

The inference can be run on any image using the pre-trained models provided above or using a new run after training.

  • python main.py --in_filepath <path_to_image> --run_name <name_of_run>
  • python main.py --in_filepath /home/Evaluation-of-Frame-Field-Learning-using-Orthophotos/data/PrivateDataset/raw/test/images/bad_bodenteich3.tif --run_name private_dataset_polygonized_unet_resnet101_pretrained

This saves the crossfield, masks, and segmentation in the same folder as the image. In order to save shapefiles, the necessary parameters in the 'eval_params/save_individual_outputs' dictionary of the config.json inside the runs folder can be set to 'true'; for example, poly_shapefile saves the polygons as shapefiles, which can be used to evaluate the metrics as explained below.
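
Instead of editing config.json by hand, such a flag can also be flipped with a few lines of Python. The key names follow the description above; the helper itself is just a convenience sketch:

```python
import json

def enable_output(config_path, output_name):
    """Set eval_params/save_individual_outputs/<output_name> to true in config.json."""
    with open(config_path) as f:
        config = json.load(f)
    config["eval_params"]["save_individual_outputs"][output_name] = True
    with open(config_path, "w") as f:
        json.dump(config, f, indent=4)

# e.g. enable_output("frame_field_learning/runs/<run_name>/config.json", "poly_shapefile")
```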


METRICS

The IoU and tangent-angle metrics can be evaluated for each image as follows, using the image and the ground-truth shapefile:

  1. Change directory to scripts: cd scripts
  2. Save the shapefiles for each image in the subfolder 'PrivateDataset/raw/test/shp/'
  3. python eval_shapefiles.py --im_filepath <path_to_image> --gt_filepath <path_to_gt_shapefile> --pred_filepath <path_to_predicted_shapefiles>
  • python eval_shapefiles.py --im_filepath ~/Polygonization-by-Frame-Field-Learning/data/PrivateDataset/raw/test/images/bad_bodenteich3.tif --gt_filepath ~/Evaluation-of-Frame-Field-Learning-using-Orthophotos/data/PrivateDataset/raw/test/shp/bad_bodenteich3.shp --pred_filepath ~/Evaluation-of-Frame-Field-Learning-using-Orthophotos/data/PrivateDataset/raw/test/images/poly_shapefile.simple.tol_1/bad_bodenteich3.shp
  4. Run check.py to get the average values of IoU and tangent angle for each image.
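
Conceptually, the IoU reported here is the intersection area of predicted and ground-truth buildings divided by the area of their union. A minimal sketch on rasterized binary masks (the actual eval_shapefiles.py works directly on shapefiles):

```python
def mask_iou(pred, gt):
    """IoU of two equal-sized binary masks given as nested 0/1 lists."""
    pairs = [(p, g) for row_p, row_g in zip(pred, gt) for p, g in zip(row_p, row_g)]
    intersection = sum(p & g for p, g in pairs)
    union = sum(p | g for p, g in pairs)
    return intersection / union if union else 1.0
```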

TENSORBOARD

The logs saved inside the runs folder can be used to track the training process; the loss curves and some predictions can be viewed on TensorBoard as follows:

  1. Create a virtual environment on your local machine, e.g. virtual_env.
  2. In your command prompt, change directory to virtual_env.
  3. Activate it using 'Scripts/activate.bat'; you are now inside the virtual environment.
  4. Change directory to where the logs are saved.
  5. Then type: tensorboard --logdir=<name_of_log>
  6. Let it run until it displays a localhost link such as http://localhost:6006/. Open this link in the browser to access the data inside TensorBoard.

RESULTS

The results on the INRIA dataset can be seen below, where a building with a hole in the city of Innsbruck is well polygonized, followed by a series of attached buildings in San Francisco which are polygonized in clusters rather than separately as desired.

The results for the Large Scale Real World Dataset can be seen below, where a building with sharp edges in the city of Uelzen is well polygonized, followed by a series of free-standing buildings in Bad Bodenteich which are also polygonized well. Another building from Uelzen is shown where the curve of the building is well polygonized, while obstruction by the shadow of a tree leads to improper polygonization.


PREPARATION OF OWN DATASET

To prepare your own dataset for training, the following repository can be used:

https://github.com/kriti115/Dataset-Preparation-for-Frame-Field-Learning-using-Orthophotos

One component of the annotation, gt_polygonized, is not produced by the steps above. To obtain it, at least one run is needed, either by training the network on the existing dataset or by using the pre-trained model. This run is then used to execute the script polygonize_mask.py inside the scripts folder, like so:

  • python polygonize_mask.py -f <path_to_the_respective_binary_mask(gt)> --run_name <name_of_the_run>
  • python polygonize_mask.py -f /home/Evaluation-of-Frame-Field-Learning-using-Orthophotos/PrivateDataset/raw/train/gt/.tif --run_name private_dataset_polygonized_unet_resnet101_pretrained

The geojsons are saved in the same folder as the binary mask (gt). These can then be moved into the gt_polygonized folder of the train folder.

Now that the annotations are complete, we can move forward.

  1. Edit the CITY_METADATA_DICT on Line 27 of the following script according to the number of images, the pixel size, and their mean and standard deviation:
  • pytorch_lydorn/torch_lydorn/torchvision/datasets/private_dataset.py
  2. Make the necessary additions in the dataset_folds.py script as stated.
  3. Add the data into the data folder according to the subfolder structure presented in the data.zip file.
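
For orientation, an entry of CITY_METADATA_DICT might look roughly like the sketch below. The key names and values here are hypothetical placeholders; check Line 27 of private_dataset.py for the actual structure the code expects:

```python
# Hypothetical example only: the real key names and values must match private_dataset.py.
CITY_METADATA_DICT = {
    "uelzen": {
        "fold": "train",                # which split the city belongs to
        "pixelsize": 0.2,               # ground resolution (assumed metres per pixel)
        "numbers": list(range(1, 10)),  # indices of the images for this city
        "mean": [0.42, 0.43, 0.39],     # per-channel RGB mean (placeholder values)
        "std": [0.18, 0.17, 0.16],      # per-channel RGB std (placeholder values)
    },
}
```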

Simple Segmentation Model

A link to a simple segmentation model has been provided below:

https://github.com/kriti115/Binary-segmentation-model.git
