ImgAlign

If you have training images that aren't aligned properly, you've come to the right place. This tool automatically aligns, crops, and scales HR and LR images for training image-based neural networks. It is a CLI that takes pairs of high and low resolution images that are misaligned, mis-scaled, cropped, or rotated, and outputs new images that are usable for training.

Quick Start

Accepts file inputs or directories. Add the ImgAlign.exe file to your path, or open a CMD prompt in the folder it is in. Create a folder named HR and another named LR containing the HR and LR images with matching names. Use -s (or --scale) to set the scaling multiple and -m (or --mode) to set the retention mode. Output images are saved in the Output folder and are scaled properly. It is recommended to enable rotations or homography only when they are needed, because they may give worse results on image pairs that aren't rotated or warped.
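Since the HR and LR folders need files with matching names, it can help to check for unpaired files before running the tool. The following is a minimal sketch, not part of ImgAlign; the helper name `unmatched_names` is hypothetical, and it assumes "matching" means identical file names including the extension.

```python
from pathlib import Path


def unmatched_names(hr_dir, lr_dir):
    """Return (only_in_hr, only_in_lr) as sorted lists of file names.

    Assumption: a pair "matches" when both folders hold a file with the
    exact same name, extension included.
    """
    hr_names = {p.name for p in Path(hr_dir).iterdir() if p.is_file()}
    lr_names = {p.name for p in Path(lr_dir).iterdir() if p.is_file()}
    return sorted(hr_names - lr_names), sorted(lr_names - hr_names)
```

Calling `unmatched_names("HR", "LR")` from the working directory returns two lists; if both are empty, every image has a partner.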

Example:

ImgAlign -s 2 -m 0

Example 2, with some settings enabled with default values:

ImgAlign -s 2 -m 0 -g HR\ -l LR\ -c -f -a -i -1

Options:

The Python script and the exe file work the same way. If using the Python script, make sure OpenCV and Pillow are installed using the line 'pip install opencv-python Pillow' (OpenCV not yet working on Python 3.10). It is suggested to add the exe file to the path in Windows and use it as a typical CLI.

All options are now fully functional:

-s SCALE, --scale SCALE: Positive integer value. How many times larger the HR resolution should be than the LR resolution.
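As a rough illustration of what the scale means for image dimensions, the sketch below computes the HR size expected for a given LR size. The function names are hypothetical and are not part of ImgAlign; sizes are assumed to be (width, height) tuples in pixels.

```python
def expected_hr_size(lr_size, scale):
    """Return the (width, height) an HR image should have for a given LR size."""
    if not isinstance(scale, int) or scale < 1:
        raise ValueError("scale must be a positive integer")
    width, height = lr_size
    return (width * scale, height * scale)


def is_scaled_pair(hr_size, lr_size, scale):
    """Check whether an HR/LR pair has exactly the expected size ratio."""
    return tuple(hr_size) == expected_hr_size(lr_size, scale)
```

For example, with a scale of 2 a 640x360 LR image pairs with a 1280x720 HR image.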

-m MODE, --mode MODE: Options: 0 or 1. Mode 0 manipulates the HR images while remaining true to the LR images aside from cropping. Mode 1 manipulates the LR images and remains true to the HR images aside from cropping.

-c, --autocrop: Disabled by default. If enabled, this auto crops black borders around HR and LR images.

-t THRESHOLD, --threshold THRESHOLD: Integer 0-255, default 50. Luminance threshold for autocropping. Higher values cause more aggressive cropping. Only works when autocrop is enabled.
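To show why a higher threshold crops more aggressively, here is a minimal NumPy sketch of threshold-based border cropping. It is an illustration under assumptions, not ImgAlign's actual implementation: the function name `autocrop_box` is hypothetical, the input is assumed to be a 2D array of 0-255 luminance values, and a pixel is assumed to count as border when its luminance is at or below the threshold.

```python
import numpy as np


def autocrop_box(gray, threshold=50):
    """Return (top, bottom, left, right) bounds of the non-border content.

    Rows and columns are kept only if they contain at least one pixel
    brighter than `threshold`. Returns None if nothing exceeds it.
    """
    mask = np.asarray(gray) > threshold
    rows = np.flatnonzero(mask.any(axis=1))  # indices of rows with content
    cols = np.flatnonzero(mask.any(axis=0))  # indices of columns with content
    if rows.size == 0:
        return None
    # bottom and right are exclusive, so gray[top:bottom, left:right] is the crop
    return int(rows[0]), int(rows[-1]) + 1, int(cols[0]), int(cols[-1]) + 1
```

With this rule, a dim strip of luminance 40 at the image edge is removed at the default threshold of 50 but kept at a threshold of 30.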

-r, --rotate: Disabled by default. If enabled, this allows rotations when aligning images.

-g HR, --hr HR: HR file or folder path. Not needed if the images are in a folder named HR in the current working directory.

-l LR, --lr LR: LR file or folder path. Not needed if the images are in a folder named LR in the current working directory.

-o, --overlay: Enabled by default. After saving aligned images, this option will create a separate 50:50 merge of the aligned images in the Overlay folder. Useful for quickly checking through image sets for poorly aligned outputs.

-i COLOR, --color COLOR: Default 0. Choose which color to use for color correction. -1 uses LR color and 1 uses HR color.
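A 50:50 merge is simply the per-pixel average of two images. The sketch below shows the idea with NumPy; it is not ImgAlign's code, the name `overlay_50_50` is hypothetical, and it assumes both inputs are uint8 arrays that have already been brought to the same shape.

```python
import numpy as np


def overlay_50_50(image_a, image_b):
    """Blend two same-shaped uint8 images with equal weight."""
    a = np.asarray(image_a)
    b = np.asarray(image_b)
    if a.shape != b.shape:
        raise ValueError("images must have the same shape to be overlaid")
    # Widen to uint16 before adding so the sum cannot overflow 255.
    blended = (a.astype(np.uint16) + b.astype(np.uint16)) // 2
    return blended.astype(np.uint8)
```

In a well-aligned pair the blend looks like a single sharp image, while misalignment shows up as ghosting or doubled edges.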

-f, --full: Disabled by default. If enabled, this allows full homography mapping of the image, correcting rotations, translations, and warping.
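For readers unfamiliar with homography, it is a 3x3 matrix that maps points from one image plane to another, and it can express translation, rotation, and perspective warping in a single transform. The sketch below applies such a matrix to a list of points using NumPy. It illustrates the general math only, not how ImgAlign estimates or applies its mapping; the name `apply_homography` is hypothetical.

```python
import numpy as np


def apply_homography(H, points):
    """Map (x, y) points through a 3x3 homography matrix H."""
    pts = np.asarray(points, dtype=float)
    # Append a 1 to each point to move into homogeneous coordinates.
    homogeneous = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homogeneous @ np.asarray(H, dtype=float).T
    # Divide by the third coordinate to return to ordinary (x, y).
    return mapped[:, :2] / mapped[:, 2:3]
```

A pure translation has the offsets in the last column of the matrix, a rotation fills the upper-left 2x2 block, and nonzero entries in the bottom row produce perspective warping.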

-e, --score: Disabled by default. Calculate an alignment score for each processed pair of images.

-w, --warp: Disabled by default. Match images using Thin Plate Splines, allowing full image warping.

-a, --semiauto: Disabled by default. Semiautomatic mode. Automatically find matching points, but load into a viewer window to manually delete or add more.

-n, --threads: Default 1. Number of threads to use for automatic matching. Large images require a lot of RAM, so start small to test first.

-u, --manual: Disabled by default. Manual mode. If enabled, this opens windows for working pairs of images to be aligned. Double click pairs of matching points on each image in sequence, and close the windows when finished.

                                      Manual Keys: 
                                      Double click left: Select point.
                                      Click and Drag left: Pan image.
                                      Scroll Wheel: Zoom in and out.
                                      Double Click right: Reset image view.
                                      u: Undo last point selection.
                                      w: Close both windows to progress.
                                      p: Preview alignment.  Overlays images using current alignment points.

If using Python, matplotlib 3.5.1 works best. Every other version causes the cursor in one of the windows to change after previewing an image.