janiavdv / colorizer

Grayscale image colorization using a U-Net CNN (with VGG-19) and perceptual loss.

Home Page: https://rgbit.io/

Image Colorization with CNNs

CSCI1430 Spring 2024 Final Project

We implemented a convolutional neural network (CNN) that colorizes grayscale images using a U-Net architecture built on the VGG-19 model. U-Net is a popular deep learning architecture known for its effectiveness in image segmentation tasks. VGG-19 is a large pre-trained model with roughly 144 million parameters; it is traditionally used for image classification and feature extraction, and we adapted it for colorization. Our model is trained on 365,000 scene images from the MIT Places365 dataset, split 90/10 into 328,500 training and 36,500 test images. The model also uses a custom perceptual loss function for a higher-level chromatic evaluation of the network's output. Our results show that the model produces vibrant and realistically colored images, reinforcing the potential of deep learning in creative image processing. Below is our VGG-19 U-Net architecture.

[Figure: VGG-19 U-Net architecture]
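
To make the encoder/decoder structure more concrete, the following is a rough shape trace, not the project's actual model code. It relies on two facts about VGG-19 (its five convolutional blocks output 64, 128, 256, 512, and 512 channels, with 2x2 max pooling between blocks) and on several assumptions about the decoder that this README does not confirm: that the fifth block acts as the bottleneck, that each decoder stage upsamples by 2 and concatenates the matching encoder output, and that a convolution then reduces the channels to the skip's count. The function name trace_unet_shapes is hypothetical.

```python
# Output channels of the five VGG-19 convolutional blocks.
VGG19_BLOCK_CHANNELS = [64, 128, 256, 512, 512]


def trace_unet_shapes(height, width):
    """Return (encoder, decoder) lists of (name, height, width, channels).

    Assumes height and width are divisible by 16 so that the four
    pooling steps and the four upsampling steps line up exactly.
    """
    encoder = []
    h, w = height, width
    for i, channels in enumerate(VGG19_BLOCK_CHANNELS, start=1):
        encoder.append((f"enc{i}", h, w, channels))
        if i < len(VGG19_BLOCK_CHANNELS):
            h, w = h // 2, w // 2  # 2x2 max pooling between blocks

    decoder = []
    channels = encoder[-1][3]  # start from the bottleneck (assumed: enc5)
    for name, skip_h, skip_w, skip_channels in reversed(encoder[:-1]):
        # Upsample 2x to the skip's resolution, then concatenate the skip.
        decoder.append((f"dec_{name}", skip_h, skip_w, channels + skip_channels))
        # Assumption: a convolution after the concat reduces the channels
        # back to the skip's channel count before the next stage.
        channels = skip_channels
    return encoder, decoder


if __name__ == "__main__":
    enc, dec = trace_unet_shapes(224, 224)
    for stage in enc + dec:
        print(stage)
```

The decoder channel counts listed are those right after concatenation; a final layer mapping the last stage to color channels is left out because the README does not describe the output format.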

Example Results

[Figure: grid of example results]
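
The perceptual loss mentioned in the project description is not reproduced in this README. In general, a perceptual loss compares two images through the feature maps of a fixed network instead of pixel by pixel. The sketch below illustrates that idea with NumPy only. The function standin_features is a hypothetical stand-in that uses 2x2 average pooling in place of real pre-trained activations, and the choice of three feature levels, equal weights, and mean squared error are all assumptions.

```python
import numpy as np


def standin_features(image, levels=3):
    """Hypothetical stand-in for a frozen feature extractor.

    Returns a list of feature maps at progressively coarser resolutions,
    produced by 2x2 average pooling. A real perceptual loss would return
    activations from selected layers of a pre-trained network instead.
    """
    features = []
    x = image
    for _ in range(levels):
        h, w, c = x.shape
        x = x[: h - h % 2, : w - w % 2]  # crop to even height and width
        x = x.reshape(h // 2, 2, w // 2, 2, c).mean(axis=(1, 3))
        features.append(x)
    return features


def perceptual_loss(pred, target, feature_fn=standin_features, weights=None):
    """Weighted sum of mean squared errors between feature maps."""
    pred_feats = feature_fn(pred)
    target_feats = feature_fn(target)
    if weights is None:
        weights = [1.0] * len(pred_feats)  # assumed: equal layer weights
    total = 0.0
    for weight, p, t in zip(weights, pred_feats, target_feats):
        total += weight * np.mean((p - t) ** 2)
    return float(total)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    predicted = rng.random((8, 8, 3))     # stand-in for a colorized output
    ground_truth = rng.random((8, 8, 3))  # stand-in for the true color image
    print(perceptual_loss(predicted, ground_truth))
```

Swapping feature_fn for a function that returns activations from a frozen pre-trained network would turn this into a conventional perceptual loss; the rest of the computation stays the same.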

Usage

Download data:

cd data
./download.sh

Running the Flask server:

export FLASK_APP=api
flask run

Visualize images:

python3 visualize.py

About

License: MIT License


Languages

Python 97.7%, Shell 2.3%