selous123 / FR-SRGAN

Class Final Project for MIT 6.819 Advances in Computer Vision

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

FR-SRGAN

Final Project for MIT 6.819 Advances in Computer Vision

Required Package

pytorch
skimage
numpy
cv2
tqdm

Usage

TODO

Overview

This project is based on the following two papers.

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

https://arxiv.org/abs/1609.04802

Frame-Recurrent Video Super-Resolution

https://arxiv.org/abs/1801.04590

We are trying to combine these two models together in our project (this is the where the name come from "Frame Recurrent Super Resolution GAN").

Our final report is available here: Report

Dataset

We used the dataset from toflow-dataset, which is about 15GB, containing 7.8k video clips with 7 frames per clip.

Results

We trained our model on the dataset for 9 epochs. and compared with default SRGAN model in the repo. We did not retrain SRGAN in our dataset, so the result is for reference only.

The following results are produced by 7-epoch model.

Temporal Profile

SRGAN

Frvri4.png

FRVSR

FrvyW9.png

FR-SRGAN

Frv6zR.png

Ground Truth

FrvsJJ.png

Comparison

Frxiyq.gif

Acknowledgement

We referred to SRGAN Implementation by LeftThomas in our project. We would like to thank the author for his fabulous work.

About

Class Final Project for MIT 6.819 Advances in Computer Vision


Languages

Language:Python 100.0%