SHI-Labs / VideoINR-Continuous-Space-Time-Super-Resolution

[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

Home Page:https://arxiv.org/abs/2206.04647

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VideoINR

This repository contains the official implementation for VideoINR introduced in the following paper:

VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
Zeyuan Chen, Yinbo Chen, Jingwen Liu, Xingqian Xu, Vidit Goel, Zhangyang Wang, Humphrey Shi, Xiaolong Wang
CVPR 2022

You can find more visual results and a brief introduction to VideoINR at our project page.

Citation

If you find our work useful in your research, please cite:

@inproceedings{chen2022vinr,
  author    = {Chen, Zeyuan and Chen, Yinbo and Liu, Jingwen and Xu, Xingqian and Goel, Vidit and Wang, Zhangyang and Shi, Humphrey and Wang, Xiaolong},
  title     = {VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution},
  journal   = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
}

Environmental Setup

The code is tested in:

If you are using Anaconda, the following command can be used to build the environment:

conda create -n videoinr
conda activate videoinr
conda install pytorch=1.6 -c pytorch

cd models/modules/DCNv2/
python setup.py install

Demo

  1. Download the pre-trained model from google drive.

  2. Convert your video of interest to a sequence of images. This process can be completed by many apps, e.g. ffmpeg and AdobePR.

The folder that contains this image sequence should have a structure as follows:

data_path
├── img_1.jpg
├── img_2.jpg
├── ...
├── img_n.jpg
  1. Using VideoINR for performing space-time super-resolution. You can adjust up-sampling scales by setting different space_scale and time_scale.
python demo.py --space_scale 4 --time_scale 8 --data_path [YOUR_DATA_PATH]
  1. The output would be three folders including low-resolution images, bicubic-upsampling images, and the results of VideoINR.

Training

About

[CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution

https://arxiv.org/abs/2206.04647


Languages

Language:Python 67.3%Language:Cuda 23.7%Language:C++ 6.5%Language:C 2.4%Language:Shell 0.1%