ArtCNN

Super-Resolution Convolutional Neural Networks as GLSL shaders for mpv

Overview

Super-Resolution Convolutional Neural Networks optimised for anime. ArtCNN implements a simple feed-forward architecture with one long-skip connection and a pixel-shuffle layer to produce the HR image.
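
As a rough illustration, that topology might look like the following in Keras 3 (the framework the models are trained with; see Technical Details). The sketch uses the C4F16 configuration described below; the ReLU activations, kernel sizes, fixed input size, and exact skip placement are assumptions rather than the published recipe:

import keras
from keras import layers, ops

class DepthToSpace(layers.Layer):
    """Pixel shuffle: rearranges (H, W, C*r*r) features into (H*r, W*r, C)."""

    def __init__(self, r, **kwargs):
        super().__init__(**kwargs)
        self.r = r

    def call(self, x):
        h, w = x.shape[1], x.shape[2]
        c = x.shape[3] // (self.r * self.r)
        x = ops.reshape(x, (-1, h, w, self.r, self.r, c))
        x = ops.transpose(x, (0, 1, 3, 2, 4, 5))    # interleave the sub-pixel axes
        return ops.reshape(x, (-1, h * self.r, w * self.r, c))

def artcnn_c4f16(size, scale=2):
    # A fixed spatial size keeps the sketch simple; the real shaders are size-agnostic.
    inp = keras.Input(shape=(size, size, 1))                  # LR luma plane
    x = layers.Conv2D(16, 3, padding="same", activation="relu")(inp)
    skip = x                                                  # start of the long skip
    for _ in range(4):                                        # 4 internal conv layers
        x = layers.Conv2D(16, 3, padding="same", activation="relu")(x)
    x = layers.Add()([x, skip])                               # the one long-skip connection
    x = layers.Conv2D(scale * scale, 3, padding="same")(x)    # scale^2 sub-pixel channels
    return keras.Model(inp, DepthToSpace(scale)(x))           # pixel shuffle -> HR image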

Model Architecture

The model is offered in 4 sizes:

  • C16F64: This has 16 internal convolution layers with 64 filters each. Offered only in the ONNX format. If you're interested in using ArtCNN outside of mpv, you should probably use this (see the sketch after this list).
  • C4F32: This has 4 internal convolution layers with 32 filters each. You need a relatively decent GPU to run this well. Also offered in the ONNX format.
  • C4F16: This has 4 internal convolution layers with 16 filters each. You should be able to run this on most modern GPUs.
  • C4F8: This has 4 internal convolution layers with 8 filters each. You should probably only use this on very slow systems.
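
Since the ONNX models are the suggested route outside mpv, here is a minimal sketch of running one with onnxruntime. The file name and the NCHW float32 single-plane layout are assumptions; inspect the model (for example with netron) to confirm its actual input specification:

import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("ArtCNN_C16F64.onnx")  # placeholder file name
inp_name = sess.get_inputs()[0].name

luma = np.random.rand(1, 1, 540, 960).astype(np.float32)  # stand-in luma plane in [0, 1]
(sr,) = sess.run(None, {inp_name: luma})                  # HR output; the scale depends on the model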

Regarding the suffixes:

  • Shaders without any suffixes are the base models. These are meant to respect the source and produce fairly neutral outputs.
  • Shaders with the DS suffix are trained to denoise and sharpen, which is useful for most web sources.
  • Shaders with the CMP suffix are compute shaders. These are still experimental, but they're usually faster (especially on Vulkan).
  • Shaders with the Chroma suffix are chroma shaders. These are meant to be used on high-quality sources, and you should not use them alongside luma prescalers.
  • The old YCbCr and RGB variants can be found under the "Old" directory. These have not been updated to reflect the new software stack and training dataset yet.

You may occasionally find some models under the "Experiments" directory, which serves as a testing ground for the main ArtCNN models.

Technical Details

The luma models are trained on an anime dataset containing images from the following sources:

  • Violet Evergarden
  • Koe no Katachi
  • Kimi no Na Wa
  • Hibike Euphonium (Chikai and Ensemble)
  • Yuru Camp (Film only)
  • SAO (OS and Progressive)
  • Evangelion: 3.0+1.0
  • Tsukihime R (Backgrounds only)

The Chroma models are trained on DIV2K+Manga109.

The images are split into smaller 256x256 patches and downsampled with the box filter. The L1 loss function is used alongside the Adam optimiser. The models are trained using Keras 3 with its JAX backend.
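
Under those details, a minimal training sketch might look like the following, reusing the hypothetical artcnn_c4f16() from the Overview. The learning rate, batch size, and epoch count are placeholders, and the random arrays stand in for real patches:

import os
os.environ["KERAS_BACKEND"] = "jax"   # must be set before importing keras

import numpy as np
import keras

def box_downsample(hr, factor=2):
    # A box filter over non-overlapping blocks is plain block averaging.
    b, h, w, c = hr.shape
    return hr.reshape(b, h // factor, factor, w // factor, factor, c).mean(axis=(2, 4))

hr_patches = np.random.rand(16, 256, 256, 1).astype("float32")  # stand-in 256x256 HR patches
lr_patches = box_downsample(hr_patches)                         # box-filtered LR inputs

model = artcnn_c4f16(size=128, scale=2)             # hypothetical model from the Overview sketch
model.compile(optimizer=keras.optimizers.Adam(),    # Adam optimiser
              loss=keras.losses.MeanAbsoluteError())  # L1 loss
model.fit(lr_patches, hr_patches, batch_size=4, epochs=1)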

mpv Instructions

Add something like this to your mpv config:

glsl-shader="path/to/shader/ArtCNN_C4F16_DS.glsl"
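
If you want to load ArtCNN alongside other shaders, or toggle it at runtime, mpv's standard list options and commands cover that. For example, in mpv.conf:

glsl-shaders-append="path/to/shader/ArtCNN_C4F16_DS.glsl"

And in input.conf:

CTRL+1 change-list glsl-shaders toggle "path/to/shader/ArtCNN_C4F16_DS.glsl"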

VapourSynth Instructions

ArtCNN is natively supported by vs-mlrt. Please follow the instructions found there.

Alternatively, you can run the GLSL shaders with vs-placebo.

The ONNX models should generally be preferred over their GLSL counterparts.
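
As a rough sketch, running the ONNX models through vs-mlrt's Python wrapper might look like the following. The inference() call and Backend names come from vsmlrt's generic interface, and the file name and source filter are placeholders, so defer to the vs-mlrt documentation for specifics:

import vapoursynth as vs
from vsmlrt import inference, Backend

core = vs.core

clip = core.bs.VideoSource("input.mkv")                          # any source filter works here
y = core.std.ShufflePlanes(clip, planes=0, colorfamily=vs.GRAY)  # luma models take a single plane
y = core.resize.Point(y, format=vs.GRAYS)                        # 32-bit float input
sr = inference(y, network_path="ArtCNN_C4F32.onnx",
               backend=Backend.TRT())                            # pick a backend your system supports
sr.set_output()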

Example

[Example comparison image]

License: MIT License
