cozcinar / vmaf

Perceptual video quality assessment based on multi-method fusion.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VMAF - Video Multi-Method Assessment Fusion

Build Status AppVeyor Build Status

VMAF is a perceptual video quality assessment algorithm developed by Netflix. VMAF Development Kit (VDK) is a software package that contains the VMAF algorithm implementation, as well as a set of tools that allows a user to train and test a custom VMAF model. Read this techblog post for an overview, or this post for the latest updates and tips for best practices.

News

  • (1/31/19) Optimized C code for speed. Running in multithreading mode, vmafossexec achieves ~40% run time reduction compared to the previous version.
  • (11/19/18) Added a BD-rate calculator implementation. See more details here.
  • (10/25/18) We have published our second techblog on VMAF, with recommendations on best practices.
  • (9/13/18) SUREAL is no longer a submodule to VMAF.
  • (6/19/18) Each VMAF prediction score now comes with a 95% confidence interval (CI), which quantifies the level of confidence that the prediction lies within the interval.

Frequently Asked Questions

Refer to the FAQ page.

Usages

The VDK package offers a number of ways for a user to interact with the VMAF algorithm implementations. The core feature extraction library is written in C. The rest scripting code including the classes for machine learning regression, training and testing VMAF models and etc., is written in Python. Besides, there is C++ "wrapper" code partially replicating the logic in the regression classes, such that the VMAF prediction (excluding training) is fully implemented in C/C++.

There are a number of ways one can use the package:

  • VMAF Python library offers full functionalities including running basic VMAF command line, running VMAF on a batch of video files, training and testing a VMAF model on video datasets, and visualization tools, etc.
  • vmafossexec - a C++ "wrapper" executable offers running the prediction part of the algorithm in full, such that one can easily deploy VMAF in a production environment without needing to configure the Python dependencies. Additionally, vmafossexec offers a number of exclusive features, such as 1) speed optimization using multi-threading and skipping frames, 2) optionally computing PSNR, SSIM and MS-SSIM metrics in the output.
  • libvmaf.a - a static library offers an interface to incorporate VMAF into your C/C++ code. Using this library, VMAF is now included as a filter in FFmpeg main branch, and can be configured using: ./configure --enable-libvmaf --enable-version3. See this section for details. Using FFmpeg with libvmaf allows passing in compressed video bitstreams directly to VMAF.
  • VMAF Dockerfile generates a VMAF docker image from the VMAF Python library. Refer to this document for detailed usages.
  • Build VMAF on Windows: follow instructions on this page.

Datasets

We also provide two sample datasets including the video files and the properly formatted dataset files in Python. They can be used as sample datasets to train and test custom VMAF models.

Models

Besides the default VMAF model which predicts the quality of videos displayed on a 1080p HDTV in a living-room-like environment, VDK also includes a number of additional models, covering phone and 4KTV viewing conditions. Refer to the models page for more details.

Confidence Interval

Since VDK v1.3.7 (June 2018), we have introduced a way to quantify the level of confidence that a VMAF prediction entails. Each VMAF prediction score now can come with a 95% confidence interval (CI), which quantifies the level of confidence that the prediction lies within the interval. Refer to the VMAF confidence interval page for more details.

Matlab Functionality

Besides the Python/C/C++ part of the repository, we also introduced a number of algorithms that are implemented in Matlab. For example, users can calculate ST-RRED, ST-MAD, SpEED-QA, and BRISQUE. For more details, see the Matlab Usage page for more details.

References

Refer to the references page.

About

Perceptual video quality assessment based on multi-method fusion.

License:Apache License 2.0


Languages

Language:C++ 39.3%Language:C 26.1%Language:Python 20.4%Language:HTML 6.6%Language:MATLAB 4.0%Language:Java 1.5%Language:Makefile 1.1%Language:M4 0.9%Language:Batchfile 0.1%Language:Shell 0.0%Language:Dockerfile 0.0%