zdyshine / vmaf

Perceptual video quality assessment based on multi-method fusion.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

VMAF - Video Multi-Method Assessment Fusion

Build Status AppVeyor Build Status libvmaf

VMAF is a perceptual video quality assessment algorithm developed by Netflix. VMAF Development Kit (VDK) is a software package that contains the VMAF algorithm implementation, as well as a set of tools that allows a user to train and test a custom VMAF model. Read this techblog post for an overview, or this post for the latest updates and tips for best practices.

vmaf logo

News

  • (2/27/20) We have changed VMAF's license from Apache 2.0 to BSD+Patent, a more permissive license compared to Apache that also includes an express patent grant. VMAF has been increasingly used in a number of open-source multimedia projects. However, it posed a problem to more deeply integrate VMAF into these projects to perform advanced tasks. Many of the open-source projects are licensed under GPLv2/LGPLv2, and the old Apache 2.0 license was incompatible with GPLv2/LGPLv2. Due to this incompatibility, when integrating VMAF source code into these projects, the license automatically got bumped to GPLv3/LGPLv3, which could prove challenging for downstream projects who might want to integrate VMAF. Changing VMAF’s license to BSD+Patent have resolved this issue, because it is compatible with GPLv2/LGPLv2.
  • (2/27/20) We made a few changes in a recent refactoring effort: 1) migrated the build system from makefile to meson, 2) restructured the code, and 3) introduced a new release candidate API with the associated library libvmaf_rc and executable vmaf_rc, co-existing with the current libvmaf and vmafossexec, all under libvmaf/build. The new release candidate API is designed for better interoperrability with encoding optimization. We will deprecate the old API on a future date.
  • (9/8/19) Added a link to report VMAF bad cases. Over time, we have received feedbacks on when VMAF's prediction does not reflect the expected perceptual quality of videos, either they are corner cases where VMAF fails to cover, or new application scenarios which VMAF was not initially intended for. In response to that, we have created the Google form to allow users to upload their video samples and describe the scenarios. The bad cases are valuable for improving future versions of VMAF. Users can opt in or out for sharing their sample videos publicly.

Frequently Asked Questions

Refer to the FAQ page.

Usages

The VDK package offers a number of ways for a user to interact with the VMAF algorithm implementations. The core feature extraction library is written in C. The rest scripting code including the classes for machine learning regression, training and testing VMAF models and etc., is written in Python. Besides, there is C++ an implementation partially replicating the logic in the regression classes, such that the VMAF prediction (excluding training) is fully implemented.

There are a number of ways one can use the package:

  • VMAF Python library offers full functionalities including running basic VMAF command line, running VMAF on a batch of video files, training and testing a VMAF model on video datasets, and visualization tools, etc.
  • vmafossexec - a C++ executable offers running the prediction part of the algorithm in full, such that one can easily deploy VMAF in a production environment without needing to configure the Python dependencies. Additionally, vmafossexec offers a number of exclusive features, such as 1) speed optimization using multi-threading and skipping frames, 2) optionally computing PSNR, SSIM and MS-SSIM metrics in the output.
  • libvmaf - a C library offers an interface to incorporate VMAF into your C/C++ code.
  • VMAF is now included as a filter in FFmpeg and can be configured using: ./configure --enable-libvmaf --enable-version3. See the FFmpeg documentation for usage.
  • VMAF Dockerfile generates a VMAF docker image from the VMAF Python library. Refer to this document for detailed usages.
  • Build VMAF on Windows: follow instructions on this page.

Datasets

We also provide two sample datasets including the video files and the properly formatted dataset files in Python. They can be used as sample datasets to train and test custom VMAF models.

Models

Besides the default VMAF model which predicts the quality of videos displayed on a 1080p HDTV in a living-room-like environment, VDK also includes a number of additional models, covering phone and 4KTV viewing conditions. Refer to the models page for more details.

Confidence Interval

Since VDK v1.3.7 (June 2018), we have introduced a way to quantify the level of confidence that a VMAF prediction entails. Each VMAF prediction score now can come with a 95% confidence interval (CI), which quantifies the level of confidence that the prediction lies within the interval. Refer to the VMAF confidence interval page for more details.

Matlab Functionality

Besides the Python/C/C++ part of the repository, we also introduced a number of algorithms that are implemented in Matlab. For example, users can calculate ST-RRED, ST-MAD, SpEED-QA, and BRISQUE. For more details, see the Matlab Usage page for more details.

References

Refer to the references page.

About

Perceptual video quality assessment based on multi-method fusion.

License:Other


Languages

Language:Python 57.1%Language:C 27.8%Language:MATLAB 10.4%Language:C++ 4.1%Language:Meson 0.5%Language:Dockerfile 0.0%Language:Makefile 0.0%Language:Shell 0.0%