anthony-wss / AudioDecBenchmark

Audio Codec Benchmark

Codec Collection:

Criteria

Waveform (Lower is better)

The L1 loss (mean absolute error) between the reconstructed and ground truth waveforms.
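As a rough illustration, the waveform L1 loss can be sketched in NumPy as below. The function name `waveform_l1` and the choice to trim both signals to their common length are assumptions made for this example, not necessarily what the benchmark does.

```python
import numpy as np

def waveform_l1(reference, reconstructed):
    """Mean absolute difference between two waveforms (lower is better)."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstructed, dtype=np.float64)
    # Assumption: codecs may pad or trim a few samples, so compare the overlap only.
    n = min(ref.shape[-1], rec.shape[-1])
    return float(np.mean(np.abs(ref[..., :n] - rec[..., :n])))
```

Identical waveforms give 0.0, and the value grows with the average per-sample error.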

Mel Distance (Lower is better)

The Mel Distance is the distance between the log mel spectrograms of the reconstructed and ground truth waveforms.
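A minimal NumPy sketch of a log mel spectrogram distance follows. Everything beyond "distance between log mel spectrograms" is an assumption for illustration: the L1 distance, the HTK-style mel scale, `n_fft=1024`, `hop_length=256`, `n_mels=80`, the `eps` floor, and all function names are hypothetical choices, not the benchmark's confirmed settings.

```python
import numpy as np

def hz_to_mel(f):
    # HTK-style mel scale (assumed here; other variants exist).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=np.float64) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=np.float64) / 2595.0) - 1.0)

def mel_filterbank(sample_rate, n_fft, n_mels):
    """Triangular filters spaced evenly on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    fb = np.zeros((n_mels, fft_freqs.shape[0]))
    for m in range(n_mels):
        lo, center, hi = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - lo) / (center - lo)
        falling = (hi - fft_freqs) / (hi - center)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(x, sample_rate, n_fft=1024, hop_length=256, n_mels=80, eps=1e-5):
    x = np.asarray(x, dtype=np.float64)
    if x.shape[0] < n_fft:
        x = np.pad(x, (0, n_fft - x.shape[0]))
    # Slice the signal into overlapping Hann-windowed frames.
    n_frames = 1 + (x.shape[0] - n_fft) // hop_length
    idx = np.arange(n_fft)[None, :] + hop_length * np.arange(n_frames)[:, None]
    mag = np.abs(np.fft.rfft(x[idx] * np.hanning(n_fft), axis=1))
    mel = mag @ mel_filterbank(sample_rate, n_fft, n_mels).T
    # Floor before the log to avoid log(0).
    return np.log10(np.maximum(mel, eps))

def mel_distance(reference, reconstructed, sample_rate=16000):
    """Mean absolute difference between log mel spectrograms (lower is better)."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstructed, dtype=np.float64)
    n = min(ref.shape[0], rec.shape[0])
    a = log_mel_spectrogram(ref[:n], sample_rate)
    b = log_mel_spectrogram(rec[:n], sample_rate)
    return float(np.mean(np.abs(a - b)))
```

Because the mel filterbank pools many high-frequency bins into a few wide bands, this distance is less sensitive to fine high-frequency detail than a plain STFT comparison.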

STFT Distance (Lower is better)

The STFT Distance is the distance between the log magnitude spectrograms of the reconstructed and ground truth waveforms, computed with window lengths of [2048, 512]. It captures fidelity in higher frequencies better than the Mel Distance.
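The multi-resolution idea can be sketched in NumPy as follows. Only the window lengths [2048, 512] come from the description above. The hop size of a quarter window, the Hann window, the L1 distance, the `eps` floor, summing over resolutions, and the function names are assumptions for this example.

```python
import numpy as np

def stft_magnitude(x, win_length, hop_length):
    """Magnitude STFT with a Hann window, shape (n_frames, win_length // 2 + 1)."""
    x = np.asarray(x, dtype=np.float64)
    if x.shape[0] < win_length:
        x = np.pad(x, (0, win_length - x.shape[0]))
    n_frames = 1 + (x.shape[0] - win_length) // hop_length
    idx = np.arange(win_length)[None, :] + hop_length * np.arange(n_frames)[:, None]
    frames = x[idx] * np.hanning(win_length)
    return np.abs(np.fft.rfft(frames, axis=1))

def stft_distance(reference, reconstructed, window_lengths=(2048, 512), eps=1e-5):
    """Sum over window lengths of the mean absolute log-magnitude difference."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstructed, dtype=np.float64)
    n = min(ref.shape[0], rec.shape[0])
    ref, rec = ref[:n], rec[:n]
    total = 0.0
    for win in window_lengths:
        hop = win // 4  # assumed hop size
        log_ref = np.log10(np.maximum(stft_magnitude(ref, win, hop), eps))
        log_rec = np.log10(np.maximum(stft_magnitude(rec, win, hop), eps))
        total += np.mean(np.abs(log_ref - log_rec))
    return float(total)
```

The long window gives fine frequency resolution and the short window gives fine time resolution, so the two together penalize both spectral and temporal smearing.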

PESQ (Higher is better)

PESQ (Perceptual Evaluation of Speech Quality) is an intrusive perceptual quality metric: it compares the reconstructed speech against the ground truth and predicts the mean opinion score a human listener would give.
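One way to compute PESQ in Python is the third-party `pesq` package (`pip install pesq`). The wrapper below is a sketch under that assumption. Whether the benchmark uses this package, and whether it uses wide-band or narrow-band mode, is not stated here. The wrapper name `pesq_score` is hypothetical.

```python
import numpy as np

def pesq_score(reference, reconstructed, sample_rate=16000):
    """PESQ of `reconstructed` against `reference` (higher is better)."""
    # PESQ is only defined for 8 kHz (narrow-band) and 16 kHz input.
    if sample_rate not in (8000, 16000):
        raise ValueError("PESQ supports only 8000 Hz or 16000 Hz; resample first")
    from pesq import pesq  # third-party dependency, imported lazily

    # Assumption: use wide-band mode at 16 kHz and narrow-band mode at 8 kHz.
    mode = "wb" if sample_rate == 16000 else "nb"
    ref = np.asarray(reference, dtype=np.float32)
    deg = np.asarray(reconstructed, dtype=np.float32)
    n = min(ref.shape[0], deg.shape[0])
    return float(pesq(sample_rate, ref[:n], deg[:n], mode))
```

Audio at other sample rates has to be resampled before scoring, and the inputs should contain actual speech, since the metric is designed for speech signals.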

STOI (Higher is better)

STOI (Short-Time Objective Intelligibility) is an intrusive metric that estimates the intelligibility of the reconstructed speech by comparing it against the ground truth.
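STOI can be computed with the third-party `pystoi` package (`pip install pystoi`). The wrapper below is a sketch assuming that package and mono input. The name `stoi_score` is hypothetical, and the benchmark's actual settings (for example, classic versus extended STOI) are not stated here.

```python
import numpy as np

def stoi_score(reference, reconstructed, sample_rate, extended=False):
    """STOI of `reconstructed` against `reference` (higher is better)."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstructed, dtype=np.float64)
    # Assumption: this sketch handles mono signals only.
    if ref.ndim != 1 or rec.ndim != 1:
        raise ValueError("expected 1-D mono waveforms")
    from pystoi import stoi  # third-party dependency, imported lazily

    # Compare only the overlapping samples.
    n = min(ref.shape[0], rec.shape[0])
    return float(stoi(ref[:n], rec[:n], sample_rate, extended=extended))
```

Scores close to 1 indicate that the reconstructed speech should be about as intelligible as the reference.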

Languages

Python 91.6%, JavaScript 6.1%, HTML 1.6%, CSS 0.7%