markostam

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features, an approach that enables more efficient separation of sources from mixtures.

Language: Jupyter Notebook | License: MIT | 30000

meme-vibing-cat

Vibing Cat meme generator

Language: Shell | 6800

audio-degradation-toolbox

Easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Language: Python | License: GPL-2.0 | 4300

IRAPT

Instantaneous pitch estimation based on RAPT framework (EUSIPCO-2012)

Language: MATLAB | License: GPL-3.0 | 700

Speech-Enhancement-Measures

Speech enhancement metrics: CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS

Language: MATLAB | 5400

loudness.py

EBU R128 / ITU-R BS.1770 integrated loudness measurement in Python

Language: Python | License: MIT | 3800

P.808

An open-source implementation of the ITU P.808 standard, "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform and includes implementations of Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

Language: HTML | License: MIT | 19200

Marko Stamenovic's starred repositories

phaseret

SPSI_Python

gsoc-wav2vec2

fairseq

audio_dspy

allosaurus

NBAcomebacks

fourier-feature-superresolution

autodsp

keras-surgeon

DTLN

pitch-detection

sudo_rm_rf

meme-vibing-cat

audio-degradation-toolbox

IRAPT

Speech-Enhancement-Measures

loudness.py

P.808

homebrew-python

wav_logger

chime4-nn-mask

deep_complex_networks

interspeech2019-tutorial

speechmetrics

pyroomacoustics

World

Conv-TasNet

CNN-for-single-channel-speech-enhancement

PeachPy