Marko Stamenovic's starred repositories
SPSI_Python
Single Pass Spectrogram Inversion in a Jupyter Python notebook
gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
audio_dspy
A Python package for audio signal processing tools
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
NBAcomebacks
Analysis of the largest comebacks in the NBA
fourier-feature-superresolution
Fourier Features for Image, Audio, and Video Super-Resolution
keras-surgeon
Pruning and other network surgery for trained Keras models.
pitch-detection
autocorrelation-based O(NlogN) pitch detection
sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
meme-vibing-cat
Vibing Cat meme generator
audio-degradation-toolbox
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
Speech-Enhancement-Measures
speech enhancement metrics:CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
loudness.py
EBU R128 / ITU-R BS.1770 integrated loudness measurement in Python
P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
homebrew-python
Homebrew tap for Python versions.
wav_logger
Real-Time Audio Logging in C++
chime4-nn-mask
Implementation of NN based mask estimator in pytorch
deep_complex_networks
Implementation related to the Deep Complex Networks
interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
CNN-for-single-channel-speech-enhancement
Convolutional neural nets for single channel speech enhancement