Marko Stamenovic's starred repositories

phaseret

Phase ReTrieval for time-frequency representations

Language:CLicense:GPL-3.0Stargazers:48Issues:0Issues:0

SPSI_Python

Single Pass Spectrogram Inversion in a Jupyter Python notebook

Language:Jupyter NotebookStargazers:33Issues:0Issues:0

gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:89Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29426Issues:0Issues:0

audio_dspy

A Python package for audio signal processing tools

Language:PythonLicense:MITStargazers:67Issues:0Issues:0

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Language:PythonLicense:GPL-3.0Stargazers:518Issues:0Issues:0

NBAcomebacks

Analysis of the largest comebacks in the NBA

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

fourier-feature-superresolution

Fourier Features for Image, Audio, and Video Super-Resolution

Language:Jupyter NotebookLicense:MITStargazers:17Issues:0Issues:0

autodsp

Train custom adaptive filter optimizers without hand tuning or extra labels.

Language:PythonLicense:NOASSERTIONStargazers:60Issues:0Issues:0

keras-surgeon

Pruning and other network surgery for trained Keras models.

Language:PythonLicense:NOASSERTIONStargazers:406Issues:0Issues:0

DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Language:PythonLicense:MITStargazers:549Issues:0Issues:0

pitch-detection

autocorrelation-based O(NlogN) pitch detection

Language:C++License:MITStargazers:559Issues:0Issues:0

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Language:Jupyter NotebookLicense:MITStargazers:300Issues:0Issues:0

meme-vibing-cat

Vibing Cat meme generator

Language:ShellStargazers:68Issues:0Issues:0

audio-degradation-toolbox

easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Language:PythonLicense:GPL-2.0Stargazers:43Issues:0Issues:0

IRAPT

Instantaneous pitch estimation based on RAPT framework (EUSIPCO-2012)

Language:MATLABLicense:GPL-3.0Stargazers:7Issues:0Issues:0

Speech-Enhancement-Measures

speech enhancement metrics:CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS

Language:MATLABStargazers:54Issues:0Issues:0

loudness.py

EBU R128 / ITU-R BS.1770 integrated loudness measurement in Python

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

P.808

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).

Language:HTMLLicense:MITStargazers:192Issues:0Issues:0

homebrew-python

Homebrew tap for Python versions.

Language:RubyStargazers:159Issues:0Issues:0

wav_logger

Real-Time Audio Logging in C++

Language:C++License:MITStargazers:6Issues:0Issues:0

chime4-nn-mask

Implementation of NN based mask estimator in pytorch

Language:PythonStargazers:30Issues:0Issues:0

deep_complex_networks

Implementation related to the Deep Complex Networks

Language:PythonLicense:MITStargazers:708Issues:0Issues:0

interspeech2019-tutorial

INTERSPEECH 2019 Tutorial Materials

Language:Jupyter NotebookStargazers:192Issues:0Issues:0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:846Issues:0Issues:0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonLicense:MITStargazers:1341Issues:0Issues:0

World

A high-quality speech analysis, manipulation and synthesis system

Language:C++License:NOASSERTIONStargazers:1132Issues:0Issues:0

Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language:PythonLicense:MITStargazers:641Issues:0Issues:0

CNN-for-single-channel-speech-enhancement

Convolutional neural nets for single channel speech enhancement

Language:PythonStargazers:140Issues:0Issues:0

PeachPy

x86-64 assembler embedded in Python

Language:PythonLicense:NOASSERTIONStargazers:1952Issues:0Issues:0