cc-cherie's repositories
awesome-matlab
A curated list of awesome Matlab frameworks, libraries and software.
cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
ConferencingSpeech2021
Conferencing Speech Challenge
deep_learning_based_speech_enhancement_keras_python
deep learning based speech enhancement using keras python, make it easy to use
Instrument_Classification_Paper
Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music, NTUA
kaldi-enhan
Tools for speech enhancement based on kaldi
MaiGenre
Implemention of Classification Algorithms for Music Genre Features
Microphone_Array_Beamforming
A simple demo display of a sound source localization by traditional beamforming algorithm.
mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
neural-networks-and-deep-learning
Code samples for my book "Neural Networks and Deep Learning"
pocketsphinx
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
rnnoise_orignal
Recurrent neural network for audio noise reduction
SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
SourceLocalization
SPR-PHAT source localization
speech-denoising-wavenet
A neural network for end-to-end speech denoising
speechbrain
A PyTorch-based Speech Toolkit
Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
A two-stage polyphonic sound event detection and localization method. It can achieve the best scores for both SED and DOA.
voxceleb_trainer
In defence of metric learning for speaker recognition
WebRTC_AGC
Automatic Gain Control Module Port From WebRTC