JustKowalski's repositories
ConferencesStastics
The stastics information of top conference realted to information area including AI, ML, CV, etc
asteroid
The PyTorch-based audio source separation toolkit for researchers
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
DeepFilterNet
Noise supression using deep filtering
MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Sound_event_detection
This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning and guided learning (GL) for semi-supervised learning. We're so glad if you're interested in using it for research purpose or DCASE participation.
speech_recognition
end2end asr system with ctc + dynamic cnn transformer, well organized using custom template
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
wespeaker
Research and Production Oriented Speaker Recognition Toolkit