JustKowalski's repositories

Language:PythonLicense:Apache-2.0Stargazers:7Issues:2Issues:0

ConferencesStastics

The stastics information of top conference realted to information area including AI, ML, CV, etc

Stargazers:1Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

License:GPL-2.0Stargazers:0Issues:0Issues:0

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Sound_event_detection

This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning and guided learning (GL) for semi-supervised learning. We're so glad if you're interested in using it for research purpose or DCASE participation.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

speech_recognition

end2end asr system with ctc + dynamic cnn transformer, well organized using custom template

Language:PythonStargazers:0Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers:0Issues:0Issues:0

wespeaker

Research and Production Oriented Speaker Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0