Pawel Cyrta's repositories
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
UrbanSounds
SONYC project - UrbanSounds - dataset of sound recordings from New York, trying to recreate papers that classify the sound sources in the stream
ICPC2015-dataset
ICPC2015 - Dataset of International Chopin Piano Competition 2015
broadcast-news-videos-dataset
Collection of broadcast news video clips
50languages
Corpus, dataset of speech recording in 50 languages
docker-kaldi
Kaldi ASR (speech-to-text engine) Docker Images
GenChanSim
Generic Channel Simulator for VHF/UHF (WBHF) voice channel - in air radio voice distortion generator
dictaphone
Free phonetic dictionaries for automatic speech recognition
nlp_workshops
Let's dive into text analysis
pocketsphinx-android-build
Environment for automatic build of PocketSphinx Android app (speech recognition)
asteroid
The PyTorch-based audio source separation toolkit for researchers
awesome-deep-learning-papers-reading-notes
Notes and reading list on machine learning and deep learning research publications
awesome-linuxaudio
A list of software and resources for professional audio/video/live events production on Linux.
cheat-scripts
because you cant remember everything
dockerfiles
Compilation of Dockerfiles with automated builds enabled on the Docker Registry
kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
textnormalizer
text normalization and cleaning - cli and python package [WIP]
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
wavenet_vocoder
WaveNet vocoder