Pavel Denisov's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
mkchromecast
Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
chinese_speech_pretrain
Chinese speech pretrained models
PyTorch-Model-Compare
Compare neural networks by their feature similarity
SpeechTransProgress
Tracking the progress in end-to-end speech translation
libriheavy
Libriheavy: a 50,000-hour ASR corpus with punctuation, casing, and context
Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
bk_clustering
Burj Khalifa Clustering method
speaker-anonymization
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
mt-bigscience
Evaluation results for Machine Translation within the BigScience project
zamia-speech
Open tools and data for cloudless automatic speech recognition