Alexander Korolev's repositories
Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
BigVGAN
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
CharsiuG2P
Multilingual G2P in 100 languages
deep-learning-german-tts
Thorsten - Open German Voice Dataset
DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
DeepSpeech
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
deepspeech-german
Automatic Speech Recognition (ASR) - German
GeneExpImgTL
TL with CNN for cancer survival prediction using gene-expression data
haystack
:mag: Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
speechbrain
A PyTorch-based Speech Toolkit
thesis-template
A LaTeX template for Bachelor and Master thesis
univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
vits_extended
TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!
wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)