ahmet can's repositories
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
GanyuTTS
A small VITS+SOVITS/RVC TTS API
GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
Grad-SVC
Diffusion Singing Voice Conversion based on Grad-TTS from Huawei
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
hummingbot
Hummingbot is open source software that helps you build trading bots that run on any exchange or blockchain
Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
SC_VALL-E
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
so-vits-svc-4.0-v2
SoftVC VITS Singing Voice Conversion
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
voicefixer
General Speech Restoration