ahmet can's repositories

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:C#Stargazers:0Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:VueStargazers:0Issues:0Issues:0

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

License:NOASSERTIONStargazers:0Issues:0Issues:0

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:DartStargazers:0Issues:0Issues:0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

License:MITStargazers:0Issues:0Issues:0
Language:DartStargazers:0Issues:0Issues:0

GanyuTTS

A small VITS+SOVITS/RVC TTS API

Stargazers:0Issues:0Issues:0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

License:MITStargazers:0Issues:0Issues:0

Grad-SVC

Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei

License:MITStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

License:MITStargazers:0Issues:0Issues:0

hummingbot

Hummingbot is open source software that helps you build trading bots that run on any exchange or blockchain

License:Apache-2.0Stargazers:0Issues:0Issues:0

Music-Demixing-with-Band-Split-RNN

An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

License:MITStargazers:0Issues:0Issues:0

SC_VALL-E

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

License:MITStargazers:0Issues:0Issues:0

so-vits-svc-4.0-v2

SoftVC VITS Singing Voice Conversion

License:MITStargazers:0Issues:0Issues:0

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

License:MITStargazers:0Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stargazers:0Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

License:Apache-2.0Stargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

License:MITStargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

License:MITStargazers:0Issues:0Issues:0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

License:MITStargazers:0Issues:0Issues:0

voicefixer

General Speech Restoration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0