MuruganR96

Mraj96's repositories

Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python · License: AGPL-3.0

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language: Python

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language: Python · License: MIT

Diffusion-SVC

Language: Python · License: MIT

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Language: Python · License: MIT

F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Language: Python · License: MIT

F5-TTS-with-ForcedAlignment-DurationPredictor

Based on Official code of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching". This work uses phoneme-level forced alignment to stabilize the generation process.

License: MIT

fish-diffusion

An easy to understand TTS / SVS / SVC framework

Language: Python · License: MIT

free-svc

[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

License: MIT

generative-ai-for-beginners

12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook · License: MIT

glow-svc

singing voice conversion based on glow-tts

Language: Python · License: MIT

Grad-SVC

Singing Voice Conversion based on Grad-TTS. The core algorithm is diffusion.

Language: Python · License: MIT

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python · License: MIT

hifigan-yingram-vc

Language: Jupyter Notebook · License: MIT

lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Language: Python · License: MIT

moshi

License: Apache-2.0

MyArxiv

Language: CSS · License: GPL-2.0

PitchVC

PitchVC: Pitch Conditioned Any-to-Many Voice Conversion

Language: Python · License: MIT

pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

License: MIT

PPG-GradVC

A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Language: Python · License: MIT

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language: Python · License: MIT

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python · License: Apache-2.0

so-vits-svc-2

SoftVC VITS Singing Voice Conversion

Language: Python · License: AGPL-3.0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language: Python · License: NOASSERTION

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python · License: MPL-2.0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language: Python · License: MIT

voice-changer

License: NOASSERTION

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook · License: NOASSERTION

YoutubeScrapping

License: Apache-2.0