Kevin Wang's repositories

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language: Jupyter Notebook · License: MIT · Stargazers: 2575 · Issues: 30 · Issues: 95
Language: Python · License: MIT · Stargazers: 34 · Issues: 0 · Issues: 0
Language: Jupyter Notebook · Stargazers: 9 · Issues: 0 · Issues: 0
Language: Python · Stargazers: 7 · Issues: 0 · Issues: 0

gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python · License: MIT · Stargazers: 4 · Issues: 0 · Issues: 0

Retrieval-based-Voice-Conversion-New

Voice data <= 10 mins can also be used to train a good VC model!

Language: Jupyter Notebook · License: NOASSERTION · Stargazers: 3 · Issues: 1 · Issues: 0
Language: Python · Stargazers: 2 · Issues: 0 · Issues: 0

OpenVoice

Instant voice cloning by MyShell

Language: Python · License: NOASSERTION · Stargazers: 2 · Issues: 0 · Issues: 0
Language: Jupyter Notebook · License: NOASSERTION · Stargazers: 2 · Issues: 0 · Issues: 0

AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Language: Python · License: MIT · Stargazers: 1 · Issues: 0 · Issues: 0

DDPM-IP

Repo for our ICML 2023 paper "Input Perturbation Reduces Exposure Bias in Diffusion Models"

Language: Jupyter Notebook · License: MIT · Stargazers: 1 · Issues: 1 · Issues: 0

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Stargazers: 1 · Issues: 0 · Issues: 0

fish-speech

Brand new TTS solution

Language: Python · License: BSD-3-Clause · Stargazers: 1 · Issues: 0 · Issues: 0
Language: Python · License: MIT · Stargazers: 1 · Issues: 0 · Issues: 0
Language: Python · License: MIT · Stargazers: 1 · Issues: 0 · Issues: 0
Language: HTML · Stargazers: 1 · Issues: 0 · Issues: 0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python · License: MIT · Stargazers: 1 · Issues: 0 · Issues: 0
Language: Python · Stargazers: 1 · Issues: 0 · Issues: 0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

License: NOASSERTION · Stargazers: 1 · Issues: 0 · Issues: 0

Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0
Stargazers: 0 · Issues: 0 · Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

HairFastGAN

Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

KevinWang676

My profile

Stargazers: 0 · Issues: 2 · Issues: 0

KevinWang676.github.io

Personal website

Language: JavaScript · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0
Language: Python · License: NOASSERTION · Stargazers: 0 · Issues: 0 · Issues: 0

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

RWKV-Runner

An RWKV management and startup tool: fully automated, only 8MB, with an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

test-repo

RVC Inference with multiple model and huggingface support

Language: Python · License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

ttts

Train the next generation of TTS systems.

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0