zachx121's repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
roop
one-click face swap
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Bert-VITS2
vits2 backbone with bert
SimSwap
An arbitrary face-swapping framework on images and videos with one single trained model!
so-vits-svc-realtime
so-vits-svc fork with realtime support, improved interface and more features.
stable-diffusion-webui
Stable Diffusion web UI
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
DeepFaceLive
Real-time face swap for PC streaming or video calls
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
ObjDetection
object detection through yolov5.