wintdkyo's repositories
ControllableTalkNet
A web app that lets you play around with TalkNet models
CycleGAN-VC2
Voice Conversion by CycleGAN
DeepFaceLab
DeepFaceLab is a tool that utilizes machine learning to replace faces in videos. Includes prebuilt ready to work standalone Windows 7,8,10 binary (look readme.md).
faceit_live
Swap your face in realtime to someone's else.
faceit_live3
This is an update to faceit_live using first order model
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on
NeMo
NeMo: a toolkit for conversational AI
Neural-Photo-Editor
A simple interface for editing natural photos with generative neural networks.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.