sarumaj / persona

Talking head video AI generator

Persona

Tool for creating talking head videos using generative AI. Uses the following projects/libraries:

Tortoise-TTS - generating speech
SDXL-Turbo - generating the digital avatar image
One-Shot Free-View Neural Talking Head Synthesis - animating the face
Wav2Lip - using speech to generate lip movement and superimposing it on the animated face
Real ESRGAN - improving the image and scaling it

Setup

$ git clone https://github.com/sarumaj/persona.git
$ cd persona
$ pip install -r requirements.txt

Download Way2Lib weights and put them in the wav2lip directory

Download Face detection weights and place them in the directory wave2lip/face_detection/detection/sfd/s3fd.pth

Create a folder a folder weights and place the Real-ESRGAN weight files in there

Run

$ python persona.py

About

Talking head video AI generator

Languages

Language:Python 98.1%Language:Cuda 1.6%Language:C++ 0.2%