A tool for creating talking-head videos using generative AI. It builds on the following projects and libraries:
- Tortoise-TTS - generating speech
- SDXL-Turbo - generating the digital avatar image
- One-Shot Free-View Neural Talking Head Synthesis - animating the face
- Wav2Lip - using speech to generate lip movement and superimposing it on the animated face
- Real-ESRGAN - enhancing and upscaling the image
$ git clone https://github.com/sarumaj/persona.git
$ cd persona
$ pip install -r requirements.txt
- Download the Wav2Lip weights and put them in the wav2lip directory
- Download the face detection weights and save them as wav2lip/face_detection/detection/sfd/s3fd.pth
- Create a folder named weights and place the Real-ESRGAN weight files in it
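Before the first run, it can help to confirm that the weight files ended up where the steps above expect them. The following is a minimal sketch, not part of the project: `missing_weights` is a hypothetical helper, the top-level directory is assumed to be named wav2lip, and since this README does not name the Wav2Lip or Real-ESRGAN weight files, the script only assumes that the Wav2Lip checkpoint is a `.pth` file and that the weights folder contains at least one file.

```python
from pathlib import Path


def missing_weights(root: Path = Path(".")) -> list[str]:
    """Return one message per expected weight location that looks empty or absent."""
    problems = []

    # Path taken from the setup steps; the file name s3fd.pth is given there.
    s3fd = root / "wav2lip" / "face_detection" / "detection" / "sfd" / "s3fd.pth"
    if not s3fd.is_file():
        problems.append(f"face detection weights not found: {s3fd}")

    # Assumption: the Wav2Lip checkpoint is a .pth file placed directly in wav2lip/.
    wav2lip_dir = root / "wav2lip"
    if not wav2lip_dir.is_dir() or not any(wav2lip_dir.glob("*.pth")):
        problems.append(f"no .pth checkpoint found in: {wav2lip_dir}")

    # Assumption: any regular file in weights/ counts as a Real-ESRGAN weight file.
    weights_dir = root / "weights"
    if not weights_dir.is_dir() or not any(p.is_file() for p in weights_dir.iterdir()):
        problems.append(f"no weight files found in: {weights_dir}")

    return problems


if __name__ == "__main__":
    # Run from the repository root; prints nothing when every check passes.
    for problem in missing_weights():
        print(problem)
```

Run from the cloned persona directory, the script prints one line per location that needs attention and nothing if all three checks pass.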
$ python persona.py