Official Implementation of "Seeing is Believing: A Novel Approach to Voice Synthesis from Facial Images Using Zero-shot TTS"