Beast code in Giters

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language:PythonApache-2.0771300

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Language:Python52500

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language:JavaScript17000

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.0146100

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.01205300

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language:Jupyter NotebookMIT334100

LipGAN

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Language:PythonMIT57800

STIT

Language:PythonMIT119500

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION190500

wikipron

Massively multilingual pronunciation mining

Language:PythonApache-2.029300

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models

Language:PythonMIT4400

StyleTTS

Official Implementation of StyleTTS

Language:PythonMIT35800

AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language:PythonMIT10300

796_S22_v1

A temporary repository for 796 v1 submissions

Language:Jupyter Notebook700

LPCNet

Efficient neural speech synthesis

Language:CBSD-3-Clause111400