Beast code in Giters

Yunlin Chen's starred repositories

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language:PythonNOASSERTION34996 501 461

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.012265 166 497

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language:PythonApache-2.07737 108 354

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language:Jupyter NotebookMIT3366 66 89

eg3d

Language:PythonNOASSERTION3168 159 111

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION1939 48 81

parti

Apache-2.01521 56 9

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.01485 65 5

STIT

Language:PythonMIT1198 59 47

LPCNet

Efficient neural speech synthesis

Language:CBSD-3-Clause1115 73 194

LipGAN

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Language:PythonMIT578 26 42

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Language:Python529 15 34

wav2lip_288x288

Language:PythonMIT514 18 147

video-preprocessing

Language:Python496 16 35

StyleTTS

Official Implementation of StyleTTS

Language:PythonMIT371 33 70

wikipron

Massively multilingual pronunciation mining

Language:PythonApache-2.0296 17 157

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language:PythonMIT185 7 3

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language:JavaScript170 6 7

M4Singer

Language:PythonNOASSERTION169 10 14

AdaIN-VC

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

Language:Python112 4 14

AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language:PythonMIT104 8 11

facestar

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Language:PythonNOASSERTION98 10 1

S2VC

Language:Python96 5 8

VISinger

Language:PythonMIT54 11 2

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models

Language:PythonMIT44 60

AVSU-VIPL

Collection of works from VIPL-AVSU

39 6 3

VITSinger

Singing Voice Speech modeling test

Language:PythonMIT35 4 2

jphones

A Python3 program for converting Japanese words and numbers into phonemes.

Language:PythonMIT16 40

GreenScreenMatting

This is an implementation of Green Screen Matting.

Language:C++14 20

796_S22_v1

A temporary repository for 796 v1 submissions

Language:Jupyter Notebook7 30