Yunlin Chen (linzai1992)

Company: @Microsoft

Location: Suzhou

Yunlin Chen's starred repositories

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language: Python | License: NOASSERTION | Stargazers: 34996 | Issues: 501 | Issues: 461

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12265 | Issues: 166 | Issues: 497

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language: Python | License: Apache-2.0 | Stargazers: 7737 | Issues: 108 | Issues: 354

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language: Jupyter Notebook | License: MIT | Stargazers: 3366 | Issues: 66 | Issues: 89

Language: Python | License: NOASSERTION | Stargazers: 3168 | Issues: 159 | Issues: 111

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language: C | License: NOASSERTION | Stargazers: 1939 | Issues: 48 | Issues: 81

ai-audio-startups

Community list of startups working with AI in audio and music technology

Language: Python | License: MIT | Stargazers: 1198 | Issues: 59 | Issues: 47

LPCNet

Efficient neural speech synthesis

Language: C | License: BSD-3-Clause | Stargazers: 1115 | Issues: 73 | Issues: 194

LipGAN

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Language: Python | License: MIT | Stargazers: 578 | Issues: 26 | Issues: 42

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

StyleTTS

Official Implementation of StyleTTS

Language: Python | License: MIT | Stargazers: 371 | Issues: 33 | Issues: 70

wikipron

Massively multilingual pronunciation mining

Language: Python | License: Apache-2.0 | Stargazers: 296 | Issues: 17 | Issues: 157

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language: Python | License: MIT | Stargazers: 185 | Issues: 7 | Issues: 3

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language: Python | License: NOASSERTION | Stargazers: 169 | Issues: 10 | Issues: 14

AdaIN-VC

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language: Python | License: MIT | Stargazers: 104 | Issues: 8 | Issues: 11

facestar

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Language: Python | License: NOASSERTION | Stargazers: 98 | Issues: 10 | Issues: 1

Language: Python | License: MIT | Stargazers: 54 | Issues: 11 | Issues: 2

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models".

Language: Python | License: MIT | Stargazers: 44 | Issues: 6 | Issues: 0

AVSU-VIPL

Collection of works from VIPL-AVSU

VITSinger

Singing Voice Speech modeling test

Language: Python | License: MIT | Stargazers: 35 | Issues: 4 | Issues: 2

jphones

A Python3 program for converting Japanese words and numbers into phonemes.

Language: Python | License: MIT | Stargazers: 16 | Issues: 4 | Issues: 0

GreenScreenMatting

This is an implementation of Green Screen Matting.

Language: C++ | Stargazers: 14 | Issues: 2 | Issues: 0

796_S22_v1

A temporary repository for 796 v1 submissions

Language: Jupyter Notebook | Stargazers: 7 | Issues: 3 | Issues: 0