Yunlin Chen (linzai1992)

Company: @Microsoft

Location: Suzhou

Yunlin Chen's starred repositories

facestar

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Language: Python | License: NOASSERTION | Stargazers: 98 | Issues: 0
Language: Python | Stargazers: 96 | Issues: 0

AdaIN-VC

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

Language: Python | Stargazers: 112 | Issues: 0

VITSinger

Singing Voice Speech modeling test

Language: Python | License: MIT | Stargazers: 35 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 167 | Issues: 0
Language: Python | License: MIT | Stargazers: 54 | Issues: 0
Language: Python | License: MIT | Stargazers: 490 | Issues: 0

jphones

A Python3 program for converting Japanese words and numbers into phonemes.

Language: Python | License: MIT | Stargazers: 16 | Issues: 0

AVSU-VIPL

Collection of works from VIPL-AVSU

Stargazers: 38 | Issues: 0

GreenScreenMatting

This is an implementation of Green Screen Matting.

Language: C++ | Stargazers: 14 | Issues: 0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language: Python | License: NOASSERTION | Stargazers: 34789 | Issues: 0

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language: Python | License: MIT | Stargazers: 182 | Issues: 0
Language: Python | Stargazers: 493 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 3144 | Issues: 0
License: Apache-2.0 | Stargazers: 1524 | Issues: 0

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language: Python | License: Apache-2.0 | Stargazers: 7713 | Issues: 0

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Language: Python | Stargazers: 525 | Issues: 0

Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Language: JavaScript | Stargazers: 170 | Issues: 0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | Stargazers: 1461 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12053 | Issues: 0

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language: Jupyter Notebook | License: MIT | Stargazers: 3341 | Issues: 0

LipGAN

This repository contains the code for LipGAN. LipGAN was published as part of the paper "Towards Automatic Face-to-Face Translation".

Language: Python | License: MIT | Stargazers: 578 | Issues: 0
Language: Python | License: MIT | Stargazers: 1195 | Issues: 0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language: C | License: NOASSERTION | Stargazers: 1905 | Issues: 0

wikipron

Massively multilingual pronunciation mining

Language: Python | License: Apache-2.0 | Stargazers: 293 | Issues: 0

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models".

Language: Python | License: MIT | Stargazers: 44 | Issues: 0

StyleTTS

Official Implementation of StyleTTS

Language: Python | License: MIT | Stargazers: 358 | Issues: 0

AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language: Python | License: MIT | Stargazers: 103 | Issues: 0

796_S22_v1

A temporary repository for 796 v1 submissions

Language: Jupyter Notebook | Stargazers: 7 | Issues: 0

LPCNet

Efficient neural speech synthesis

Language: C | License: BSD-3-Clause | Stargazers: 1114 | Issues: 0