Yunlin Chen (linzai1992)

linzai1992

Geek Repo

Company:@Microsoft

Location:Suzhou

Github PK Tool:Github PK Tool

Yunlin Chen's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34094Issues:340Issues:2666

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:25406Issues:383Issues:767

leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:11464Issues:264Issues:81

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:9098Issues:186Issues:560

few-shot-vid2vid

Pytorch implementation for few-shot photorealistic video-to-video translation.

Language:PythonLicense:NOASSERTIONStargazers:1792Issues:117Issues:88

loop

A method to generate speech across multiple speakers

Language:PythonLicense:NOASSERTIONStargazers:871Issues:68Issues:75

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Language:PythonLicense:NOASSERTIONStargazers:600Issues:29Issues:61

matting_human_datasets

人像matting数据集,包含34427张图像和对应的matting结果图。

ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

Language:PythonLicense:MITStargazers:578Issues:31Issues:68

FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language:PythonLicense:MITStargazers:491Issues:42Issues:20

Neural-Voice-Cloning-With-Few-Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Language:PythonLicense:MITStargazers:427Issues:31Issues:22

matlab-dockerfile

Create a docker container that contains a MATLAB install

Language:PythonLicense:NOASSERTIONStargazers:323Issues:29Issues:101

ClariNet

A Pytorch Implementation of ClariNet

Language:PythonLicense:MITStargazers:288Issues:23Issues:9

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language:PythonLicense:BSD-3-ClauseStargazers:265Issues:17Issues:11

Neural-Voice-Cloning-with-Few-Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

GAN-TTS

A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

spectrogramJS

spectrogram visualization in the browser

Language:JavaScriptLicense:MITStargazers:143Issues:12Issues:1

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonLicense:MITStargazers:134Issues:17Issues:16

Audiovisual-Synthesis

Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders

Language:MatlabLicense:Apache-2.0Stargazers:91Issues:10Issues:0

magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Language:PythonLicense:Apache-2.0Stargazers:78Issues:20Issues:13

videoprocess

CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.

style-token_tacotron2

style token with tacotron2

Language:PythonLicense:MITStargazers:61Issues:7Issues:12

Talking_Face_Generation

Talking Face Generation by Conditional Recurrent Adversarial Network

PiENet

Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals

Language:PythonLicense:Apache-2.0Stargazers:50Issues:5Issues:1

FilterBanks_FastPythonImplementation

Filter Banks, Fast Python Implementation

snreval

Objective measures of speech quality SNR

ahoproc_tools

Tools for Ahocoder data processing and evaluation metrics

Language:PythonLicense:MITStargazers:14Issues:3Issues:1
Language:PythonStargazers:12Issues:0Issues:0