Beast code in Giters

Yunlin Chen's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.034094 340 2666

spleeter

Deezer source separation library including pretrained models.

Language:PythonMIT25406 383 767

leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

Language:Jupyter NotebookNOASSERTION11464 264 81

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookMPL-2.09098 186 560

few-shot-vid2vid

Pytorch implementation for few-shot photorealistic video-to-video translation.

Language:PythonNOASSERTION1792 117 88

speech-driven-animation

Language:Python946 56 70

loop

A method to generate speech across multiple speakers

Language:PythonNOASSERTION871 68 75

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)

Language:PythonNOASSERTION600 29 61

matting_human_datasets

人像matting数据集，包含34427张图像和对应的matting结果图。

NOASSERTION599 8 8

ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

Language:PythonMIT578 31 68

FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language:PythonMIT491 42 20

Neural-Voice-Cloning-With-Few-Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Language:PythonMIT427 31 22

matlab-dockerfile

Create a docker container that contains a MATLAB install

Language:PythonNOASSERTION323 29 101

ClariNet

A Pytorch Implementation of ClariNet

Language:PythonMIT288 23 9

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

Language:PythonBSD-3-Clause265 17 11

Neural-Voice-Cloning-with-Few-Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Language:Python253 14 4

GAN-TTS

A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

Language:Python228 15 7

spectrogramJS

spectrogram visualization in the browser

Language:JavaScriptMIT143 12 1

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonMIT134 17 16

Audiovisual-Synthesis

Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders

Language:Python120 10 12

yang_vocoder

Language:MatlabApache-2.091 100

magphase

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Language:PythonApache-2.078 20 13

videoprocess

CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.

Language:Python70 4 5

style-token_tacotron2

style token with tacotron2

Language:PythonMIT61 7 12

Talking_Face_Generation

Talking Face Generation by Conditional Recurrent Adversarial Network

Language:Python60 7 6

PiENet

Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals

Language:PythonApache-2.050 5 1

FilterBanks_FastPythonImplementation

Filter Banks, Fast Python Implementation

Language:Python41 3 2

snreval

Objective measures of speech quality SNR

Language:MATLAB16 2 3

ahoproc_tools

Tools for Ahocoder data processing and evaluation metrics

Language:PythonMIT14 3 1

speech2vid

Language:Python1200