Wangzhen's starred repositories

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python | License: MIT | Stargazers: 327 | Issues: 0

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation

Language: Python | License: MIT | Stargazers: 187 | Issues: 0

STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Language: Python | License: MIT | Stargazers: 156 | Issues: 0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | Stargazers: 187 | Issues: 0

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

Language: Python | License: MIT | Stargazers: 123 | Issues: 0

FCH-TTS

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).

Language: Python | License: MIT | Stargazers: 244 | Issues: 0

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language: Python | License: MIT | Stargazers: 632 | Issues: 0
Language: Python | License: MIT | Stargazers: 51 | Issues: 0

BVAE-TTS

Official implementation of BVAE-TTS

Language: Python | License: MIT | Stargazers: 169 | Issues: 0

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | Stargazers: 821 | Issues: 0

style-token_tacotron2

style token with tacotron2

Language: Python | License: MIT | Stargazers: 61 | Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stargazers: 1726 | Issues: 0

speech-synthesis-paper

List of speech synthesis papers.

License: MIT | Stargazers: 979 | Issues: 0

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Language: Python | License: BSD-3-Clause | Stargazers: 632 | Issues: 0

rzsz

macOS configuration for uploading and downloading with lrzsz, plus the two required .sh files: iterm2-recv-zmodem.sh and iterm2-send-zmodem.sh

Language: Shell | Stargazers: 256 | Issues: 0

image-captioning-bottom-up-top-down

PyTorch implementation of Image captioning with Bottom-up, Top-down Attention

Language: Python | Stargazers: 161 | Issues: 0

SceneGraphParser

A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Language: Python | License: MIT | Stargazers: 525 | Issues: 0

treelstm.pytorch

Tree LSTM implementation in PyTorch

Language: Python | License: MIT | Stargazers: 550 | Issues: 0

gcn

Implementation of Graph Convolutional Networks in TensorFlow

Language: Python | License: MIT | Stargazers: 7057 | Issues: 0

show-control-and-tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Language: Python | License: BSD-3-Clause | Stargazers: 282 | Issues: 0

VCTree-Visual-Question-Answering

Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"

Language: Python | License: MIT | Stargazers: 35 | Issues: 0

bottom-up-attention-tf

Unofficial TensorFlow implementation of "Bottom-up and Top-down attention for VQA" (TF v. 1.13)

Language: Python | License: MIT | Stargazers: 39 | Issues: 0

gae

Implementation of Graph Auto-Encoders in TensorFlow

Language: Python | License: MIT | Stargazers: 1632 | Issues: 0

visual_genome_python_driver

A Python wrapper for the Visual Genome API

Language: Jupyter Notebook | License: MIT | Stargazers: 352 | Issues: 0