Wangzhen-kris

Wangzhen's starred repositories

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python | License: MIT | Stars: 327

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation

Language: Python | License: MIT | Stars: 187

STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Language: Python | License: MIT | Stars: 156

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | Stars: 187

EA-SVC

An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

Language: Python | License: MIT | Stars: 123

FCH-TTS

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).

Language: Python | License: MIT | Stars: 244

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language: Python | License: MIT | Stars: 632

vae_tacotron

Language: Python | License: MIT | Stars: 51

BVAE-TTS

Official implementation of BVAE-TTS

Language: Python | License: MIT | Stars: 169

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | Stars: 821

style-token_tacotron2

Style token with Tacotron 2

Language: Python | License: MIT | Stars: 61

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stars: 1726

speech-synthesis-paper

List of speech synthesis papers.

License: MIT | Stars: 979

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Language: Python | License: BSD-3-Clause | Stars: 632

rzsz

lrzsz upload/download configuration for macOS, plus the two required .sh files: iterm2-recv-zmodem.sh and iterm2-send-zmodem.sh

Language: Shell | Stars: 256

image-captioning-bottom-up-top-down

PyTorch implementation of Image captioning with Bottom-up, Top-down Attention

Language: Python | Stars: 161

SceneGraphParser

A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Language: Python | License: MIT | Stars: 525

treelstm.pytorch

Tree LSTM implementation in PyTorch

Language: Python | License: MIT | Stars: 550

gcn

Implementation of Graph Convolutional Networks in TensorFlow

Language: Python | License: MIT | Stars: 7057

show-control-and-tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Language: Python | License: BSD-3-Clause | Stars: 282

VCTree-Visual-Question-Answering

Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"

Language: Python | License: MIT | Stars: 35

bottom-up-attention-tf

Unofficial TensorFlow implementation of "Bottom-up and Top-down attention for VQA" (TF v. 1.13)

Language: Python | License: MIT | Stars: 39

gae

Implementation of Graph Auto-Encoders in TensorFlow

Language: Python | License: MIT | Stars: 1632

visual_genome_python_driver

A Python wrapper for the Visual Genome API

Language: Jupyter Notebook | License: MIT | Stars: 352