shangzengqiang's repositories

gst-tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

Language:PythonStargazers:1Issues:1Issues:0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ddsp-pytorch

Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)

Language:PythonStargazers:0Issues:1Issues:0

DeepLearningExamples

Deep Learning Examples

Language:PythonStargazers:0Issues:1Issues:0

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:ShellLicense:Apache-2.0Stargazers:0Issues:1Issues:0

FastSpeech

The Implementation of FastSpeech based on pytorch.

Stargazers:0Issues:0Issues:0

ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

License:MITStargazers:0Issues:0Issues:0

g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

GAN-TTS

A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

Language:PythonStargazers:0Issues:1Issues:0

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LPCNet

Efficient neural speech synthesis

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

LPCTron

Tacotron2 + LPCNET for complete End-to-End TTS System

Language:CStargazers:0Issues:1Issues:0

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

License:MITStargazers:0Issues:0Issues:0

merlin

This is now the official location of the Merlin project.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

License:MITStargazers:0Issues:0Issues:0

pkuseg-python

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

License:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

Speaker_Embedding_Torch

PyTorch based speaker embedding model

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tacotron2

Forked from https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2 and merged with https://github.com/Rayhane-mamah/Tacotron-2

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

torchcrepe

Pytorch implementation of the CREPE pitch tracker

License:MITStargazers:0Issues:0Issues:0

UniversalVocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

VAEX

code f

License:GPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

WGANSing

Multi-voice singing voice synthesis

Language:PythonStargazers:0Issues:1Issues:0