Beast code in Giters

shangzengqiang's repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT000

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language:PythonBSD-3-Clause010

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Language:PythonMIT010

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLApache-2.0010

ddsp-pytorch

Implementation of DDSP (PyTorch), Differentiable Digital Signal Processing (ICLR 2020)

Language:Python010

DeepLearningExamples

Deep Learning Examples

Language:Python010

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language:PythonApache-2.0010

End-to-end-ASR-Pytorch

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Language:PythonMIT010

espnet

End-to-End Speech Processing Toolkit

Language:ShellApache-2.0010

FastSpeech

The Implementation of FastSpeech based on pytorch.

Language:Python010

ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

Language:PythonMIT010

g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Language:PythonApache-2.0010

GAN-TTS

A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

Language:Python010

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language:PythonMIT010

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language:PythonMIT010

google-research

Google Research

Language:Jupyter NotebookApache-2.0010

LPCNet

Efficient neural speech synthesis

Language:CBSD-3-Clause010

LPCTron

Tacotron2 + LPCNET for complete End-to-End TTS System

Language:C010

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Language:PythonMIT010

merlin

This is now the official location of the Merlin project.

Language:PythonApache-2.0010

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Language:Jupyter NotebookMIT010

pkuseg-python

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Language:PythonMIT010

shangqwe123.github.io

Language:HTML020

Speaker_Embedding_Torch

PyTorch based speaker embedding model

Language:PythonMIT010

tacotron2

Forked from https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/Tacotron2 and merged with https://github.com/Rayhane-mamah/Tacotron-2

Language:PythonBSD-3-Clause010

torchcrepe

Pytorch implementation of the CREPE pitch tracker

Language:PythonMIT010

UniversalVocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Language:PythonMIT010

VAEX

code f

Language:Jupyter NotebookGPL-3.0010

voice_conversion

Language:PythonMIT010

WGANSing

Multi-voice singing voice synthesis

Language:Python010