Suldier's repositories

GCOT

Graph Convolutional Optimal Transport for Hyperspectral Image Spectral Clustering

POP909-Dataset

This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

ASR_Syllable

采用音节建模构建语音识别声学模型

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

ASR_WORD

采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

athena

an open-source implementation of sequence-to-sequence based speech processing engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

DeepSpeech

A TensorFlow implementation of Baidu's DeepSpeech architecture

Language:C++License:MPL-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

expressive_tacotron

Tensorflow Implementation of Expressive Tacotron

Language:PythonStargazers:0Issues:1Issues:0

faust

Functional programming language for signal processing and sound synthesis

Language:JavaScriptLicense:NOASSERTIONStargazers:0Issues:1Issues:0

FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

intro2musictech

公众号“无痛入门音乐科技”开源代码

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Language:PythonStargazers:0Issues:1Issues:0

LPCTron

Tacotron2 + LPCNET for complete End-to-End TTS System

Language:CStargazers:0Issues:1Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

MTTS

A Demo of Mandarin/Chinese TTS frontend

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

POT

POT : Python Optimal Transport

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Sinsy-Remix

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:PythonStargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Tacotron2-1

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Language:PythonStargazers:0Issues:1Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

wavenet_vocoder

WaveNet vocoder

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0