suldier

followers

following

stars

Suldier's repositories

GCOT

Graph Convolutional Optimal Transport for Hyperspectral Image Spectral Clustering

Language:Python9 1 1

POP909-Dataset

This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation

Language:PythonMIT1 10

ASR_Syllable

采用音节建模构建语音识别声学模型

Language:PythonGPL-3.0010

ASR_WORD

采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。

Language:PythonAGPL-3.0010

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonGPL-3.0010

athena

an open-source implementation of sequence-to-sequence based speech processing engine

Language:PythonApache-2.0010

dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Language:PythonApache-2.0010

DeepSpeech

A TensorFlow implementation of Baidu's DeepSpeech architecture

Language:C++MPL-2.0010

DualSC

Language:Python000

expressive_tacotron

Tensorflow Implementation of Expressive Tacotron

Language:Python010

faust

Functional programming language for signal processing and sound synthesis

Language:JavaScriptNOASSERTION010

FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language:PythonMIT010

gantts

PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)

Language:Jupyter NotebookNOASSERTION010

intro2musictech

公众号“无痛入门音乐科技”开源代码

Language:Jupyter Notebook010

Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Language:Python010

LPCTron

Tacotron2 + LPCNET for complete End-to-End TTS System

Language:C010

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonMIT010

MTTS

A Demo of Mandarin/Chinese TTS frontend

Language:PythonMIT010

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonMIT000

POT

POT : Python Optimal Transport

Language:PythonMIT000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION010

Sinsy-Remix

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

Language:C++NOASSERTION010

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonBSD-3-Clause000

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:Python010

suldier.github.io

Language:HTML010

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language:PythonMIT010

Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Language:PythonMIT010

Tacotron2-1

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Language:Python010

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)

Language:PythonApache-2.0010

wavenet_vocoder

WaveNet vocoder

Language:PythonNOASSERTION010