sunxh16's repositories

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

async_cosyvoice

Accelerates CosyVoice2 inference using vLLM

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ClariNet

A PyTorch implementation of ClariNet

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Concatenate_wav

Concatenate WAV files (for unit selection)

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

FloWaveNet

A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NeuralVoicePuppetry

This GitHub repository contains the network architectures of NeuralVoicePuppetry.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

NNPACK

Acceleration package for neural networks on multi-core CPUs

Language: C | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

nonparaSeq2seqVC_code

Implementation code for non-parallel sequence-to-sequence voice conversion (VC)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

onnxruntime

ONNX Runtime

Language: C++ | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

rigl

End-to-end training of sparse deep neural networks with little-to-no performance loss.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

seed-vc

Zero-shot voice conversion and singing voice conversion, with real-time support

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

sp2si-code

Contains code for our work on speech to singing conversion (ICASSP 2020)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

tacotron2_v1

DeepMind's Tacotron-2 TensorFlow implementation

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

wav2letter

Facebook AI Research Automatic Speech Recognition Toolkit

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

waveglow

A Flow-based Generative Network for Speech Synthesis

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

World

A high-quality speech analysis, manipulation and synthesis system

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0