shaun95

zyser's repositories

StyleTTS

Official Implementation of StyleTTS

Language: Python · License: MIT · 200

FlowVAE_E2E_TTS

Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion

Language: Jupyter Notebook · 100

lovely-tensors

Tensors, ready for human consumption

Language: Jupyter Notebook · License: MIT · 100

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language: Python · License: MIT · 100

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language: Python · License: Apache-2.0 · 1 10

speech-synthesis-paper

List of speech synthesis papers.

License: MIT · 100

AdaSpeech-Adaptive-Text-to-Speech-for-Custom-Voice

Language: Python · 000

aligner-pytorch

Sequence alignment methods with helpers for PyTorch.

Language: Python · License: MIT · 000

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python · License: NOASSERTION · 000

dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps"

Language: Python · License: MIT · 000

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language: Python · License: MIT · 000

HarmoF0

Language: Python · License: MIT · 000

HierTTS

Language: Python · 000

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language: Python · License: MIT · 000

meso-dtfa

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)

Language: Python · License: MIT · 000

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python · License: MIT · 000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python · License: MIT · 000

NaturalSpeech2_NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

000

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python · License: MIT · 000

openvino

OpenVINO™ Toolkit repository

Language: C++ · License: Apache-2.0 · 000

penn_Pitch-Estimating-Neural-Networks-

Pitch Estimating Neural Networks (PENN)

Language: Python · License: MIT · 000

PSST

Prosodic Speech Segmentation with Transformers

Language: Jupyter Notebook · License: MIT · 000

pysptk

A Python wrapper for Speech Signal Processing Toolkit (SPTK).

Language: Python · License: NOASSERTION · 000

PyTorch-Wavelet-Toolbox

Differentiable fast wavelet transforms in PyTorch with GPU support.

Language: Python · License: EUPL-1.2 · 000

SC-CNN

An Effective Style Conditioning Method for Zero-Shot Text-to-Speech System

Language: Python · License: MIT · 000

score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations

Language: Jupyter Notebook · License: Apache-2.0 · 000

spleeter

Deezer source separation library including pretrained models.

Language: Python · License: MIT · 000

torchview

torchview: visualize pytorch models

Language: Python · License: MIT · 000

univnet-1

Unofficial PyTorch Implementation of UnivNet Vocoder

Language: Python · License: BSD-3-Clause · 000

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Language: Jupyter Notebook · License: NOASSERTION · 010