zyser's repositories

StyleTTS

Official Implementation of StyleTTS

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

FlowVAE_E2E_TTS

Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0 | Issues: 0

lovely-tensors

Tensors, ready for human consumption

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

speech-synthesis-paper

List of speech synthesis papers.

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

aligner-pytorch

Sequence alignment methods with helpers for PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

meso-dtfa

Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NaturalSpeech2_NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Stargazers: 0 | Issues: 0 | Issues: 0

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

openvino

OpenVINO™ Toolkit repository

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

penn_Pitch-Estimating-Neural-Networks-

Pitch Estimating Neural Networks (PENN)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

PSST

Prosodic Speech Segmentation with Transformers

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pysptk

A Python wrapper for Speech Signal Processing Toolkit (SPTK).

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

PyTorch-Wavelet-Toolbox

Differentiable fast wavelet transforms in PyTorch with GPU support.

Language: Python | License: EUPL-1.2 | Stargazers: 0 | Issues: 0 | Issues: 0

SC-CNN

An Effective Style Conditioning Method for Zero-Shot Text-to-Speech System

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

spleeter

Deezer source separation library including pretrained models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

torchview

torchview: visualize PyTorch models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

univnet-1

Unofficial PyTorch Implementation of UnivNet Vocoder

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0