zyser's repositories
FlowVAE_E2E_TTS
Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion
lovely-tensors
Tensors, ready for human consumption
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
speech-synthesis-paper
List of speech synthesis papers.
aligner-pytorch
Sequence alignement methods with helpers for PyTorch.
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps"
FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
meso-dtfa
Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
NaturalSpeech2_NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
nnsvs
Neural network-based singing voice synthesis library for research
openvino
OpenVINO™ Toolkit repository
penn_Pitch-Estimating-Neural-Networks-
Pitch Estimating Neural Networks (PENN)
PSST
Prosodic Speech Segmentation with Transformers
pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
PyTorch-Wavelet-Toolbox
Differentiable fast wavelet transforms in PyTorch with GPU support.
SC-CNN
An Effective Style Conditioning Method for Zero-Shot Text-to-Speech System
score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations
spleeter
Deezer source separation library including pretrained models.
torchview
torchview: visualize pytorch models
univnet-1
Unofficial PyTorch Implementation of UnivNet Vocoder