p0p's repositories
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
Matcha-TTS
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
humble-gumbel
Jupyter notebook on Gumbel-max and Gumbel-softmax tricks
label-studio-converter
Tools for converting Label Studio annotations into common dataset formats
MagneticData
MagWi + mobile dataset
ModifiedOpenLabelling
A modified version of https://github.com/Cartucho/OpenLabeling OpenLabelling tool
paraspeechcaps
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
python-inquirer
A collection of common interactive command line user interfaces, based on Inquirer.js (https://github.com/SBoudrias/Inquirer.js/)
speechbrain
A PyTorch-based Speech Toolkit
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
transformer-walkthrough
A walkthrough of transformer architecture code
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers