p0p's repositories

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language:PythonLicense:MITStargazers:513Issues:23Issues:59

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonLicense:MITStargazers:222Issues:14Issues:43

Matcha-TTS-2

E2E TTS using Conditional Flow Matching (Experimental*)

Language:Jupyter NotebookLicense:MITStargazers:69Issues:10Issues:4

CoquiTTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

g2pK

g2pK: g2p module for Korean

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

humble-gumbel

Jupyter notebook on Gumbel-max and Gumbel-softmax tricks

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

kss

Kss: A Toolkit for Korean sentence segmentation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

label-studio-converter

Tools for converting Label Studio annotations into common dataset formats

Language:PythonStargazers:0Issues:1Issues:0

MagneticData

MagWi + mobile dataset

Stargazers:0Issues:2Issues:0

ModifiedOpenLabelling

A modified version of https://github.com/Cartucho/OpenLabeling OpenLabelling tool

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:SCSSLicense:MITStargazers:0Issues:2Issues:0

paraspeechcaps

Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

python-inquirer

A collection of common interactive command line user interfaces, based on Inquirer.js (https://github.com/SBoudrias/Inquirer.js/)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

transformer-walkthrough

A walkthrough of transformer architecture code

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

UJIdata

Data for UJI

Stargazers:0Issues:2Issues:0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:0Issues:1Issues:0