Mateo Cedillo's repositories
My-Colab-Notebooks
Our colab notebooks about AI's or Python in the cloud.
Piper-Training-Guide-with-Screen-Reader
A guide to help newcomers to the Piper TTS system create voices for NVDA and other screen readers down the line.
FakeYou-Tacotron2-Notebook
Tacotron2 Training Notebook for FakeYou.com
tts-dataset-guidelines
Guide and material for the build of a good voice corpus for the purpose of use in a screen reader.
Universal-calculator
An accessible advanced calculator where you can do much math formulas and operations.
AHK-scripts-for-accessibility
Screen Reader Accessibility Scripts and Utilities, Now in Auto HotKey
DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
gdown
Download a large file from Google Drive (curl/wget fails because of the security notice).
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
kabooks
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using audiobooks, KABooks will generate dataset with segmented audios and aligned texts.
kathleen
US voice for RHVoice
Matcha-TTS
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
packages
RHVoice data package directory
piper-nvda
This add-on implements a speech synthesizer driver for NVDA using [Piper](https://github.com/rhasspy/piper).
piper-phonemize
C++ library for converting text to phonemes for Piper
RHVoice
a free and open source speech synthesizer for Russian and other languages
toneMaster
Plays monophonic tone sequences by using NVDA beeps and tone data files.
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
vits-2-finetune
Fine-tune VITS-2 easier.
vits2_pytorch
unofficial vits2-TTS implementation in pytorch