aflr's repositories
apiaudio-python
api.audio Python SDK
viseme-to-video
Creates video from TTS output and viseme images.
apiaudio-npm
api.audio Javascript SDK
birdcache_examples
example repo for birdcache showcase
NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
pythonista-chromeless
Serverless selenium which dynamically execute any given code.
aflr_client
Aflorithmic Client
coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepLearningExamples
Deep Learning Examples
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
DurIAN
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
kubeflow-manifests
KubeFlow on AWS
mozilla_TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
NeMo
NeMo: a toolkit for conversational AI
news_article_summarizer
An example use case built to summarize articles from websites and then produce an mp3 file.
phonemapper
Maps transcriptions between phone sets.
SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck