aflr

aflorithmic

aflr's repositories

apiaudio-python

api.audio Python SDK

Language: Python · MIT · 25 2 3

viseme-to-video

Creates video from TTS output and viseme images.

Language: Python · MIT · 11 10

aflr_npm

Aflorithmic JavaScript SDK

Language: TypeScript · 9 30

apiaudio-npm

api.audio JavaScript SDK

Language: TypeScript · 300

examples

A collection of ready-to-deploy examples for api.audio.

Language: JavaScript · 3 1 2

alexa

A very simple Alexa integration using the Aflorithmic API.

Language: JavaScript · 2 10

birdcache_examples

Example repo for the birdcache showcase.

Language: Python · 101

demo-ford

Language: Python · 1 10

openapi

An OpenAPI specification for api.audio.

Language: HTML · MIT · 100

argoflow-aws

Language: YAML · AGPL-3.0 · 010

audioexample

Language: Python · Apache-2.0 · 010

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python · MIT · 000

pythonista-chromeless

Serverless Selenium that dynamically executes any given code.

Language: Python · MIT · 000

aflr_client

Aflorithmic Client

Language: Python · 020

audiostack-sdks

010

coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python · MPL-2.0 · 000

DeepLearningExamples

Deep Learning Examples

Language: Python · 010

demo-voice-over-magic-link

Language: Python · 000

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, which further improve model performance and generalization.

Language: Python · NOASSERTION · 010

DurIAN

Implementation of the paper "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf).

Language: Python · BSD-3-Clause · 000

espnet-1

End-to-End Speech Processing Toolkit

Language: Python · Apache-2.0 · 010

kubeflow-manifests

KubeFlow on AWS

Language: Python · Apache-2.0 · 000

mozilla_TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Python · MPL-2.0 · 010

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook · Apache-2.0 · 000

news_article_summarizer

An example use case that summarizes articles from websites and then produces an MP3 file.

Language: Python · MIT · 010

nzk-alexa

Language: JavaScript · 000

phonemapper

Maps transcriptions between phone sets.

Language: Python · 010

readme-images

010

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language: Python · MIT · 010