aflr (aflorithmic)

aflr

aflorithmic

Geek Repo

Github PK Tool:Github PK Tool

aflr's repositories

apiaudio-python

api.audio Python SDK

Language:PythonLicense:MITStargazers:25Issues:2Issues:3

viseme-to-video

Creates video from TTS output and viseme images.

Language:PythonLicense:MITStargazers:11Issues:1Issues:0

aflr_npm

Aflorithmic Javascript SDK

Language:TypeScriptStargazers:9Issues:3Issues:0

apiaudio-npm

api.audio Javascript SDK

Language:TypeScriptStargazers:3Issues:0Issues:0

examples

A collection of ready-to-deploy examples for api.audio.

Language:JavaScriptStargazers:3Issues:1Issues:2

alexa

a very simple alexa integration using aflorithmic API

Language:JavaScriptStargazers:2Issues:1Issues:0

birdcache_examples

example repo for birdcache showcase

Language:PythonStargazers:1Issues:0Issues:1
Language:PythonStargazers:1Issues:1Issues:0

openapi

An OpenAPI specification for api.audio.

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0
Language:YAMLLicense:AGPL-3.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pythonista-chromeless

Serverless selenium which dynamically execute any given code.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

aflr_client

Aflorithmic Client

Language:PythonStargazers:0Issues:2Issues:0
Stargazers:0Issues:1Issues:0

coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

DeepLearningExamples

Deep Learning Examples

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

DurIAN

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

espnet-1

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

kubeflow-manifests

KubeFlow on AWS

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mozilla_TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:PythonLicense:MPL-2.0Stargazers:0Issues:1Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

news_article_summarizer

An example use case built to summarize articles from websites and then produce an mp3 file.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

phonemapper

Maps transcriptions between phone sets.

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language:PythonLicense:MITStargazers:0Issues:1Issues:0