Rong GONG's starred repositories

deeplearning-models

A collection of various deep learning architectures, models, and tips

Language:Jupyter NotebookLicense:MITStargazers:16542Issues:593Issues:28

numpy-ml

Machine learning, in numpy

Language:PythonLicense:GPL-3.0Stargazers:15222Issues:457Issues:50

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:9174Issues:186Issues:560

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:5020Issues:117Issues:556

Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Language:PythonLicense:MITStargazers:2258Issues:133Issues:472

waveglow

A Flow-based Generative Network for Speech Synthesis

Language:PythonLicense:BSD-3-ClauseStargazers:2254Issues:78Issues:256

WaveRNN

WaveRNN Vocoder + TTS

Language:PythonLicense:MITStargazers:2122Issues:86Issues:227

deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Language:PythonLicense:NOASSERTIONStargazers:1961Issues:93Issues:193

gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

Language:PythonLicense:MITStargazers:1923Issues:75Issues:47

libsoundio

C library for cross-platform real-time audio input and output

rtaudio

A set of C++ classes that provide a common API for realtime audio input/output across Linux (native ALSA, JACK, PulseAudio and OSS), Macintosh OS X (CoreAudio and JACK), and Windows (DirectSound, ASIO, and WASAPI) operating systems.

Language:C++License:NOASSERTIONStargazers:1480Issues:58Issues:258

LAMA

LAnguage Model Analysis

Language:PythonLicense:NOASSERTIONStargazers:1338Issues:72Issues:48

awesome-kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

License:MITStargazers:533Issues:25Issues:0

audioread

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Language:PythonLicense:MITStargazers:482Issues:25Issues:91

SimplyCoreAudio

🔊 A Swift framework that aims to make Core Audio use less tedious in macOS

Language:SwiftLicense:MITStargazers:428Issues:12Issues:54

ClariNet

A Pytorch Implementation of ClariNet

Language:PythonLicense:MITStargazers:288Issues:23Issues:9

LearningCoreAudioWithSwift2.0

All the examples of the Learning Core Audio book rewritten with Swift 2.0

Learning-Core-Audio-Swift-SampleCode

Swift sample code for the book, Learning Core Audio. The original sample code was written in C/Objective-C but I tried to make it in Swift version.

Language:SwiftLicense:MITStargazers:154Issues:12Issues:4

parallel_wavenet_vocoder

Parallel WaveNet Vocoder Based on ClariNet

Language:PythonLicense:NOASSERTIONStargazers:146Issues:24Issues:0

os-x-ios-kernel-programming

Source code for 'OS X and iOS Kernel Programming' by Ole Henry Halvorsen and Douglas Clarke

Language:C++License:NOASSERTIONStargazers:141Issues:15Issues:0

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonLicense:MITStargazers:132Issues:17Issues:16

TalentedHack

LV2 port of Autotalent pitch correction plugin

Language:CLicense:GPL-3.0Stargazers:116Issues:10Issues:9

dtwalign

Comprehensive dynamic time warping module for python

Language:PythonLicense:MITStargazers:104Issues:7Issues:10

pySpeechRev

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

RealRIRs

Python loaders for many Real Room Impulse Response databases

Language:PythonStargazers:83Issues:6Issues:0

maracas

maracas is a library for corrupting audio files with additive and convolutive noise.

Language:PythonLicense:MITStargazers:72Issues:3Issues:4

idlak

Official home of the Idlak Speech Synthesis Toolkit

Language:ShellLicense:NOASSERTIONStargazers:66Issues:11Issues:31

gmm-hmm-asr

Python implementation of simple GMM and HMM models for isolated digit recognition.

Language:PythonLicense:Apache-2.0Stargazers:57Issues:16Issues:28

loudness.py

EBU R128 / ITU-R BS.1770 integrated loudness measurement in Python

Language:PythonLicense:MITStargazers:40Issues:3Issues:1