railsloes's repositories
Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
awesome-python-scientific-audio
Curated list of python software and packages related to scientific research in audio
BDA_course_Aalto
Bayesian Data Analysis course at Aalto
BDA_py_demos
Bayesian Data Analysis demos for Python
BDA_R_demos
Bayesian Data Analysis demos for R
blist-hugo-theme
Blist is a clean and fast blog theme for your Hugo site.
blow
Code to train and run Blow
craig
Craig is a multi-track voice recorder for Discord.
DDSP-48kHz-Stereo
A 48kHz/stereo implementation of Google Magenta's DDSP. Also includes variable audio file render length.
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
freqtrade
Free, open source crypto trading bot
Gdocs
Toos to manage GDocs interactions
hugo-profile
A highly customizable and mobile first Hugo template for personal portfolio and blog.
Knowledge_distillation_via_TF2.0
The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
NLP-Models-Tensorflow
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
rnnoise
Recurrent neural network for audio noise reduction
StyleTTS
Official Implementation of StyleTTS
tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
tf_unet
Generic U-Net Tensorflow implementation for image segmentation
uberduck-ml-dev
ML models for Uberduck
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).