railsloes

railsloes's repositories

AI-test

000

Audiovisual-Synthesis

Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders

Language: Python · 000

avatarify

Avatars for Zoom, Skype and other video-conferencing apps.

Language: Python · License: NOASSERTION · 010

awesome-python-scientific-audio

Curated list of python software and packages related to scientific research in audio

000

BDA_course_Aalto

Bayesian Data Analysis course at Aalto

Language: TeX · 000

BDA_py_demos

Bayesian Data Analysis demos for Python

Language: Jupyter Notebook · License: GPL-3.0 · 000

BDA_R_demos

Bayesian Data Analysis demos for R

Language: HTML · License: BSD-3-Clause · 000

blist-hugo-theme

Blist is a clean and fast blog theme for your Hugo site.

Language: HTML · License: MIT · 000

blow

Code to train and run Blow

Language: Python · License: Apache-2.0 · 000

craig

Craig is a multi-track voice recorder for Discord.

Language: C · 000

ddsp

DDSP: Differentiable Digital Signal Processing

Language: Python · License: Apache-2.0 · 010

DDSP-48kHz-Stereo

A 48kHz/stereo implementation of Google Magenta's DDSP. Also includes variable audio file render length.

License: Apache-2.0 · 000

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

License: NOASSERTION · 000

freqtrade

Free, open source crypto trading bot

License: GPL-3.0 · 000

Gdocs

Tools to manage GDocs interactions

000

hugo-profile

A highly customizable and mobile first Hugo template for personal portfolio and blog.

License: MIT · 000

Knowledge_distillation_via_TF2.0

Code for recent knowledge distillation algorithms and benchmark results, written with the TF2.0 low-level API

000

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

License: BSD-3-Clause · 000

MetaGPT

🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo

License: MIT · 000

models

Models and examples built with TensorFlow

Language: Python · License: Apache-2.0 · 010

NLP-Models-Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

License: MIT · 000

Nonparallel-emotional-VC

Language: Python · 010

pase

Problem Agnostic Speech Encoder

Language: Python · 010

rnnoise

Recurrent neural network for audio noise reduction

License: BSD-3-Clause · 000

StyleTTS

Official Implementation of StyleTTS

Language: Python · License: MIT · 000

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

License: Apache-2.0 · 000

tf_unet

Generic U-Net Tensorflow implementation for image segmentation

Language: Python · License: GPL-3.0 · 000

trypython

000

uberduck-ml-dev

ML models for Uberduck

Language: Jupyter Notebook · License: Apache-2.0 · 010

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).

000