ericguizzo's repositories

anti_transfer

This repository supports the paper "Anti-Transfer Learning for Task Invariance in Convolutional Neural Networks for Speech Processing"

emotion-recognition-using-speech

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

Language:PythonLicense:MITStargazers:3Issues:1Issues:0

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021

Language:Jupyter NotebookLicense:MITStargazers:2Issues:1Issues:0

compound-word-transformer

Official implementation of compound word transformer (AAAI'21)

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

ConditionalStyleGAN

Conditional implementation for NVIDIA's StyleGAN architecture

Language:PythonStargazers:0Issues:1Issues:0

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CycleGAN-PyTorch

A very simple implementation of cyclegan, which is based on pytorch.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

deep-landscape

Official repository for the paper "DeepLandscape: Adversarial Modeling of Landscape Videos" (ECCV2020)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DeepBach

code accompanying "DeepBach: a Steerable Model for Bach Chorales Generation" paper

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

fast-style-transfer

TensorFlow CNN for fast style transfer ⚡🖥🎨🖼

Language:PythonStargazers:0Issues:1Issues:0

image-super-resolution

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LoopTest

Official repo of ISMIR-21 publication, “A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset”.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

muscaps

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:1Issues:0

musegan

An AI for Music Generation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

musicnn

Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.

Language:Jupyter NotebookLicense:ISCStargazers:0Issues:1Issues:0

neural-dream

PyTorch implementation of DeepDream algorithm

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

neural-style

Neural style in TensorFlow! 🎨

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

Noise2Noise-audio_denoising_without_clean_training_data

Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

pixray

neural image generation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

remi

"Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions", ACM Multimedia 2020

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

stylegan2-pytorch

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tf-diffwave

Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

Voice-Converter-CycleGAN

Voice Converter Using CycleGAN and Non-Parallel Data

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Wave-U-Net-Pytorch

Improved Wave-U-Net implemented in Pytorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0