miquelescobar / asmr-is-all-you-need

Audio generation using WaveRNN and WaveNet

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ASMR Is All You Need

VĂ­ctor Adell, Jordi Aguilar, Pau Autrand, Miquel Escobar

Abstract

In the past few years the popularity of the autonomous sensory meridian response (ASMR) concept has risen exponentially, being the video format the most common source for its stimuli. These are videos made by content creators who generate trigger sounds by using multiple materials and techniques. Therefore, in this study we propose variations of the WaveRNN and WaveNet models designed to generate these trigger sounds from scratch. We observe that the baseline architecture of these models outperform the alternative conditioned models in terms of the quality of the generated audios.

Please see the webpage of the project for a sample of the different model outputs that have been outlined on the paper ASMR Is All You Need.

About

Audio generation using WaveRNN and WaveNet


Languages

Language:Jupyter Notebook 90.1%Language:Python 9.9%Language:Shell 0.0%