FM Synthesizer Sound Matching Experiment

Machine learning and evolutionary algorithms to program a VST FM synthesizer.

In synthesizer sound matching, algorithms are used to find parameters for a synthesizer to replicate a target sound as closely as possible. In this experiment six different algorithms are compared in sound matching Dexed, a VST emulation of the Yamaha DX7 synthesizer.

Four deep learning models are compared: a multi-layer perceptron (MLP), a long short term memory (LSTM) recurrent neural network, a LSTM model with highway layers (LSTM++), and a convolutional neural network. Two genetic algorithms are also included for comparison: a simple single objective genetic algorithm, and a multi-objective non-dominated sorting genetic algorithm (NSGA III).

This repository contains python notebooks for an example experiment that highlights the use of the SpiegeLib software library for automatic synthesizer programming research and was published as a conference paper for the Audio Engineering Society Spring 2020 conference that was held virtually in June of 2020.

Original datasets and sound files used for evaluation can be downloaded from zenodo. For more information on this experiment and SpiegeLib please visit the experiment website

Requirements

In order to run these notebooks, you will need:

Dexed FM VST https://asb2m10.github.io/dexed/
Python 3.6 >
A way to run python notebooks, we recommend Jupyter in an Anconda environment
SpiegeLib in a conda environment

Installation

Deatiled instructions on installing SpiegeLib in a conda virtual environment are available here
SpiegeLib can be installed via pip: pip install spiegelib
All dependencies except for RenderMan will be installed by pip. RenderMan will need to be installed manually.

Running

To recreate the experiment from start to finish, you can run the notebooks in this order:

synth_config.ipynb : Configures Dexed synthesizer for this experiment by selecting a subset of parameters to automatically program and freezing the rest.
dataset_generation.ipynb : Create datasets for training and validating deep learning models. Also creates 25 audio targets for evaluating results.
train_deep_learning_models.ipynb : Train deep learning models using the dataset we just created.
sound_match_deep_learning.ipynb : Estimate synthesizer parameters to match the 25 evaluation audio targets using the trained deep learning models.
sound_match_genetic.ipynb : Estimate synthesizer parameters to match the 25 evaluation audio targets using genetic algoritmhs.
evaluation.ipynb : Evaluate the results of sound matching.

Pre-trained models are also included in this repo which can be used. The datasets used in the original experiment are also available to download, see https://doi.org/10.5281/zenodo.3722784.

Satrat / vst-fm-sound-match

FM Synthesizer Sound Matching Experiment

Machine learning and evolutionary algorithms to program a VST FM synthesizer.

Requirements

Installation

Running

About

Languages