MakeDiffSinger

Pipelines and tools to build your own DiffSinger dataset.

For the recommended standard dataset making pipelines, see:

acoustic-forced-alignment: make dataset from scratch with MFA for acoustic model training
variance-temp-solution: temporary solution to extend acoustic datasets into variance datasets

For other useful pipelines and tools for making a dataset, welcome to raise issues or submit PRs.

DiffSinger dataset structure

Dataset tools now have their own repository: dataset-tools.

There are mainly 3 components:

AudioSlicer: Slice your recordings into short segments
MinLabel: Label *.lab files containing word transcriptions for acoustic model training.
SlurCutter: Edit MIDI sequence in *.ds files for variance model training.

Pipelines and tools to build your own DiffSinger dataset.

GNU Affero General Public License v3.0

Language:Python 100.0%