openvpi / DiffSingerMiniEngine

A minimal inference engine for DiffSinger

DiffSingerMiniEngine

A minimal inference engine for DiffSinger in MIDI-less mode.

Getting Started

  1. Install onnxruntime following the official guidance.
  2. Install the other dependencies with pip install PyYAML soundfile.
  3. Download the ONNX version of the NSF-HiFiGAN vocoder from here and unzip it into the assets/vocoder directory.
  4. Download an ONNX rhythm predictor from here and put it into the assets/rhythmizer directory.
  5. Put your ONNX acoustic models into the assets/acoustic directory.
  6. Edit configs/default.yaml, or create another config file, to match your preferences and local environment.
  7. Run the server with python server.py or python server.py --config <YOUR_CONFIG>.
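
Before starting the server, it can help to confirm that steps 3 to 5 left at least one .onnx file in each asset directory. The following is a minimal sketch of such a check, not part of this repository: the helper name find_missing_assets is hypothetical, and only the three directory names are taken from the steps above.

```python
from pathlib import Path

# Directory names come from the steps above; the helper itself is hypothetical.
ASSET_DIRS = ("assets/vocoder", "assets/rhythmizer", "assets/acoustic")


def find_missing_assets(root="."):
    """Return the asset directories under `root` that contain no .onnx file."""
    missing = []
    for rel in ASSET_DIRS:
        directory = Path(root) / rel
        # rglob also finds models that were unzipped into a subfolder.
        has_model = directory.is_dir() and any(directory.rglob("*.onnx"))
        if not has_model:
            missing.append(rel)
    return missing


if __name__ == "__main__":
    missing = find_missing_assets()
    if missing:
        print("No .onnx file found in:", ", ".join(missing))
    else:
        print("All asset directories contain at least one .onnx file.")
```

Run it from the repository root. It only checks that files exist; it does not verify that the models are valid or that they match your config file.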

API Specification

TBD

How to Obtain Acoustic Models

  1. Train a model on your own dataset, or download pretrained checkpoints from here.
  2. Export the PyTorch checkpoints to ONNX format. See instructions here.
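
After exporting, one way to sanity-check a model is to open it with onnxruntime and list its inputs and outputs. The sketch below is an illustration under assumptions rather than this project's tooling: load_session and describe_io are hypothetical helper names, the script name in the usage comment is made up, and the CPU execution provider is chosen only so the check runs on any machine.

```python
def load_session(model_path):
    """Open an ONNX model on the CPU with onnxruntime."""
    import onnxruntime as ort  # imported lazily so describe_io works without it

    return ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])


def describe_io(session):
    """Return one 'kind name type shape' line per model input and output."""
    lines = []
    for kind, nodes in (("input", session.get_inputs()),
                        ("output", session.get_outputs())):
        for node in nodes:
            lines.append(f"{kind:<6} {node.name}  {node.type}  {node.shape}")
    return lines


if __name__ == "__main__":
    import sys

    # Usage (hypothetical script name):
    #   python inspect_onnx.py assets/acoustic/<YOUR_MODEL>.onnx
    if len(sys.argv) == 2:
        print("\n".join(describe_io(load_session(sys.argv[1]))))
```

If the model fails to load here, it will most likely fail in the server as well, so this is a cheap first check before editing the config file.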

About

License: GNU Affero General Public License v3.0

Language: Python (100.0%)