AK391 / omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Home Page:https://music-and-culture-technology-lab.github.io/omnizart-doc/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OMNIZART

build docs PyPI version PyPI - License PyPI - Downloads Docker Pulls

DOI DOI

Omnizart is a Python library that aims for democratizing automatic music transcription. Given polyphonic music, it is able to transcribe pitched instruments, vocal melody, chords, drum events, and beat. This is powered by the research outcomes from Music and Culture Technology (MCT) Lab. The paper has been published to Journal of Open Source Software (JOSS).

Transcribe your favorite songs now in Colab Open In Colab or Replicate

Quick start

Visit the complete document for detailed guidance.

Pip

# Install omnizart
pip install omnizart

# Download the checkpoints
omnizart download-checkpoints

# Transcribe your songs
omnizart drum transcribe <path/to/audio.wav>
omnizart chord transcribe <path/to/audio.wav>
omnizart music transcribe <path/to/audio.wav>

Docker

docker pull mctlab/omnizart:latest
docker run -it mctlab/omnizart:latest bash

Supported applications

Application Transcription Training Evaluation Description
music ✔️ ✔️ Transcribe musical notes of pitched instruments.
drum ✔️ ⁉️ Transcribe events of percussive instruments.
vocal ✔️ ✔️ Transcribe note-level vocal melody.
vocal-contour ✔️ ✔️ Transcribe frame-level vocal melody (F0).
chord ✔️ ✔️ Transcribe chord progressions.
beat ✔️ ✔️ Transcribe beat position.

NOTES The current implementation for the drum model has unknown bugs, preventing loss convergence when training from scratch. Fortunately, you can still enjoy drum transcription with the provided checkpoints.

Compatibility Issue

Currently, Omnizart is incompatible for ARM-based MacOS system due to the underlying dependencies. More details can be found in the issue #38.

Citation

If you use this software in your work, please cite:

@article{Wu2021,
  doi = {10.21105/joss.03391},
  url = {https://doi.org/10.21105/joss.03391},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {68},
  pages = {3391},
  author = {Yu-Te Wu and Yin-Jyun Luo and Tsung-Ping Chen and I-Chieh Wei and Jui-Yang Hsu and Yi-Chin Chuang and Li Su},
  title = {Omnizart: A General Toolbox for Automatic Music Transcription},
  journal = {Journal of Open Source Software}
}

About

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

https://music-and-culture-technology-lab.github.io/omnizart-doc/

License:MIT License


Languages

Language:Python 96.6%Language:TeX 1.8%Language:Shell 1.2%Language:Makefile 0.3%Language:Dockerfile 0.1%