tuka217 / OMR-Datasets

Collection of datasets used for Optical Music Recognition

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository contains a collection of many datasets used for various Optical Music Recognition tasks, including staff-line detection and removal, training of Convolutional Neuronal Networks (CNNs) or validating existing systems by comparing your system with a known ground-truth.

Note that most datasets have been developed by researchers and using their dataset requires accepting a certain license and/or citing their respective publications, as indicated for each dataset. Most datasets link to the official website, where you can download the dataset.

The following datasets are referenced from this repository:

Tools for working with the datasets

Build Status codecov PyPI version Documentation Status Code Health GitHub license

Handwritten Online Musical Symbols (HOMUS)

Official website: http://grfia.dlsi.ua.es/homus/

License

Summary: The Handwritten Online Musical Symbols (HOMUS) dataset is a reference corpus with around 15000 samples for research on the recognition of online handwritten music notation. For each sample, the individual strokes that the musicians wrote on a Samsung Tablet using a stylus were recorded and can be used in online and offline scenarios.

Scientific Publication: J. Calvo-Zaragoza and J. Oncina, "Recognition of Pen-Based Music Notation: The HOMUS Dataset," 2014 22nd International Conference on Pattern Recognition, Stockholm, 2014, pp. 3038-3043. DOI: 10.1109/ICPR.2014.524

Example:

Example of HOMUS dataset

Remarks: The original dataset contains around 20 artifacts and misclassifications that were reported to the authors and corrected by Alexander Pacha.

Music Score Classification Dataset

Official website: https://github.com/apacha/MusicScoreClassifier

License: MIT

Summary: A dataset of 2000 images, containing 1000 images of music scores and 1000 images of other objects including text documents. The images were taken with a smartphone camera from various angles and different lighting conditions.

Scientific Publication: Under review

Example:

Example of MusicScoreClassifier dataset

CVC-MUSCIMA

Official website: http://www.cvc.uab.es/cvcmuscima/index_database.html

License: CC BY-NC-SA 4.0

Summary: The CVC-MUSCIMA database contains handwritten music score images, which has been specially designed for writer identification and staff removal tasks. The database contains 1,000 music sheets written by 50 different musicians. All of them are adult musicians, in order to ensure that they have their own characteristic handwriting style. Each writer has transcribed the same 20 music pages, using the same pen and the same kind of music paper (with printed staff lines). The set of the 20 selected music sheets contains music scores for solo instruments and music scores for choir and orchestra.

Scientific Publication: Alicia Fornés, Anjan Dutta, Albert Gordo, Josep Lladós. CVC-MUSCIMA: A Ground-truth of Handwritten Music Score Images for Writer Identification and Staff Removal. International Journal on Document Analysis and Recognition, Volume 15, Issue 3, pp 243-251, 2012. DOI: 10.1007/s10032-011-0168-2

Example:

Example of CVC MUSCIMA dataset

MUSCIMA++

Official website: https://ufal.mff.cuni.cz/muscima

License: CC BY-NC-SA 4.0

Summary: MUSCIMA++ is a dataset of handwritten music notation for musical symbol detection that is based on the MUSCIMA dataset. It contains 91255 symbols, consisting of both notation primitives and higher-level notation objects, such as key signatures or time signatures. There are 23352 notes in the dataset, of which 21356 have a full notehead, 1648 have an empty notehead, and 348 are grace notes. Composite objects, such as notes, are captured through explicitly annotated relationships of the notation primitives (noteheads, stems, beams...). This way, the annotation provides an explicit bridge between the low-level and high-level symbols described in Optical Music Recognition literature.

Scientific Publication: Jan Hajič jr., Pavel Pecina. In Search of a Dataset for Handwritten Optical Music Recognition: Introducing MUSCIMA++. CoRR, arXiv:1703.04824, 2017. https://arxiv.org/abs/1703.04824

Example:

Example of MUSCIMA++ dataset

Remarks: Since this dataset is derived from the MUSCIMA dataset, using it requires to reference both works.

Capitan collection

Official website: http://grfia.dlsi.ua.es/

License (Freely available for research purposes)

Summary: A corpus collected by an electronic pen while tracing isolated music symbols from Early manuscripts. The dataset contains information of both the sequence followed by the pen and the patch of the source under the tracing itself. In total it contains 10230 samples unevenly spread over 30 classes. Each symbol is described as stroke (capitan stroke) and including the piece of score below it (capitan score).

Scientific Publication: Jorge Calvo-Zaragoza, David Rizo and Jose M. Iñesta. Two (note) heads are better than one: pen-based multimodal interaction with music scores. International Society of Music Information Retrieval conference, 2016. Download the PDF

Example:

Example of Capitan dataset

Remarks: This dataset exists in two flavours:

  • As raw dataset, which contains only the textual descriptions of the strokes and the images, called Bimodal music symbols from Early notation. This format is similar to the HOMUS dataset.
  • As rendered images inside of the Isolated handwritten music symbols dataset. Also refered to as Capitan collection.

MuseScore Monophonic MusicXML Dataset

Official website: https://github.com/eelcovdw/mono-musicxml-dataset

License: MIT

Summary: This dataset contains the IDs to 17000 monophonic scores, that can be downloaded from musescore.com. A sample script is given that downloads one score, given you've obtained a developer key from the MuseScore developers.

Scientific Publication: Eelco van der Wel, Karen Ullrich. Optical Music Recognition with Convolutional Sequence-to-Sequence Models. CoRR, arXiv:1707.04877, 2017. https://arxiv.org/abs/1707.04877

Examples:

Example of Monophonic MusicXML dataset

Rebelo Dataset

Official websites: http://www.inescporto.pt/~arebelo/index.php and http://www.inescporto.pt/~jsc/projects/OMR/

License: CC BY-SA 4.0

Summary: Three datasets of perfect and scanned music symbols including an extensive set of synthetically modified images for staff-line detection and removal.

Scientific Publication: A. Rebelo, G. Capela, and J. S. Cardoso, "Optical recognition of music symbols: A comparative study" in International Journal on Document Analysis and Recognition, vol. 13, no. 1, pp. 19-31, 2010. DOI: 10.1007/s10032-009-0100-1

Examples:

Example of Rebelo dataset Example of Rebelo dataset

Remarks: The dataset is usually only available upon request, but with written permission of Ana Rebelo I hereby make the datasets available under a permissive CC-BY-SA license, which allows you to use it freely given you properly mention her work by citing the above mentioned publication: Download the dataset.

Fornes Dataset

Official website: http://www.cvc.uab.es/~afornes/

License

Summary: A dataset of 4100 black and white symbols of 7 different symbol classes: flat, natural, sharp, double-sharp, c-clef, g-clef, f-clef.

Scientific Publication: A.Fornés and J.Lladós and G. Sanchez, "Old Handwritten Musical Symbol Classification by a Dynamic Time Warping Based Method", in Graphics Recognition: Recent Advances and New Opportunities. Liu, W. and Lladós, J. and Ogier, J.M. editors, Lecture Notes in Computer Science, Volume 5046, Pages 51-60, Springer-Verlag Berlin, Heidelberg, 2008. DOI: 10.1007/978-3-540-88188-9_6

Example:

Example of the Fornes Dataset Example of the Fornes Dataset

Audiveris OMR

Official website: https://github.com/Audiveris/omr-dataset-tools

License: CC BY-NC-SA 4.0

Summary: A collection of four music sheets with approximately 800 annotated music symbols. The DeepScore project in cooperation with the ZHAW targets towards automatically generating these images and the annotations from MuseScore or Lilypond documents.

Example:

Example of the Audiveris OMR Dataset

Printed Music Symbols Dataset

Official website: https://github.com/apacha/PrintedMusicSymbolsDataset

License: MIT

Summary: A small dataset of about 200 printed music symbols out of 36 different classes. Partially with their context (staff-lines, other symbols) and partially isolated.

Example:

Example of the Printed Music Symbols Dataset

OpenOMR Dataset

Official website: http://sourceforge.net/projects/openomr/

License: GPL v2

Summary: A dataset of 706 symbols (g-clef, f-clef) and symbol primitives (note-heads, stems with flags, beams) of 16 classes created by Arnaud F. Desaedeleer as part of his master thesis to train artificial neural networks.

Scientific Publication: Arnaud F. Desaedeleer, "Reading Sheet Music", Master Thesis, University of London, September 2006, Download

Example:

Example of the Printed Music Symbols Dataset

Gamera MusicStaves Toolkit

Official website: [http://music-staves.sf.net/ and https://github.com/hsnr-gamera](http://music-staves.sf.net/ and https://github.com/hsnr-gamera)

License: GPL v2

Summary: The Synthetic Score Database by Christoph Dalitz that contains 32 scores that have been computer generated with different music typesetting programs. It contains ground truth data and is suitable for the deformations implemented in the toolkit.

Scientific Publication: C. Dalitz, M. Droettboom, B. Pranzas, I. Fujinaga: A Comparative Study of Staff Removal Algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 5, pp. 753-766 (2008) DOI: 10.1109/TPAMI.2007.70749

Example:

Example of the Gamera MusicStaves Dataset

Byrd Dataset

Official website: http://www.diku.dk/hjemmesider/ansatte/simonsen/suppmat/jnmr/

License (Authors want to be contacted)

Summary: A small dataset of 34 high quality images with individual music score pages of increasing difficulty.

Scientific Publication: Donald Byrd & Jakob Grue Simonsen: "Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images". Journal of New Music Research, vol 44, nr.3, pages 169-195, 2015. DOI: 10.1080/09298215.2015.1045424

Example:

Example of the Byrd Dataset

About

Collection of datasets used for Optical Music Recognition

License:MIT License


Languages

Language:Python 99.8%Language:PostScript 0.2%