stepanmk / memovision

Official repository for the MemoVision software.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MemoVision

Official repository for the MemoVision software. The online version of the software is currently available here: https://memovision.phil.muni.cz

MemoVision software aims to provide an interface for comparative performance music analysis. This includes music information retrieval and feature selection methods. The frontend is developed with the Vue.js framework and communicates with the Python backend server built with Flask. JavaScript allows the interface to update dynamically and provides modern visuals together with Tailwind CSS. The Python backend leverages the open-source music feature extraction modules and deep learning implementations.

Current state

Keep in mind that this is a work in progress, and you may experience bugs and optimization problems. The first official version was released in January 2024.

Modules of UI:

  • Track manager
  • Region selector
  • Interpretation player
  • Visualization

Progress:

  • User interface via a web browser, sign-in and logging;
  • Allows uploading of music recordings (different versions of the same piece), ground-truth annotations (measure/downbeat positions), binary labels, and metadata (performer names and year of recordings);
  • Parameter extraction: dynamics (RMS and loudness based on EBU R 128), timing (measure durations, tempo of measures), and beat activation function from the TCN model;
  • Processing: tuning deviation estimation, chroma features, duplicate finder, structural differences detection, audio-to-audio synchronization (combined approach: MrMsDTW and activation function from TCN beat tracker);
  • Feature selection: Max-Relevance method on the duration of measures to rank each measure based on its relevance to given binary labels, ranking of measure relevance;
  • Playback of synchronized tracks, easy piece-wise orientation, colorbar of selected relevance, detecting the structural differences in performances (such as missing or additional measures and repetitions);
  • Visualization: Plotting of performance parameters of any sections of interest (both individual plots and comparative plots with measures as x-axis), statistical scatter plot based on binary labels;
  • Piano roll: automatic transcription to MIDI piano roll (bytedance-based), onset positions and their distribution (available only for piano recordings), IOI plots.

License

MIT License

How to install

We use the VSCode IDE for the development, and the instructions reflect it. If you use other IDEs, please,

  • Install Node.js and pnpm

  • Create a new Python virtual env memovision in, e.g., C:/python-venv/memovision

    • note: it should be Python 3.7.x or Python 3.8.x due to some dependencies (it was not tested on newer versions)
    • install backend Python modules (cd to backend folder): pip install -r requirements.txt
  • Install the PostgreSQL database and create a new DB called memovision

  • The list of the VSCode extensions we use is listed in vscode_extentions.txt.

  • Navigate to the frontend folder: cd frontend

  • Run pnpm i to install Node.js modules

  • For the DB to run properly, you have to adjust the backend/mos_backend/__init__.py script for the correct DB URI (provide the password for the PostgreSQL user)

  • You also need to adjust the .vsode/tasks.json file and provide a path to your Python virtual env and run.py in the project folder such as: "command": "cd backend; & c:/python-venv/memovision/Scripts/python.exe C:/memovision/backend/run.py"

  • Then, run the create_tables.py script that creates tables for the memovision database.

  • Furthermore, you need to download ffmpeg (https://ffmpeg.org/download.html) and waveform (https://github.com/bbc/audiowaveform/releases) and add them to your sys path. FFmpeg loads the non-wav files, and waveform renders the waveforms of audio recordings in the browser.

  • Reset the VSCode session for changes to take place. The .vscode/tasks.json file automatically runs Flask and Vite dev servers after the start of VSCode if all paths are provided correctly.

Presentation

The methodology, some technical descriptions, and first visual examples were presented at the Second International Workshop on Multilayer Music Representation and Processing (MMRP23), in a satellite event of the 4th International Symposium on the Internet of Sounds (IS2), in the paper "Application of Computational Methods for Comparative Music Analysis," and at the late-breaking demo section of the International Society of Music Information Retrieval (ISMIR23) conference, in the paper "Memovision: a Tool for Feature Selection and Visualization of Performance Data."

Acknowledgment

This work was supported by the project TA ČR, TL05000527 Memories of Sound: The Evolution of the Interpretative Tradition Based on the Works of Antonin Dvorak and Bedrich Smetana and was co-financed with state support of the Technology Agency of the Czech Republic within the ÉTA Program.

About

Official repository for the MemoVision software.

License:MIT License


Languages

Language:Vue 45.8%Language:JavaScript 26.6%Language:Python 25.8%Language:CSS 1.6%Language:HTML 0.2%