Parisson / TimeSide

Scalable audio processing framework written in Python with a RESTful API

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TimeSide : scalable audio processing framework and server written in Python

TimeSide is a python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large datasets of any audio or video assets with a plug-in architecture, a secure scalable backend and an extensible dynamic web frontend.

Use cases

  • Scaled audio computing (filtering, machine learning, etc)
  • Web audio visualization
  • Audio process prototyping
  • Realtime and on-demand transcoding and streaming over the web
  • Automatic segmentation and labelling synchronized with audio events

Goals

  • Do asynchronous and fast audio processing with Python,
  • Decode audio frames from any audio or video media format into numpy arrays,
  • Analyze audio content with some state-of-the-art audio feature extraction libraries like Aubio, Yaafe and VAMP as well as some pure python processors
  • Visualize sounds with various fancy waveforms, spectrograms and other cool graphers,
  • Transcode audio data in various media formats and stream them through web apps,
  • Serialize feature analysis data through various portable formats,
  • Provide audio sources from plateform like YouTube or Deezer
  • Deliver analysis and transcode on provided or uploaded tracks over the web through a REST API
  • Playback and interact on demand through a smart high-level HTML5 extensible player,
  • Index, tag and annotate audio archives with semantic metadata (see Telemeta which embed TimeSide).
  • Deploy and scale your own audio processing engine through any infrastructure

Funding and support

To fund the project and continue our fast development process, we need your explicit support. So if you use TimeSide in production or even in a development or experimental setup, please let us know by:

Thanks for your help!

News

1.0

  • Server refactoring:
    • audio process run on items (REST API track's model)
    • several tools, views, models and serializers
    • REST API's schema on OpenAPI 3 specification and automatic Redoc generation
  • Move core and server from Python 2.7 to 3.7
  • Upgrade Django to 2.2, Django REST Framework to 3.11, Celery to 4.4
  • Add an Aubio based decoder
  • Add core and server processors' versioning and server process' run time
  • Regroup all dependencies on pip requirements removing conda use
  • Add :ref:`provider` as a core API component and as a REST API model
  • Add provider plugins :ref:`deezer-preview`, :ref:`deezer-complete` and :ref:`youtube`
  • Improve server unit testing
  • Add JWT authentication on REST API
  • Various bug fixes
  • Add core, server and workers logging

0.9

  • Upgrade all python dependencies
  • Add Vamp, Essentia, Yaafe, librosa, PyTorch, Tensorflow libs and wrappers
  • Add a few analyzing plugins (Essentia Dissonance, Vamp Constant Q, Vamp Tempo, Vamp general wrapper, Yaafe general wrapper)
  • Add processor parameter management
  • Add processor inheritance
  • Improve HTML5 player with clever data streaming
  • Improve REST API and various serialzers
  • Improve unit testing
  • Various bug fixes

0.8

  • Add Docker support for instant installation. This allows to run TimeSide now on any OS platform!
  • Add Jupyter Notebook support for easy prototyping, experimenting and sharing (see the examples in the doc).
  • Add an experimental web server and REST API based on Django REST Framework, Redis and Celery. This now provides a real web audio processing server with high scaling capabilities thanks to Docker (clustering) and Celery (multiprocessing).
  • Start the development of a new player interface thanks to Angular and WavesJS.
  • Huge cleanup of JS files. Please now use bower to get all JS dependencies as listed in settings.
  • Add metadata export to Elan annotation files.
  • Fix and improve some data structures in analyzer result containers.
  • Many various bugfixes.

0.7.1

  • fix django version to 1.6.10 (sync with Telemeta 1.5)

0.7

  • Code refactoring:
    • Create a new module timeside.plugins and move processors therein: timeside.plugins.decoder,analyzer, timeside.plugins.encoder, timeside.plugins.fx
    • WARNING: to properly manage the namespace packages structure, the TimeSide main module is now timeside.core and code should now be initialized with import timeside.core
    • timeside.plugins is now a namespace package enabling external plugins to be automatically plugged into TimeSide (see for example timeside-diadems). This now makes TimeSide a real plugin host, yeah!
    • A dummy timeside plugin will soon be provided for easy development start.
  • Move all analyzers developped by the partners of the Diadems project to a new repository: timeside-diadems
  • Many fixes for a better processing by Travis-CI
  • Add a dox file to test the docker building continously on various distributions

For older news, please visit: https://github.com/Parisson/TimeSide/blob/master/NEWS.rst

Documentation

Install

Thanks to Docker, Timeside is now fully available as a docker composition ready to work. The docker based composition bundles some powerfull applications and modern frameworks out-of-the-box like: Python, Conda, Numpy, Jupyter, Gstreamer, Django, Celery, Haystack, ElasticSearch, MySQL, Redis, uWSGI, Nginx and many more.

First, install Docker and docker-compose

Then clone TimeSide:

git clone --recursive https://github.com/Parisson/TimeSide.git
cd TimeSide
docker-compose pull

That's it! Now please go to the documentation to see how to use it.

Note

If you need to user TimeSide outside a docker image please refer to the rules of the Dockerfile which is based on a Debian stable system. But we do not provide any kind of free support in this usercase anymore (the dependency list is now huge). To get commercial support in more various usecases, please reach the Parisson dev team.

Sponsors and Partners

  • Parisson
  • CNRS: National Center of Science Research (France)
  • Huma-Num: big data equipment for digital humanities (CNRS, France)
  • CREM: French National Center of Ethomusicology Research (France)
  • Université Pierre et Marie Curie (UPMC Paris, France)
  • ANR: Agence Nationale de la Recherche (France)
  • MNHN : Museum National d'Histoire Naturelle (Paris, France)
  • C4DM : Center for Digital Music, Queen Mary University (London, United Kingdom)
  • NYU Steinhardt : Music and Performing Arts Professions, New York University (New York, USA)
  • IRCAM : IRCAM (Paris, France)

Related projects

  • Telemeta : Open web audio platform
  • Sound archives of the CNRS, CREM and the "Musée de l'Homme" in Paris, France
  • DIADEMS sponsored by the ANR.
  • DaCaRyh, Data science for the study of calypso-rhythm through history
  • KAMoulox Online unmixing of large historical archives
  • NYU+CREM+Parisson : arabic music analysis from the full CREM database
  • WASABI: Web Audio Semantic Aggregated in the Browser for Indexation, sponsored by the ANR

Copyrights

  • Copyright (c) 2019, 2022 IRCAM
  • Copyright (c) 2006, 2022 Guillaume Pellerin
  • Copyright (c) 2010, 2022 Paul Brossier
  • Copyright (c) 2019, 2020 Antoine Grandry
  • Copyright (c) 2006, 2019 Parisson SARL
  • Copyright (c) 2013, 2017 Thomas Fillon
  • Copyright (c) 2013, 2014 Maxime Lecoz
  • Copyright (c) 2013, 2014 David Doukhan
  • Copyright (c) 2006, 2010 Olivier Guilyardi

License

TimeSide is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

TimeSide is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

Read the LICENSE.txt file for more details.

About

Scalable audio processing framework written in Python with a RESTful API

License:GNU Affero General Public License v3.0


Languages

Language:Python 96.6%Language:Shell 1.2%Language:HTML 1.0%Language:JavaScript 0.5%Language:EJS 0.3%Language:Dockerfile 0.3%Language:TypeScript 0.0%