Oran2009 / audiosync_ipynb

audiosync ipynb

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

audiosync_ipynb

This tool automatically syncs matching video and audio files using recurrence quantification analysis (RQA). The tool allows you to adjust the sampling rate, hop length, and ftt window size so you can achieve near-perfect synchronization.

At the start of this project, I used dynamic time warping (DTW) instead of recurrence quantification analysis (RQA) to synchronize the two audio tracks. This method seemed efficient for my purposes, but didn't achieve near-perfect synchronization on some tests. This led me to implement RQA, which maximizes alignment paths and relies on similarity rather than distance.

More information on RQA can be discovered here on Wikipedia, and a similar project has been documented on this pdf

ipynb

This ipynb is mainly used for visual and debug information. It is possible to display waveplots, spectrograms, and chromagrams of audio files. It also displays the warping path between the video and audio files provided (using the DTW method).

example of a warping path visual:

Warping Path

About

audiosync ipynb


Languages

Language:Jupyter Notebook 100.0%