Tai-Min / Spectogram-Dataset-Recorder

Simple recording app that saves recordings as matrices containing time-frequency data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spectogram Dataset Recorder

Simple GUI tool written in Qt to record sound from microphone input or speakers and export it as spectogram either as Mel Scale Filter Banks or Mel Frequency Cepstral Coefficients. This program was written to create dataset for sound event detection. I relied on this blog post to create conversion routine.

alt text

Features

  • Easy to use
  • Lightweigh
  • Allows fine tuning of conversion parameters
  • Allows batch recording one after another
  • Exports to four different formats: plain text (.txt), numpy array (.npy), color image (.jpg), grayscale image (.jpg)
  • Saves samples under "path\to\dataset_root\class_name"
  • Does not overwrite previous recordings
  • audioprocessor.h written in pure STL so it can be reused in any C++ project to convert audio/pcm data to spectogram

Compiling

I have compiled it using Qt Creator 4.9.2, Qt 5.15.0 and MSVC19 64 bit. zlib 64 bit is required (I have compiled this and works flawlessly). To compile, change INCLUDEPATH and LIBS in .pro file to correct path to zlib.

About

Simple recording app that saves recordings as matrices containing time-frequency data

License:MIT License


Languages

Language:C++ 97.1%Language:QMake 2.9%