Beagle-Boys / Music-Genre-Classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Methodology

Analyse

  • Waveform

    Waveform

  • FFT Spectrum

    FFT

  • Log Spectogram

    Log Spectogram

  • MFCC

    MFCC


Preprocess

  • divided each .wav file into 5 segments to increse training data.
  • exctracted mfcc for each segment.
  • stored the complete dataset in format :
    {
        "mapping" : ["classical","blues",...],
        "mfcc": [[[...],[...],...,[...]],...],
        "labels": [0,2,...]
    }

Simple ANN

  • Architecture

    Archiecture

  • Result

    Plots

  • Comments :

    • As we can see, the model is overfitting.

Simple ANN with Dropout and Kernel Regularization

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • The model is not overfitting anymore.

Convolutional Neural Network

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • CNN performs very well.
    • The training data had to be reshaped since CNN required 3 dimensional input.
    • Fastest

Recurrent Neural Network (LSTM)

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • LSTM performs very well.
    • Slowest

Recurrent Neural Network (GRU)

  • Architecture

    Architecture

  • Result

    Plots

  • Comments :

    • GRU performs very well. Better than LSTM.
    • Slowest (as slow as LSTM)

About


Languages

Language:PureBasic 60.3%Language:Python 39.7%