jameslyons / python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Why am I getting double the frames that I am expecting? [Answer: I was using stereo audio [facepalm]]

collinalexbell opened this issue · comments

I have 31 second audio @ 16000hz

I run MFCC on the audio at default settings (.01s step sizes)

This should mean that I get 31s/0.01s = 3100 frames.

What I am getting as a result of calling mfcc() is 6200 frames. Am I misunderstanding something?

I am sorry. I was stereo audio. Use mono audio only!