jameslyons / python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

logfbank functionstrange winstep size

LeeYongHyeok opened this issue · comments

Hello, i will use this project, in multimodal speech recognition.

when i use this logfbank function in python3 env, i got 237 frames logfbank feature, using 1.19s wav file.

I check my option, winlen=0.02, winstep=0.01 but output frames are same.

I change the option to winlen=0.04, winstep=0.02, and i got 118 frames.

what's the matter?? i confuse ...

I installed by 'pip3 install python_speech_features' command, and my os is ubuntu.

thanks!

oh, i fix it thank!