Cosin1206 / speech_feature_extractor

A collection of feature extractors for speech processing, including MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplitude Modulation Spectrum (AMS), and others.

2020-08-12 Tips:

Please use the code in speech_utils.py and feature_extractor.py. The remaining files are leftovers from when I developed this project.

2018-05-27 Update Feature Extractors and Utils

All implemented feature extractors are now in the file ‘feature_extractor.py’. Please use this file for the newest version.

Speech Feature Extractors

  1. Features include: MFCC, GFCC, gammatone filterbank, power spectrum, log-power spectrum, Amplitude Modulation Spectrum (AMS, two versions), and Short-Time Fourier Transform (STFT) spectrum.
  2. Utils include: Ideal Binary Mask, Ideal Ratio Mask, a speech synthesis method, and a mixer that combines speech and noise at a given SNR in dB.
  3. Normalizers include: a zero-to-one normalizer and a unit-vector normalizer.
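
To illustrate what the utilities and normalizers listed above compute, here is a minimal NumPy sketch. It is not the code in speech_utils.py or feature_extractor.py: every function name, parameter, and default below (`mix_at_snr`, `power_spectrum`, `ideal_binary_mask`, `ideal_ratio_mask`, `zero_to_one`, `unit_vector`, the frame length, the hop size, the `lc_db` threshold, and the `beta` exponent) is a hypothetical choice made for this example, using the commonly cited definitions of these quantities.

```python
import numpy as np

EPS = 1e-12  # small constant to avoid division by zero and log(0)


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio is `snr_db` dB, then add it.

    Returns (mixture, scaled_noise). Assumes `noise` is at least as long as `speech`.
    """
    speech = np.asarray(speech, dtype=float)
    noise = np.asarray(noise, dtype=float)
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[: len(speech)]
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    gain = np.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    scaled_noise = gain * noise
    return speech + scaled_noise, scaled_noise


def power_spectrum(signal, frame_len=256, hop=128):
    """Hann-windowed, framed power spectrum of shape (n_frames, frame_len // 2 + 1)."""
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop: i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2


def ideal_binary_mask(speech_pow, noise_pow, lc_db=0.0):
    """1 where the local SNR of a time-frequency unit exceeds `lc_db`, else 0."""
    local_snr_db = 10.0 * np.log10((speech_pow + EPS) / (noise_pow + EPS))
    return (local_snr_db > lc_db).astype(float)


def ideal_ratio_mask(speech_pow, noise_pow, beta=0.5):
    """Soft mask (S / (S + N)) ** beta, with values in [0, 1]."""
    return (speech_pow / (speech_pow + noise_pow + EPS)) ** beta


def zero_to_one(x):
    """Min-max normalization: rescale values linearly into [0, 1]."""
    x = np.asarray(x, dtype=float)
    span = x.max() - x.min()
    return (x - x.min()) / span if span > 0 else np.zeros_like(x)


def unit_vector(x):
    """Scale `x` to unit Euclidean (L2) norm; an all-zero input is returned unchanged."""
    x = np.asarray(x, dtype=float)
    norm = np.linalg.norm(x)
    return x / norm if norm > 0 else x
```

A typical use would be to mix clean speech with noise at a chosen SNR, compute the power spectra of the clean speech and the scaled noise separately, and derive the masks from those two spectra. The log-power spectrum can then be obtained as `np.log(power_spectrum(x) + EPS)`, and either normalizer can be applied to the resulting feature matrix.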

License:MIT License


Languages

Language:Python 100.0%