ae86208 / Emotional_Speech

Emotional Speech Analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Emotional_Speech

Emotional Speech Analysis

This is a project of my undergraduate thesis. In order to make this working properly, please read the stuff as follows:

  1. The main script file is feature_extration.m and it includes feature extration, data purifying and data visualization, actually.

  2. Make sure that you have STRAIGHT library correctly installed, due to copyright reasons, the library itself is not provided.

  3. If you are using Matlab 2012b and before, please change function audioread into legacy waveread for back compatibility. This problem will be fixed sooner or later, but I still suggest using the up-to-date version for a number of reasons.

  4. If you are using this to train your own model, specify the path to your library in line 5 in feature_extration.m.

  5. For those who are tired of training, or do not have a copy of STRAIGHT, please start the program from section Plot to see the results directly.

  6. *.mat files contain features extracted from the audio library, feel free to use these for your own demands. Note that those .mat files end with _parallel are almost the same as those with no suffix.

  7. Through observation, the distribution of time is a little bit unnatural. I find that it is closely related to word seperation errors due to the energy sensitivity of Angry, especially. Given full consideration, I decide to check the seperation of each file and save those who return the same number of words. This leads to the fact that datasets shrink dramatically, while the result makes sense. Thus comes the timeFixTest.m and the corresponding output timeDistributionFixed.jpg

  8. The new .wav files are generated through the most up-to-date amazing tool called TANDEM-STRAIGHT. Compared with the old version, it is easier to do speech manipulation and the parameters are more directly related to human cognition. What matters most, it solves the problem of the SNR that could be sensed appearently before. (Gangamohan, P., Mittal, V. K., & Yegnanarayana, B. (2012). A Flexible Analysis Synthesis Tool (FAST) for studying the characteristic features of emotion in speech. 2012 IEEE Consumer Communications and Networking Conference (CCNC), 250–254. doi:10.1109/CCNC.2012.6181096)

  9. Any problem with this project, please contact ae86208@gmail.com, thanks.

About

Emotional Speech Analysis

License:GNU General Public License v3.0


Languages

Language:HTML 76.1%Language:MATLAB 23.9%