Inferring emotions from multiple modalities is critical for social communication, and deficits in emotion recognition are an important marker in the diagnosis of autism spectrum disorder. This project uses AI to help autistic individuals recognize emotions in speech.
Model: tf-wav2vec2-base (Keras and Keras Core)
The RAVDESS dataset (Ryerson Audio-Visual Database of Emotional Speech and Song) contains 7,356 audio files labeled with emotions in speech (calm, happy, sad, angry, fearful, surprised, and disgusted expressions) and song (calm, happy, sad, angry, and fearful emotions). This project uses a sample of the files from the original RAVDESS dataset.
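The emotion label for each RAVDESS clip is encoded in its filename, which consists of seven dash-separated numeric fields; the third field is the emotion code. A minimal sketch of reading that label (the helper name and mapping dictionary are illustrative, not taken from this project's code):

```python
# RAVDESS filenames look like "03-01-05-01-02-01-12.wav"; the third
# dash-separated field ("05" here) is the emotion code.
EMOTIONS = {
    "01": "neutral",
    "02": "calm",
    "03": "happy",
    "04": "sad",
    "05": "angry",
    "06": "fearful",
    "07": "disgust",
    "08": "surprised",
}

def emotion_from_filename(filename: str) -> str:
    """Return the emotion label encoded in a RAVDESS audio filename."""
    stem = filename.rsplit("/", 1)[-1].removesuffix(".wav")
    fields = stem.split("-")
    return EMOTIONS[fields[2]]

print(emotion_from_filename("03-01-05-01-02-01-12.wav"))  # angry
```

Parsing labels from filenames this way avoids needing a separate annotation file when building the training set.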
HuggingFace spaces Demo: https://huggingface.co/spaces/tensorgirl/audio_classification
Run pip install --upgrade intel-extension-for-tensorflow[cpu] at the beginning to use the Intel Extension for TensorFlow.
HuggingFace - tf-wav2vec2-base
Official Keras Core Documentation
https://humansofdata.atlan.com/2019/08/unravel-the-mystery-of-the-human-brain-at-neuroai/