ElenaRyumina / ABAW-SIU

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

An Audio-Video Deep and Transfer Learning Framework for Multimodal Emotion Recognition in the wild

In this paper, we present our contribution to the ABAW facial expression challenge. We report the proposed system and the official challenge results adhering to the challenge protocol. Using end-to-end deep learning and benefiting from transfer learning approaches, we reached the 3rd place in the Expression challenge with the test set performance measure of 42.10%.

The paper is available via this link.

The weights for models and support files can be downloaded through this link.

Every model from the article is located in separate folders with all needed code for generating, training, and predicting processes.

The repository organised as follows:

  • Video_based_models
    • VGGFace2: VGGFace2 model - a Resnet50 pretrained on the VGGFace2 dataset model, which then was used to transfer it on emotion classification task.
  • Audio_based_models

About

License:Apache License 2.0


Languages

Language:Jupyter Notebook 65.7%Language:Python 34.3%