hxer7963/FacialExpressionRecognition

facial-expression-recognition pre-trained vgg16 cnn-keras svm fer2013

Usage

You can download the fer2013 dataset in Challenges in Representation Learning: Facial Expression Recognition Challenge uncompress it to dataset directory which you should create at the top of the project. Then you can run the Utils.csv2image code fragment to convert the sequential pixels value to images which stored in the dataset directory. If you are confuse about directory hierarchy, you can feel free to inspect the source code in Utils.py.

Furthermore, if you have any trouble in extracting the image pixels value to dataset directory, you can feel free to download my processed dataset, which powered by Baidu cloud disk.

If you just want to inspect the model which I have trained, you can download the Models from Baidu cloud disk which also have no password.

convolutional neural nets architecture

convnet from scratch

shallow cnn (softmax activation) and then replace the top softmax layer with SVM multiple-classifier which implemented in sklearn.svm.SVC, finally hit 62.3% accuracy on PrivateTest dataset.

pre-trained VGG16 convolutional base

First, pre-trained your new stochastic initial fully-connected layer with frozen convolutional base, then unfrozen the top convolutional block to fine-tuning the convolutional base in order to extract better features, Finally the output of flatten layer feed into L2-SVM hit 65.47% accuracy on PrivateTest dataset. As you can see, the Score is not so bad, if you have a look at the kaggle leaderboard.

conclusion

cnn vs. dnn

As you can see, the features extracted with dnn(fine-tuning VGG16 nets) is better than shallow cnn which is implemented from scratch, but former is so expensive that you should attempt it if you have access to a GPU. If you just want to inspect the performance of processed model, the process will be very fast.

softmax vs. L2-SVM

If you run the above code, i believe you already get it. SVM multi-classifier clearly precedes the softmax activation function, at least in my experiment.

Reference:

Deep learning with Python

Deep Learning using Linear Support Vector Machines: the winner(hit 71.2% accuracy) in Facial emotion recognition in Kaggle competition

About

Contrast multiple facial expression recognition experiments and found that using SVM instead of softmax layer can achieve better classification results(65.47% accuracy on fer2013 dataset).

facial-expression-recognition pre-trained vgg16 cnn-keras svm fer2013

Languages

Language:Jupyter Notebook 95.9%Language:Python 4.1%