Di Chang,
Yufeng Yin,
Zongjian Li,
Minh Tran,
Mohammad Soleymani
Institute for Creative Technologies, University of Southern California
WACV 2024
Arxiv | Project page
This is the official implementation of our WACV 2024 Application Track paper: LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis. LibreFace is an open-source and comprehensive toolkit for accurate and real-time facial expression analysis with both CPU-only and GPU-acceleration versions. LibreFace eliminates the gap between cutting-edge research and an easy and free-to-use non-commercial toolbox. We propose to adaptively pre-train the vision encoders with various face datasets and then distillate them to a lightweight ResNet-18 model in a feature-wise matching manner. We conduct extensive experiments of pre-training and distillation to demonstrate that our proposed pipeline achieves comparable results to state-of-the-art works while maintaining real-time efficiency. LibreFace system supports cross-platform running, and the code is open-sourced in C# (model inference and checkpoints) and Python (model training, inference, and checkpoints).
Clone repo:
git clone https://github.com/ihp-lab/LibreFace.git
cd LibreFace
The code is tested with Python == 3.7, PyTorch == 1.10.1 and CUDA == 11.3 on NVIDIA GeForce RTX 3090. We recommend you to use anaconda to manage dependencies. You may need to change the torch and cuda version in the requirements.txt
according to your computer.
conda create -n libreface python=3.7
conda install pytorch==1.10.0 torchvision==0.11.0 torchaudio==0.10.0 cudatoolkit=11.3 -c pytorch -c conda-forge
conda activate libreface
pip install -r requirements.txt
As described in our paper, we first pre-process the input image by mediapipe to obatain facial landmark and mesh. The detected landmark are used to calculate the corresponding positions of the eyes and mouth in the image. We finally use these positions to align the images and center the face area.
To process the image, simpy run following commad:
python detect_mediapipe.py
Download the DISFA dataset from the official website here. Please be reminded that the original format of the dataset are video sequences, you need to manually process them into image frames.
Download the original video provided by DISFA+. Extract it and put it under the folder data/DISFA
.
Preprocess the images by previous mediapipe script and you should get a dataset folder like below:
data
├── DISFA
│ ├── images
│ ├── landmarks
│ └── aligned_images
├── BP4D
├── AffectNet
└── RAF-DB
cd AU_Recognition
bash train.sh
bash inference.sh
Download the BP4D dataset from the official website. Extract it and put it under the folder data/BP4D
.
Preprocess the images by previous mediapipe script and you should get a dataset folder like below:
data
├── DISFA
│ ├── images
│ ├── landmarks
│ └── aligned_images
├── BP4D
│ ├── images
│ ├── landmarks
│ └── aligned_images
├── AffectNet
└── RAF-DB
cd AU_Detection
bash train.sh
bash inference.sh
Download the AffectNet dataset from the official website. Extract it and put it under the folder data/AffectNet
.
Preprocess the images by previous mediapipe script and you should get a dataset folder like below:
data
├── DISFA
│ ├── images
│ ├── landmarks
│ └── aligned_images
├── BP4D
│ ├── images
│ ├── landmarks
│ └── aligned_images
├── AffectNet
│ ├── images
│ ├── landmarks
│ └── aligned_images
└── RAF-DB
cd Facial_Expression_Recognition
bash train.sh
bash inference.sh
There are several options of flags at the beginning of each train/inference files. Several key options are explained below. Other options are self-explanatory in the codes. Before running our codes, you may need to change the device
, data_root
, ckpt_path
, data
and fold
.
ckpt_path
A relative or absolute folder path for writing checkpoints.data_root
The path to your dataset on your local machine.device
Specify cuda or cpu.data
Dataset to be used. [“DSIFA”,“BP4D”,"AffectNet","RAF-DB"]fold
We use five-fold cross-validation to report performance on DISFA and three-fold cross-validation on BP4D. ["0","1","2","3","4"]train_csv
Training csv file to be parsed.test_csv
Testing csv file to be parsed.fm_distillation
Use feature matching distillation for training.
- Upload Training/Validation Split csv files and CSV creation python code for model training
- Upload Facial Expression Recognition code on RAF-DB Dataset
- Open-Source C# code for LibreFace
Our code is distributed under the USC research license. See LICENSE.txt
file for more information.
@inproceedings{chang2023libreface,
title={LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis},
author={Di Chang and Yufeng Yin and Zongjian Li and Minh Tran and Mohammad Soleymani},
year={2024},
booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
month = {January},
note = {To appear}}
If you have any questions, please raise an issue or email to Di Chang (dchang@ict.usc.edu
or dichang@usc.edu
).
Our code follows several awesome repositories. We appreciate them for making their codes available to public.