facial-expression-recognition

This is a class project for a Data Structures & Algorithms course. The task was to implement a system that recognizes human facial action units (AUs) from facial images. Each query image is represented by a predefined feature set (a vector of 5,632 elements) and denoted by xi. A template image dataset Y = {y1, …, yN} is given, with N = 138 images in this project. AUs are recognized with a k-nearest-neighbor method: xi is compared for similarity against each template image yj, and the 10 nearest neighbors of xi are selected.

Similarity is measured with the cosine similarity function, so the most similar template image (the closest neighbor of xi) is the one with the maximum similarity value.
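For reference, cosine similarity can be sketched as below. This is only an illustration, assuming each image is stored as a plain `std::vector<double>`; the function name `cosineSimilarity` and the handling of zero vectors are assumptions, not taken from the project code.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: cosine similarity between two feature vectors.
// Assumes both vectors have the same length (5,632 in this project).
// Returns 0.0 when either vector has zero norm; this is an assumption
// made here only to avoid dividing by zero.
double cosineSimilarity(const std::vector<double>& x,
                        const std::vector<double>& y) {
    double dot = 0.0, normX = 0.0, normY = 0.0;
    for (std::size_t i = 0; i < x.size() && i < y.size(); ++i) {
        dot   += x[i] * y[i];
        normX += x[i] * x[i];
        normY += y[i] * y[i];
    }
    if (normX == 0.0 || normY == 0.0) return 0.0;
    return dot / (std::sqrt(normX) * std::sqrt(normY));
}
```

The result lies in [-1, 1], and a larger value means a closer neighbor, which is why the nearest template is the one with the maximum value.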

The methodology I use to solve this problem includes three parts:

Reading the input query and template files into one- or two-dimensional vectors for later computation. Six file-reading routines were compared for time efficiency: (a) using the C API to read the file directly into a string, then parsing it through a stringstream into a vector of doubles; (b) using C++ streams to read into a string, then converting it to a vector of doubles; (c) using istreambuf_iterator to iterate quickly over the file's stream buffer, then converting to doubles; (d) also using istream, but sizing the string first and then reading the data into it; (e) copying the file into another stream via operator<< on its internal buffer, then parsing the resulting string into a vector; (f) using getline on an istream to convert each line into a one-dimensional vector and combining the lines into a two-dimensional vector.
Computing the similarity values and optimizing that computation. The brute-force approach computes the similarity between xi and each of the 138 templates yj. The proposed algorithm instead sets an upper bound on the similarity to filter out unpromising templates, and computes the exact similarity only for templates with a higher upper bound. Its efficiency is hard to analyze theoretically, but in experiments it improved computing time by about 33%.
Selecting the 10 largest similarity values for each query file and sorting them. A modified quicksort is applied to the 138 results: a partial quicksort finds the pivot corresponding to the 10th largest value in O(n) time, and a second quicksort then sorts the ten largest, still unsorted, results. The overall cost is O(n + k log k), where k is the number of largest values kept (10 in this case).
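The two-phase selection in the third part can be illustrated with standard-library calls. The sketch below is not the project's modified quicksort: it swaps in `std::nth_element` (average linear time) for the partitioning phase and `std::sort` for the final ordering, which gives the same O(n + k log k) shape on average. The name `topK` and the (similarity, template index) pair layout are assumptions.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

using Score = std::pair<double, int>;  // (similarity, template index)

// Hypothetical helper: returns the k largest scores in descending order.
// The vector is taken by value so the caller's data is left untouched.
std::vector<Score> topK(std::vector<Score> scores, std::size_t k) {
    k = std::min(k, scores.size());
    // Phase 1: partition so that the k largest scores occupy the first
    // k positions, in no particular order (average O(n)).
    std::nth_element(scores.begin(), scores.begin() + k, scores.end(),
                     std::greater<Score>());
    // Phase 2: sort only those k scores (O(k log k)).
    std::sort(scores.begin(), scores.begin() + k, std::greater<Score>());
    scores.resize(k);
    return scores;
}
```

With n = 138 and k = 10, calling `topK(scores, 10)` on the similarity table of one query would return its ten nearest templates, best first.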

Summary:

Six different I/O routines were compared for reading large files into vectors.
An upper-bound algorithm was designed to speed up the cosine similarity computation; it improved computing time by 33% compared to brute force.
A partial quicksort algorithm is applied to sort the k largest values in O(n + k log k) time.
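This README does not say which upper bound the project uses, so the following only sketches the general idea under an assumed bound. The dot product is computed exactly over the first `split` elements, and the remainder is bounded by the Cauchy-Schwarz inequality (the tail dot product cannot exceed the product of the two tail norms). A template is skipped when even this optimistic value cannot beat the current k-th best similarity. All names (`Prepared`, `prepare`, `topKPruned`, `split`) are hypothetical.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <functional>
#include <queue>
#include <utility>
#include <vector>

struct Prepared {
    std::vector<double> v;
    double norm;      // Euclidean norm over all elements
    double tailNorm;  // Euclidean norm over elements [split, end)
};

// Precompute the norms once per vector (assumed split point, same for all).
Prepared prepare(const std::vector<double>& v, std::size_t split) {
    double head = 0.0, tail = 0.0;
    for (std::size_t i = 0; i < v.size(); ++i) {
        if (i < split) head += v[i] * v[i];
        else           tail += v[i] * v[i];
    }
    return {v, std::sqrt(head + tail), std::sqrt(tail)};
}

static double dotRange(const std::vector<double>& a,
                       const std::vector<double>& b,
                       std::size_t from, std::size_t to) {
    double s = 0.0;
    for (std::size_t i = from; i < to; ++i) s += a[i] * b[i];
    return s;
}

// Returns up to k (similarity, template index) pairs, best first.
// Assumes all vectors have equal length and were prepared with `split`.
// Floating-point rounding of the bound is ignored in this sketch.
std::vector<std::pair<double, int>> topKPruned(const Prepared& x,
                                               const std::vector<Prepared>& ys,
                                               std::size_t k,
                                               std::size_t split) {
    using Score = std::pair<double, int>;
    std::vector<Score> out;
    if (k == 0) return out;
    split = std::min(split, x.v.size());
    // Min-heap: top() is the weakest of the current k best.
    std::priority_queue<Score, std::vector<Score>, std::greater<Score>> best;
    for (std::size_t j = 0; j < ys.size(); ++j) {
        const Prepared& y = ys[j];
        const double denom = x.norm * y.norm;
        if (denom == 0.0) continue;  // zero vectors are skipped (assumption)
        const double head = dotRange(x.v, y.v, 0, split);
        // Upper bound: exact head plus Cauchy-Schwarz bound on the tail.
        const double bound = (head + x.tailNorm * y.tailNorm) / denom;
        if (best.size() == k && bound <= best.top().first) continue;
        const double sim =
            (head + dotRange(x.v, y.v, split, x.v.size())) / denom;
        if (best.size() < k) {
            best.push(Score(sim, static_cast<int>(j)));
        } else if (sim > best.top().first) {
            best.pop();
            best.push(Score(sim, static_cast<int>(j)));
        }
    }
    while (!best.empty()) { out.push_back(best.top()); best.pop(); }
    std::reverse(out.begin(), out.end());
    return out;
}
```

How much work this saves depends on the data and on the choice of `split`; the 33% figure above is the project's own measurement and is not something this sketch reproduces.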

This program recognizes AUs (action units) using the k-nearest-neighbor method. Compile procedure:

Put the code in the same folder as the query and template data files.
Open a terminal (command line).
Type "cd Desktop" to go to the Desktop folder.
Type "cd data" to go to the data folder.
Type "g++ FacialRecognization.cpp" to compile.
Type "./a.out" to run the program.

About

Developed a program for facial expression recognition. Designed and evaluated six different I/O routines for reading large files into vectors. Adopted an upper-bound algorithm that improved the cosine similarity computing time by 33 percent compared to brute force. Applied a partial quicksort algorithm to sort the k largest values in O(n + k log k) time.
