local-descriptors-for-image-classification

UPDATE: My paper has been accepted! Check it out here.

Each file implements one of the variants of the Local Binary Pattern (LBP).

Center Symmetric LBP
Center Symmetric Local Derivative Pattern
Center Symmetric Local Derivative Mapped Pattern
Center Symmetric Local Mapped Pattern
Center Symmetric Local Ternary Pattern
Extended Center Symmetric Local Binary Pattern
Extended Center Symmetric Local Mapped Pattern
Extended Center Symmetric Local Ternary Pattern

All these files have the same underlying structure with the only difference being in the algorithm being implemented. All algorithms are trained and tested on the CIFAR-10 dataset.

I'll go through the structure of each file now.

The first few lines are the necessary imports.

Reading Input File

def unpickle(file):

    fo = open(file, 'rb')
    dict = cPickle.load(fo)
    fo.close()
    return dict

This function reads in a CIFAR-10 pickle file and stores the data in a dictionary.

Converting to Grayscale

These local descriptors require the input to be in grayscale. The colour2grayscale function implements the conversion formula as defined here.

G_luminance = 0.3R + 0.59G + 0.11B

grayscaleImg = (imRed*0.3 + imGreen*0.59 + imBlue*0.11).astype(int)

Threshold Function

This varies from operator to operator and is defined in the heaviside function.

The Algorithm

First, I pad the image with zeros before running the algorithm.

img = np.concatenate((img, zeroVertical), axis=1)
img = np.concatenate((zeroVertical, img), axis=1)
img = np.concatenate((zeroHorizontal, img), axis=0)
img = np.concatenate((img, zeroHorizontal), axis=0)

The function then goes on to implement the respective algorithm (CS-LBP in this case).

cslbpImg = np.zeros((33, 33))
for x in range(1, 33):
    for y in range(1, 33):		
        s1 = heaviside(img[x-1, y-1] - img[x+1, y+1])
        s2 = heaviside(img[x-1, y] - img[x+1, y])*2 
        s3 = heaviside(img[x-1, y+1] - img[x+1, y-1])*4 
        s4 = heaviside(img[x, y+1] - img[x, y-1])*8

        s = s1 + s2 + s3 + s4

        cslbpImg[x, y] = s

We then compute the histogram of the resultant image to get the feature vector.

hist = np.zeros(16).astype(int)

cslbpImg = cslbpImg.flatten()
for i in cslbpImg:
    hist[i] = hist[i] + 1

Classification

I'm using the XGBoost Classifier for classification.

model = XGBClassifier(n_estimators=800)
model.fit(X_train, y_train)

You have to play with the n_estimators to get the best accuracy.

And that's about it! Feel free to open an issue if required.

Junaid199f / local-descriptors-for-image-classification