DoAI

Do AI models learn better if they see more classes?

(SWENG 2021 Group 6)

Overview

Classification accuracy is the usual metric for measuring model performance in machine learning. However, it can be misleading for several reasons: data imbalance, the percentage of samples incorrectly classified, and the number of classes. This project addresses the last of these.

The simplest computer vision classification problems to consider are based on the CIFAR-10 and CIFAR-100 datasets. The best reported performance is 99.70% on CIFAR-10 and 96.08% on CIFAR-100.
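One way to see why the number of classes matters is to compare each score against chance level. The sketch below is an illustration only, not the project's method: it reads the two reported figures as accuracies, assumes balanced classes (so uniform guessing scores 1/K on K classes), and rescales accuracy so that chance maps to 0 and a perfect score maps to 1. The function name chance_corrected_accuracy is hypothetical.

```python
def chance_corrected_accuracy(accuracy: float, num_classes: int) -> float:
    """Rescale accuracy so uniform guessing maps to 0.0 and a perfect score to 1.0.

    Assumes balanced classes, so chance level is 1 / num_classes.
    """
    if num_classes < 2:
        raise ValueError("num_classes must be at least 2")
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be a fraction between 0 and 1")
    chance = 1.0 / num_classes
    return (accuracy - chance) / (1.0 - chance)


# Figures quoted above, read as accuracies (an assumption of this sketch).
reported = {"CIFAR-10": (0.9970, 10), "CIFAR-100": (0.9608, 100)}

for name, (acc, k) in reported.items():
    corrected = chance_corrected_accuracy(acc, k)
    print(f"{name}: raw={acc:.4f} chance={1 / k:.2f} corrected={corrected:.4f}")
```

Random guessing already earns 10% on CIFAR-10 but only 1% on CIFAR-100, so the same raw accuracy represents more lift over chance on the 100-class problem.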

The research question

Does a model do a better job on CIFAR-10 (10 classes) or on CIFAR-100 (100 classes)?

To answer this question, we need a way to statistically quantify model performance when the number of classes (categories) changes.

Methods

This work is straightforward: the data and code are publicly available. PyTorch handles data loading and provides the deep learning models. We use the t-test statistic to investigate the problem.
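The README does not say which form of the t-test is used or what the samples are, so the following is only a minimal sketch under stated assumptions: each sample is a list of test accuracies from repeated training runs, and the two samples are compared with Welch's unequal-variance t-test. The function name welch_t and the accuracy values are hypothetical placeholders, not results from this project.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's two-sample t-test with unequal variances."""
    na, nb = len(a), len(b)
    if na < 2 or nb < 2:
        raise ValueError("each sample needs at least two observations")
    # Squared standard error of each sample mean (sample variance / n).
    va = variance(a) / na
    vb = variance(b) / nb
    se2 = va + vb
    if se2 == 0:
        raise ValueError("both samples have zero variance")
    t = (mean(a) - mean(b)) / math.sqrt(se2)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = se2 ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df


# Hypothetical per-run accuracies, made up purely for illustration.
cifar10_runs = [0.951, 0.948, 0.953, 0.950, 0.949]
cifar100_runs = [0.782, 0.779, 0.785, 0.781, 0.777]

t_stat, dof = welch_t(cifar10_runs, cifar100_runs)
print(f"t = {t_stat:.2f}, degrees of freedom = {dof:.1f}")
```

If SciPy is installed, scipy.stats.ttest_ind(a, b, equal_var=False) computes the same statistic and also returns a p-value. Welch's variant is chosen here because there is no reason to expect equal run-to-run variance on the two datasets.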

We built the tests on top of PyTorch models. It would be interesting to test other data sets containing more categories.

About

MIT License
