OID-ImageClassification

Features

Python 3.6 or higher

The required packages can be installed from requirements.txt.
Other package versions may work too.

Download the Image IDs, Image labels, Boxes and Class Names from https://storage.googleapis.com/openimages/web/download.html
(Train, Validation and Test of "Subset with Image-Level Labels" and Bounding Boxes of "Subset with Bounding Boxes")
Put them in a folder structure like this:
Create folders named out and processing
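The two folders can also be created from Python. This is a minimal sketch, not part of the repository's scripts; it assumes the folders belong in the repository root (the current working directory), and the helper name `create_working_folders` is hypothetical.

```python
from pathlib import Path


def create_working_folders(base_dir="."):
    """Create the `out` and `processing` folders under base_dir if missing."""
    base = Path(base_dir)
    created = []
    for name in ("out", "processing"):
        folder = base / name
        # exist_ok=True makes the call safe to repeat on later runs.
        folder.mkdir(parents=True, exist_ok=True)
        created.append(folder)
    return created


if __name__ == "__main__":
    for folder in create_working_folders():
        print("ready:", folder.resolve())
```

Running it twice is harmless, since existing folders are left untouched.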
Run the script 1_create_class_id_to_image_ids.py
Output:
Run the script 2_create_class_list_by_image_count.py
Output:
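The two scripts above are not reproduced here. As a rough illustration of the idea behind them, the sketch below groups image IDs by class and then ranks classes by image count. It assumes the label CSV has the columns ImageID, Source, LabelName and Confidence; the function names, sample rows and label IDs are made up for the example and may differ from what the scripts actually do.

```python
import csv
import io
from collections import defaultdict


def build_class_to_image_ids(label_rows):
    """Map each label ID to the set of image IDs that carry it."""
    class_to_images = defaultdict(set)
    for row in label_rows:
        # Assumed convention: Confidence 1 means the label is present,
        # 0 means it was verified as absent.
        if float(row["Confidence"]) > 0:
            class_to_images[row["LabelName"]].add(row["ImageID"])
    return class_to_images


def classes_by_image_count(class_to_images, class_names):
    """Return (display name, image count) pairs, most frequent class first."""
    counts = [
        (class_names.get(label, label), len(image_ids))
        for label, image_ids in class_to_images.items()
    ]
    # Sort by descending count, then by name to keep ties deterministic.
    return sorted(counts, key=lambda pair: (-pair[1], pair[0]))


# Tiny made-up sample standing in for the downloaded label CSV.
SAMPLE_LABELS = """ImageID,Source,LabelName,Confidence
img1,verification,/m/01,1
img2,verification,/m/01,1
img2,verification,/m/02,1
img3,verification,/m/02,0
"""
SAMPLE_NAMES = {"/m/01": "Cat", "/m/02": "Dog"}  # made-up label IDs

if __name__ == "__main__":
    rows = csv.DictReader(io.StringIO(SAMPLE_LABELS))
    mapping = build_class_to_image_ids(rows)
    for name, count in classes_by_image_count(mapping, SAMPLE_NAMES):
        print(name, count)
```

A ranking like this makes it easier to pick classes that have enough images to train on.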
Choose class names to train your classifier on from out/class_list_by_image_count and put them into a .txt file inside in/class_lists
Example:
Adjust all options in config.py under # image download to your liking
Run the script 3_download_images.py
Example Output:
Run the script 4_delete_corrupt_images.py
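One common way to spot truncated downloads is sketched below. It is a simple heuristic that uses only the standard library and is not necessarily what 4_delete_corrupt_images.py does: a JPEG file is treated as suspect unless it starts with the start-of-image marker and ends with the end-of-image marker. The function names and the folder argument are hypothetical, and a decoder-based check would be stricter.

```python
import sys
from pathlib import Path

JPEG_START = b"\xff\xd8"  # start-of-image marker
JPEG_END = b"\xff\xd9"    # end-of-image marker


def looks_like_complete_jpeg(path):
    """Heuristic: the file begins and ends with the JPEG markers."""
    data = Path(path).read_bytes()
    return (
        len(data) >= 4
        and data.startswith(JPEG_START)
        and data.endswith(JPEG_END)
    )


def find_suspect_images(folder):
    """Return .jpg files under folder that fail the marker check."""
    return sorted(
        p for p in Path(folder).rglob("*.jpg")
        if not looks_like_complete_jpeg(p)
    )


def delete_suspect_images(folder, dry_run=True):
    """List suspect files; delete them only when dry_run is False."""
    suspects = find_suspect_images(folder)
    if not dry_run:
        for path in suspects:
            path.unlink()
    return suspects


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_images.py <image_folder>  (dry run only)
    for path in delete_suspect_images(sys.argv[1], dry_run=True):
        print("suspect:", path)
```

The dry-run default lists candidates without deleting anything, which is a safer first pass on a large download.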
Adjust all options in config.py under # model training to your liking
Run the script 5_train_model.py
Output:

Now you have a TensorFlow image classifier at out/saved_model
If you killed the previous script because it took too long, run 6_extract_model_from_checkpoint.py
Run the script 7_evaluate_model.py
Output:
DONE

The dataset is very noisy, so you might have to manually delete images that do not fit the label
Make sure you have enabled GPU support: https://www.tensorflow.org/install/gpu
Place your dataset on an SSD (500 MB/s should be enough) for faster training
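To confirm that GPU support is active before a long training run, a quick check along these lines may help. It is a sketch that assumes TensorFlow 2.1 or newer, where `tf.config.list_physical_devices` is available; the helper name `visible_gpu_count` is hypothetical.

```python
def visible_gpu_count():
    """Return how many GPUs TensorFlow can see, or None if it is not installed."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    # Assumes TensorFlow 2.1+; older releases expose this under
    # tf.config.experimental instead.
    return len(tf.config.list_physical_devices("GPU"))


if __name__ == "__main__":
    count = visible_gpu_count()
    if count is None:
        print("TensorFlow is not installed in this environment")
    elif count == 0:
        print("No GPU visible; training would run on the CPU")
    else:
        print("GPUs visible to TensorFlow:", count)
```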

A collection of scripts to download data, train and evaluate an image classifier on Open Images using TensorFlow

MIT License
