HistoCleanQUB / HistoClean

HistoClean is a tool for the preprocessing and augmentation of images used in deep learning models. This easy-to-use application brings together popular image processing packages from across the Python ecosystem, so there is less need to consult each package's documentation. HistoClean provides real-time feedback on augmentation and preprocessing options, allowing users to evaluate each step before applying it.

HistoClean

Welcome to the HistoClean GitHub repository!

HistoClean is an open-source image processing tool for use in developing deep learning models. Here, we bring together the best image manipulation packages into one easy-to-use application.

This repository contains all relevant files relating to the paper "HistoClean: open-source software for histological image pre-processing and augmentation to improve development of robust convolutional neural networks".

WARNING: HistoClean is currently in pre-release and may contain many bugs. It is recommended that you make a copy of any dataset before using the application on it.

The latest binary release can be found Here

Currently this application is only available for Windows, but there are plans to port it to macOS and Linux in the coming weeks.

The current version (v0.1.2) consists of five modules:

  1. Image patching - Divide large images into patches for use in convolutional neural networks or other computer vision tasks. Based on the OpenSlide Python package.

  2. Dataset balancing - Balance any number of image classes by applying random rotation and mirroring to existing images. Class balancing is essential to prevent bias when training deep learning models.

  3. Whitespace filtering - Set a minimum histological tissue (foreground) threshold for images. Allows for the quick and easy removal of non-informative images. Based on the OpenCV library.

  4. Image normalisation - Match the RGB histograms of images to a target image. This helps remove variations in staining. Based on the scikit-image package.

  5. Image preprocessing/augmentation - Apply a wide variety of image processing techniques to your image set. These preprocessing techniques can help accentuate desired features, or add noise to help prevent overfitting during training of deep learning models. This module is based around both the OpenCV and imgaug libraries.
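To make the first three modules more concrete, the sketch below walks through patching, whitespace filtering and class balancing on a tiny synthetic image. It is an illustration only and not HistoClean's own code: it uses plain NumPy rather than OpenSlide or OpenCV, and the function names, the `white_level` and `min_tissue` parameters, the class labels, and the restriction to 90-degree rotations are all assumptions made for the example.

```python
import numpy as np


def split_into_patches(image, patch_size):
    """Cut an H x W x C array into non-overlapping square patches.

    Edge regions that do not fill a whole patch are dropped.
    """
    height, width = image.shape[:2]
    patches = []
    for top in range(0, height - patch_size + 1, patch_size):
        for left in range(0, width - patch_size + 1, patch_size):
            patches.append(image[top:top + patch_size, left:left + patch_size])
    return patches


def tissue_fraction(patch, white_level=220):
    """Fraction of pixels whose mean channel intensity is below white_level.

    Assumes a near-white background, so darker pixels count as tissue.
    """
    gray = patch.mean(axis=2)
    return float((gray < white_level).mean())


def filter_whitespace(patches, min_tissue=0.5, white_level=220):
    """Keep only patches with at least min_tissue foreground."""
    return [p for p in patches if tissue_fraction(p, white_level) >= min_tissue]


def random_rotate_mirror(image, rng):
    """Rotate by a random multiple of 90 degrees, then mirror half the time."""
    rotated = np.rot90(image, k=int(rng.integers(0, 4)))
    if rng.integers(0, 2):
        rotated = np.fliplr(rotated)
    return rotated


def balance_classes(classes, seed=0):
    """Top up each class to the size of the largest one with augmented copies."""
    rng = np.random.default_rng(seed)
    target = max(len(images) for images in classes.values())
    balanced = {}
    for label, images in classes.items():
        extra = [
            random_rotate_mirror(images[int(rng.integers(0, len(images)))], rng)
            for _ in range(target - len(images))
        ]
        balanced[label] = list(images) + extra
    return balanced


# Synthetic 8 x 8 RGB image: white background with a dark 4 x 4 block of
# "tissue" in the top-left corner.
slide = np.full((8, 8, 3), 255, dtype=np.uint8)
slide[:4, :4] = 90

patches = split_into_patches(slide, patch_size=4)
kept = filter_whitespace(patches, min_tissue=0.5)

# Hypothetical labels: one class with a single image, one with four.
balanced = balance_classes({"class_a": kept, "class_b": patches})
print(len(patches), len(kept), {k: len(v) for k, v in balanced.items()})
```

In this sketch the smaller class is padded with rotated or mirrored copies of its own images until both classes are the same size, which mirrors the idea behind the dataset balancing module without claiming to reproduce its implementation.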

License: GNU Affero General Public License v3.0


Languages

Language: Python 100.0%