luisdavid64 / HSI-BandSelection

Developing Low-Cost Multispectral Imagers using Inter-Band Redundancy Analysis and Greedy Spectral Selection in Hyperspectral Imaging.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Hyperspectral Band Selection for Multispectral Image Classification with Convolutional Networks

Description

In recent years, Hyperspectral Imaging (HSI) has become a powerful source for reliable data in applications such as agriculture, remote sensing, and biomedicine. However, hyperspectral images are highly data dense and often benefit from methods to reduce the number of spectral bands while retaining the most useful information for a specific application. Here, we present a novel band selection method to determine relevant wavelengths obtained from an HSI system in the context of image classification. Our approach consists of two main steps: the first, called Inter-Band Redundancy Analysis (IBRA), finds relevant spectral bands based on a collinearity analysis between a band and its neighbors in order to determine the distance we need to move away from it to find sufficiently distinct bands. The analysis of the distribution of this metric across the spectrum helps to remove redundant bands and dramatically reduces the search space. The second, called Greedy Spectral Selection (GSS) uses the reduced set of bands and selects the top-k bands, where k is the desired number of bands, according to their information entropy values; then, the band that presents the most severe indication of multicollinearity is removed from the current selection and the next available pre-selected band is considered if the classification performance improves. We present the classification results obtained from our method and compare them to other feature selection methods on two hyperspectral image datasets. Furthermore, we use the original hyperspectral data cube to simulate the process of using actual filters in a multispectral imager. We show that our method produces more suitable results for a multispectral sensor design.

Datasets

We used an in-greenhouse controlled HSI dataset of Kochia leaves in order to classify three different herbicide-resistance levels (herbicide-susceptible, dicamba-resistant, and glyphosate-resistant). A total of 76 images of kochia with varying spatial resolution and 300 spectral bands ranging from 387.12 to 1023.5 nm were captured. From these images, which were previously calibrated and converted to reflectance values, we manually extracted 6,316 25x25 pixel overlapping patches. The Kochia dataset can be downloaded from here.

Furthermore, we used two well-known remote sensing HSI dataset: Indian Pines (IP) and Salinas (SA), which can be downloaded from here or from the "Data" folder from this repository.

Usage

This repository contains the following scripts:

  • interBandRedundancy.py: Executes both the pre-selection and final selection method for a desired number of spectral bands.

  • TrainSelection.py: Class used to train and validate a classifier on the selected dataset type. The main parameters of this class are:

      *nbands: Desired number of bands.
      *method: Band selection method. Options: 'SSA', 'OCF', 'GA', 'PLS', 'FNGBS', 'FullSpec'.
      *classifier: Type of model used to train the classifiers. Options: 'CNN', 'SVM', 'RF'.
      *transform: Flag used to simulate Gaussian bandwidths.
      *average: Flag used to reduce the number of bands averaging consecutive bands.
      *batch_size: Size of the batch used for training.
      *epochs: Number of epochs used for training.
      *data: Name of the dataset. Options: 'Kochia', 'Avocado'
      *size: Percentage of the dataset used for the experiments.
      *selection: Load only the selected bands from the dataset
      *th: Optional index to add in the end of the generated files
      *median: If True, perform a median filtering on the spectral bands.
      *pca: If True, we use the IBRA method to form a set of candidate bands and then we reduce 
      the number of bands using PCA.
    
  • utils.py: Additional methods used to transform the data and calculate the metrics.

  • ClassificationStrategy/network.py: Contains all the network architectures used in this work.

  • ClassificationStrategy/ModelStrategy.py: Interface class used to train and validate a classifier on the selected dataset type. The parameters of the constructor of this class are:

      *classifier: Type of classifier. Options: 'CNN', 'ANN', 'SVM', or 'RF'.
      *data: Type of data. Options: 'Kochia', 'Avocado', 'IP', or 'SA'.
      *device: Type of device used for training (Used for the CNN).
      *nbands: Number of selected spectral ban.
      *windowSize: Window size (Used for the CNN).
      *train_y: Target data.
      *classes: Number of classes.
    

    The parameters of the trainFold method are:

      *trainx: Training set.
      *train_y: Target data of the entire dataset (training + validation sets).
      *train: List of training indexes
      *batch_size: Size of the mini-batch (Used for the CNN).
      *epochs: Number of epochs used to train a CNN.
      *valx: Validation set.
      *test: List of test indexes
      *means: Mean of each spectral band calculated in the training set.
      *stds: Standard deviation of each spectral band calculated in the training set.
      *filepath: Path used to store the trained model.
      *printProc: If True, prints all the training process
    

    The parameters of the evaluateFold method are:

      *valx: Validation set.
      *train_y: Target data.
      *test: List of test indexes
      *means: Mean of each spectral band calculated in the training set.
      *stds: Standard deviation of each spectral band calculated in the training set.
      *batch_size: Size of the mini-batch (Used for the CNN).
    

Results: Metrics Comparison

alt text

alt text

Citation

Use this Bibtex to cite this repository

@Article{rs13183649,
AUTHOR = {Morales, Giorgio and Sheppard, John W. and Logan, Riley D. and Shaw, Joseph A.},
TITLE = {Hyperspectral Dimensionality Reduction Based on Inter-Band Redundancy Analysis and Greedy Spectral Selection},
JOURNAL = {Remote Sensing},
VOLUME = {13},
YEAR = {2021},
NUMBER = {18},
ARTICLE-NUMBER = {3649},
URL = {https://www.mdpi.com/2072-4292/13/18/3649},
ISSN = {2072-4292},
DOI = {10.3390/rs13183649}
}
@misc{2106.00645,
Author = {Giorgio Morales and John Sheppard and Riley Logan and Joseph Shaw},
Title = {Hyperspectral Band Selection for Multispectral Image Classification with Convolutional Networks},
Year = {2021},
Eprint = {arXiv:2106.00645},
}

About

Developing Low-Cost Multispectral Imagers using Inter-Band Redundancy Analysis and Greedy Spectral Selection in Hyperspectral Imaging.


Languages

Language:Python 72.6%Language:Jupyter Notebook 27.4%