sbcblab / GenExpFS

Evaluation and comparison of Feature Selection algorithms over gene expression microarray datasets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Analysis and Comparison of Feature Selection Methods Towards Performance and Stability

This project aims to evaluate and compare Feature Selection algorithms.

The amount of gathered data is increasing at unprecedented rates for machine learning applications such as natural language processing, computer vision, and bioinformatics. This increase implies a higher number of samples and features; thus, some problems regarding highly dimensioned data arise. The curse of dimensionality, small samples, noisy or redundant features, and biased data are among them. Feature selection is fundamental to dealing with such problems. It reduces the data dimensionality by selecting the most relevant and less redundant features. Thus, reducing the computational cost, improving accuracy, and enhancing the data’s interpretability to machine learning models and domain experts. However, there are several selectors options from which to choose. This work compares some of the most representative algorithms from different feature selection groups regarding a broad range of measures, several datasets, and different strategies from diverse perspectives. We employ metrics to appraise selection accuracy, selection redundancy, prediction performance, algorithmic stability, selection reliability, and computational time of several feature selection algorithms. The results highlight the strengths and weaknesses of these algorithms and can guide their application.

How to cite

If you use our code, methods, or results in your research, please consider citing the main publication:

  • Matheus Cezimbra Barbieri, Bruno Iochins Grisci, Marcio Dorn. Analysis and Comparison of Feature Selection Methods Towards Performance and Stability, Expert Systems with Applications, 123667, March 2024, DOI: https://doi.org/10.1016/j.eswa.2024.123667

Bibtex entry:

@article{barbieri2024analysis,
  title={Analysis and comparison of feature selection methods towards performance and stability},
  author={Barbieri, Matheus Cezimbra and Grisci, Bruno Iochins and Dorn, M{\'a}rcio},
  journal={Expert Systems with Applications},
  pages={123667},
  year={2024},
  publisher={Elsevier}
}

Contact information

About

Evaluation and comparison of Feature Selection algorithms over gene expression microarray datasets


Languages

Language:Python 99.4%Language:Makefile 0.6%