Fast Genetic Algorithm For Feature Selection — A Qualitative Approximation Approach
2023 (English)In: Evolutionary Computation Conference Companion (GECCO ’23 Companion), July 15–19, 2023, Lisbon, Portugal, New York, NY: Association for Computing Machinery (ACM), 2023, p. 11-12Conference paper, Published paper (Refereed)
Abstract [en]
We propose a two-stage surrogate-assisted evolutionary approach to address the computational issues arising from using Genetic Algorithm (GA) for feature selection in a wrapper setting for large datasets. The proposed approach involves constructing a lightweight qualitative meta-model by sub-sampling data instances and then using this meta-model to carry out the feature selection task. We define "Approximation Usefulness" to capture the necessary conditions that allow the meta-model to lead the evolutionary computations to the correct maximum of the fitness function. Based on our procedure we create CHCQX a Qualitative approXimations variant of the GA-based algorithm CHC (Cross generational elitist selection, Heterogeneous recombination and Cataclysmic mutation). We show that CHCQX converges faster to feature subset solutions of significantly higher accuracy, particularly for large datasets with over 100K instances. We also demonstrate the applicability of our approach to Swarm Intelligence (SI), with results of PSOQX, a qualitative approximation adaptation of the Particle Swarm Optimization (PSO) method. A GitHub repository with the complete implementation is available2. This paper for the Hot-off-the-Press track at GECCO 2023 summarizes the original work published at [3].
References
[1] Mohammed Ghaith Altarabichi, Yuantao Fan, Sepideh Pashami, Peyman Sheikholharam Mashhadi, and Sławomir Nowaczyk. 2021. Extracting invariant features for predicting state of health of batteries in hybrid energy buses. In 2021 ieee 8th international conference on data science and advanced analytics (dsaa). IEEE, 1–6.
[2] Mohammed Ghaith Altarabichi, Sławomir Nowaczyk, Sepideh Pashami, and Peyman Sheikholharam Mashhadi. 2021. Surrogate-assisted genetic algorithm for wrapper feature selection. In 2021 IEEE Congress on Evolutionary Computation (CEC). IEEE, 776–785.
[3] Mohammed Ghaith Altarabichi, Sławomir Nowaczyk, Sepideh Pashami, and Peyman Sheikholharam Mashhadi. 2023. Fast Genetic Algorithm for feature selection—A qualitative approximation approach. Expert systems with applications 211 (2023), 118528.
© 2023 Copyright held by the owner/author(s).
Place, publisher, year, edition, pages
New York, NY: Association for Computing Machinery (ACM), 2023. p. 11-12
Keywords [en]
Evolutionary computation, Feature selection, Fitness approximation, Genetic Algorithm, Particle Swarm Intelligence
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-51635DOI: 10.1145/3583133.3595823Scopus ID: 2-s2.0-85168991798ISBN: 9798400701207 (electronic)OAI: oai:DiVA.org:hh-51635DiVA, id: diva2:1798610
Conference
2023 Genetic and Evolutionary Computation Conference Companion, GECCO 2023 Companion, 15-19 July, 2023
2023-09-192023-09-192023-09-19Bibliographically approved