hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Surrogate-Assisted Genetic Algorithm for Wrapper Feature Selection
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-6040-2269
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-7796-5201
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0003-3272-4145
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research.ORCID iD: 0000-0002-0051-0954
2021 (English)In: 2021 IEEE Congress on Evolutionary Computation (CEC), IEEE, 2021, p. 776-785Conference paper, Published paper (Refereed)
Abstract [en]

Feature selection is an intractable problem, therefore practical algorithms often trade off the solution accuracy against the computation time. In this paper, we propose a novel multi-stage feature selection framework utilizing multiple levels of approximations, or surrogates. Such a framework allows for using wrapper approaches in a much more computationally efficient way, significantly increasing the quality of feature selection solutions achievable, especially on large datasets. We design and evaluate a Surrogate-Assisted Genetic Algorithm (SAGA) which utilizes this concept to guide the evolutionary search during the early phase of exploration. SAGA only switches to evaluating the original function at the final exploitation phase.

We prove that the run-time upper bound of SAGA surrogate-assisted stage is at worse equal to the wrapper GA, and it scales better for induction algorithms of high order of complexity in number of instances. We demonstrate, using 14 datasets from the UCI ML repository, that in practice SAGA significantly reduces the computation time compared to a baseline wrapper Genetic Algorithm (GA), while converging to solutions of significantly higher accuracy. Our experiments show that SAGA can arrive at near-optimal solutions three times faster than a wrapper GA, on average. We also showcase the importance of evolution control approach designed to prevent surrogates from misleading the evolutionary search towards false optima.

Place, publisher, year, edition, pages
IEEE, 2021. p. 776-785
Keywords [en]
Feature selection, Wrapper, Genetic Algorithm, Progressive Sampling, Surrogates, Meta-models, Evolution Control, Optimization
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-45893DOI: 10.1109/CEC45853.2021.9504718ISI: 000703866100098Scopus ID: 2-s2.0-85122940013ISBN: 978-1-7281-8393-0 (electronic)OAI: oai:DiVA.org:hh-45893DiVA, id: diva2:1612170
Conference
2021 IEEE Congress on Evolutionary Computation (CEC), Kraków, Poland, 28 June - 1 July, 2021
Projects
EVE – Extending Life of Vehicles within Electromobility Era
Note

Som manuscript i avhandling/As manuscript in thesis

Available from: 2021-11-17 Created: 2021-11-17 Last updated: 2024-01-24Bibliographically approved
In thesis
1. Evolving intelligence: Overcoming challenges for Evolutionary Deep Learning
Open this publication in new window or tab >>Evolving intelligence: Overcoming challenges for Evolutionary Deep Learning
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep Learning (DL) has achieved remarkable results in both academic and industrial fields over the last few years. However, DL models are often hard to design and require proper selection of features and tuning of hyper-parameters to achieve high performance. These selections are tedious for human experts and require substantial time and resources. A difficulty that encouraged a growing number of researchers to use Evolutionary Computation (EC) algorithms to optimize Deep Neural Networks (DNN); a research branch called Evolutionary Deep Learning (EDL).

This thesis is a two-fold exploration within the domains of EDL, and more broadly Evolutionary Machine Learning (EML). The first goal is to makeEDL/EML algorithms more practical by reducing the high computational costassociated with EC methods. In particular, we have proposed methods to alleviate the computation burden using approximate models. We show that surrogate-models can speed up EC methods by three times without compromising the quality of the final solutions. Our surrogate-assisted approach allows EC methods to scale better for both, expensive learning algorithms and large datasets with over 100K instances. Our second objective is to leverage EC methods for advancing our understanding of Deep Neural Network (DNN) design. We identify a knowledge gap in DL algorithms and introduce an EC algorithm precisely designed to optimize this uncharted aspect of DL design. Our analytical focus revolves around revealing avant-garde concepts and acquiring novel insights. In our study of randomness techniques in DNN, we offer insights into the design and training of more robust and generalizable neural networks. We also propose, in another study, a novel survival regression loss function discovered based on evolutionary search.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2024. p. 32
Series
Halmstad University Dissertations ; 109
Keywords
neural networks, evolutionary deep learning, evolutionary machine learning, feature selection, hyperparameter optimization, evolutionary computation, particle swarm optimization, genetic algorithm
National Category
Computer Systems Signal Processing
Identifiers
urn:nbn:se:hh:diva-52469 (URN)978-91-89587-31-1 (ISBN)978-91-89587-32-8 (ISBN)
Public defence
2024-02-16, Wigforss, Kristian IV:s väg 3, Halmstad, 08:00 (English)
Opponent
Supervisors
Available from: 2024-01-24 Created: 2024-01-24 Last updated: 2024-03-07

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Altarabichi, Mohammed GhaithNowaczyk, SławomirPashami, SepidehSheikholharam Mashhadi, Peyman

Search in DiVA

By author/editor
Altarabichi, Mohammed GhaithNowaczyk, SławomirPashami, SepidehSheikholharam Mashhadi, Peyman
By organisation
CAISR - Center for Applied Intelligent Systems Research
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 386 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf