Unsupervised feature selection based on variance–covariance subspace distanceShow others and affiliations
2023 (English)In: Neural Networks, ISSN 0893-6080, E-ISSN 1879-2782, Vol. 166, p. 188-203Article in journal (Refereed) Published
Abstract [en]
Subspace distance is an invaluable tool exploited in a wide range of feature selection methods. The power of subspace distance is that it can identify a representative subspace, including a group of features that can efficiently approximate the space of original features. On the other hand, employing intrinsic statistical information of data can play a significant role in a feature selection process. Nevertheless, most of the existing feature selection methods founded on the subspace distance are limited in properly fulfilling this objective. To pursue this void, we propose a framework that takes a subspace distance into account which is called “Variance–Covariance subspace distance”. The approach gains advantages from the correlation of information included in the features of data, thus determines all the feature subsets whose corresponding Variance–Covariance matrix has the minimum norm property. Consequently, a novel, yet efficient unsupervised feature selection framework is introduced based on the Variance–Covariance distance to handle both the dimensionality reduction and subspace learning tasks. The proposed framework has the ability to exclude those features that have the least variance from the original feature set. Moreover, an efficient update algorithm is provided along with its associated convergence analysis to solve the optimization side of the proposed approach. An extensive number of experiments on nine benchmark datasets are also conducted to assess the performance of our method from which the results demonstrate its superiority over a variety of state-of-the-art unsupervised feature selection methods. The source code is available at https://github.com/SaeedKarami/VCSDFS. © 2023 The Author(s)
Place, publisher, year, edition, pages
Oxford: Elsevier, 2023. Vol. 166, p. 188-203
Keywords [en]
Feature selection, Regularization, Subspace distance, Subspace learning
National Category
Computer Vision and Robotics (Autonomous Systems)
Identifiers
URN: urn:nbn:se:hh:diva-51410DOI: 10.1016/j.neunet.2023.06.018ISI: 001047817500001Scopus ID: 2-s2.0-85165713579OAI: oai:DiVA.org:hh-51410DiVA, id: diva2:1788076
Projects
CALLISTOE-VitaScaDS.AI
Funder
EU, Horizon 2020, 101016775
Note
Funding: We acknowledge the support of following EU projects: CALLISTO (101004152), E-Vita (101016453), and the ScaDS.AI (Center for Scalable Data Analytics and Artificial Intelligence) Dresden/Leipzig (IS18026A-F). This work was also supported by the Research Council of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI, and grants 336033, 352986, 358246) and EU (H2020 grant 101016775 and NextGenerationEU).
2023-08-152023-08-152023-10-05Bibliographically approved