hh.sePublications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 139) Show all publications
Shahbazi, Z. & Nowaczyk, S. (2025). Towards personalized cardiometabolic risk prediction: A fusion of exposome and AI. Heliyon, 11(1), 1-13, Article ID e40859.
Open this publication in new window or tab >>Towards personalized cardiometabolic risk prediction: A fusion of exposome and AI
2025 (English)In: Heliyon, E-ISSN 2405-8440, Vol. 11, no 1, p. 1-13, article id e40859Article in journal (Refereed) Published
Abstract [en]

The influence of the exposome on major health conditions like cardiovascular disease (CVD) is widely recognized. However, integrating diverse exposome factors into predictive models for personalized health assessments remains a challenge due to the complexity and variability of environmental exposures and lifestyle factors. A machine learning (ML) model designed for predicting CVD risk is introduced in this study, relying on easily accessible exposome factors. This approach is particularly novel as it prioritizes non-clinical, modifiable exposures, making it applicable for broad public health screening and personalized risk assessments. Assessments were conducted using both internal and external validation groups from a multi-center cohort, comprising 3,237 individuals diagnosed with CVD in South Korea within twelve years of their baseline visit, along with an equal number of participants without these conditions as a control group. Examination of 109 exposome variables from participants' baseline visits spanned physical measures, environmental factors, lifestyle choices, mental health events, and early-life factors. For risk prediction, the Random Forest classifier was employed, with performance compared to an integrative ML model using clinical and physical variables. Furthermore, data preprocessing involved normalization and handling of missing values to enhance model accuracy. The model's decision-making process were using an advanced explainability method. Results indicated comparable performance between the exposome-based ML model and the integrative model, achieving AUC of 0.82(+/-)0.01, 0.70(+/-)0.01, and 0.73(+/-)0.01. The study underscores the potential of leveraging exposome data for early intervention strategies. Additionally, exposome factors significant in identifying CVD risk were pinpointed, including daytime naps, completed full-time education, past tobacco smoking, frequency of tiredness/unenthusiasm, and current work status. © 2024 The Author(s)

Place, publisher, year, edition, pages
London: Elsevier, 2025
Keywords
Artificial intelligence, Cardiovascular disease, Clinical records, Exposome, Machine learning
National Category
Health Care Service and Management, Health Policy and Services and Health Economy Artificial Intelligence
Identifiers
urn:nbn:se:hh:diva-55197 (URN)10.1016/j.heliyon.2024.e40859 (DOI)2-s2.0-85213285821 (Scopus ID)
Available from: 2025-01-23 Created: 2025-01-23 Last updated: 2025-01-23Bibliographically approved
Altarabichi, M. G., Nowaczyk, S., Pashami, S., Sheikholharam Mashhadi, P. & Handl, J. (2024). A Review of Randomness Techniques in Deep Neural Networks. In: GECCO ’24 Companion, July 14–18, 2024, Melbourne, VIC, Australia: . Paper presented at 2024 Genetic and Evolutionary Computation Conference Companion, GECCO 2024 Companion, Melbourne, VIC, Australia, 14-18 July, 2024 (pp. 23-24). New York, NY: Association for Computing Machinery (ACM)
Open this publication in new window or tab >>A Review of Randomness Techniques in Deep Neural Networks
Show others...
2024 (English)In: GECCO ’24 Companion, July 14–18, 2024, Melbourne, VIC, Australia, New York, NY: Association for Computing Machinery (ACM), 2024, p. 23-24Conference paper, Published paper (Refereed)
Abstract [en]

This paper investigates the effects of various randomization techniques on Deep Neural Networks (DNNs) learning performance. We categorize the existing randomness techniques into four key types: injection of noise/randomness at the data, model structure, optimization or learning stage. We use this classification to identify gaps in the current coverage of potential mechanisms for the introduction of randomness, leading to proposing two new techniques: adding noise to the loss function and random masking of the gradient updates. We use a Particle Swarm Optimizer (PSO) for hyperparameter optimization and evaluate over 30,000 configurations across standard computer vision benchmarks. Our study reveals that data augmentation and weight initialization randomness significantly improve performance, and different optimizers prefer distinct randomization types. The complete implementation and dataset are available on GitHub1. This paper for the Hot-off-the-Press track at GECCO 2024 summarizes the original work published at [2]. © 2024 Copyright held by the owner/author(s).

[2] Mohammed Ghaith Altarabichi, Sławomir Nowaczyk, Sepideh Pashami, Peyman Sheikholharam Mashhadi, and Julia Handl. 2024. Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks. Information Sciences 667 (2024), 120500.

Place, publisher, year, edition, pages
New York, NY: Association for Computing Machinery (ACM), 2024
Keywords
convolutional neural network, deep neural network, hyperparameter, particle swarm optimization, randomized neural networks
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54562 (URN)10.1145/3638530.3664077 (DOI)2-s2.0-85201929793 (Scopus ID)9798400704956 (ISBN)
Conference
2024 Genetic and Evolutionary Computation Conference Companion, GECCO 2024 Companion, Melbourne, VIC, Australia, 14-18 July, 2024
Available from: 2024-09-05 Created: 2024-09-05 Last updated: 2024-09-05Bibliographically approved
Jamshidi, P., Nowaczyk, S. & Rahat, M. (2024). Analysis of characteristic functions on Shapley values in Machine Learning. In: 2024 International Conference on Intelligent Environments (IE): . Paper presented at 20th International Conference on Intelligent Environments, IE 2024, Ljubljana, Slovenia, 17-20 June, 2024 (pp. 70-77). Piscataway, NJ: IEEE
Open this publication in new window or tab >>Analysis of characteristic functions on Shapley values in Machine Learning
2024 (English)In: 2024 International Conference on Intelligent Environments (IE), Piscataway, NJ: IEEE, 2024, p. 70-77Conference paper, Published paper (Refereed)
Abstract [en]

In the rapidly evolving field of AI, Explainable Artificial Intelligence (XAI) has become paramount, particularly in Intelligent Environments applications. It offers clarity and understanding in complex decision-making processes, fostering trust and enabling rigorous scrutiny. The Shapley value, renowned for its accurate quantification of feature importance, has emerged as a prevalent standard in both academic research and practical application. Nevertheless, the Shapley value's reliance on the calculation of all possible coalitions poses a significant computational challenge, as it falls within the class of NP-hard problems. Consequently, approximation techniques are employed in most practical scenarios as a substitute for precise computations. The most common of those is the SHAP (SHapley Additive exPlanations) technique, which quantifies the influence exerted by a specific feature on decision outcomes of a specific Machine Learning model. However, the Shapley value's theoretical underpinnings focus on assessing and understanding feature impact on model evaluation metrics, rather than just alterations in the responses. This paper conducts a comparative analysis using controlled synthetic data with established ground truths. It juxtaposes the practical implementation of the SHAP approach with the theoretical model in two distinct scenarios: one using the F1-score and the other, the accuracy metric. These are two representative characteristic functions, capturing different aspects and whose appropriateness depends on the specific requirements and context of the task to be solved. We analyze how the three alternatives exhibit similarity and disparity in their manifestation of feature effects. We explore the parallels and differences between these approaches in reflecting feature effects. Ultimately, our research seeks to determine the conditions under which SHAP outcomes are more aligned with either the F1-score or the accuracy metric, thereby providing valuable insights for their application in various Intelligent Environment contexts. © 2024 IEEE.

Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2024
Series
International Conference on Intelligent Environments, ISSN 2469-8792, E-ISSN 2472-7571
Keywords
accuracy, F1-score, imbalanced data, Shapley values, XAI
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54471 (URN)10.1109/IE61493.2024.10599897 (DOI)2-s2.0-85200723106 (Scopus ID)979-8-3503-8679-0 (ISBN)
Conference
20th International Conference on Intelligent Environments, IE 2024, Ljubljana, Slovenia, 17-20 June, 2024
Funder
Swedish Research Council, CHIST-ERA-19-XAI-012
Available from: 2024-08-21 Created: 2024-08-21 Last updated: 2024-12-04Bibliographically approved
Calikus, E., Nowaczyk, S. & Dikmen, O. (2024). Context Discovery for Anomaly Detection. International Journal of Data Science and Analytics
Open this publication in new window or tab >>Context Discovery for Anomaly Detection
2024 (English)In: International Journal of Data Science and Analytics, ISSN 2364-415XArticle in journal (Refereed) Epub ahead of print
Abstract [en]

Contextual anomaly detection aims to identify objects that are anomalous only within specific contexts, while appearing normal otherwise. However, most existing methods are limited to a single context defined by user-specified features. In practice, identifying the right context is not trivial, even for domain experts. Moreover, for high-dimensional data, the notion of meaningful contexts that can unveil anomalies becomes substantially more complex. For instance, multiple useful contexts can often capture different phenomena. In this work, we introduce ConQuest, a new unsupervised contextual anomaly detection approach that automatically discovers and incorporates multiple contexts useful for detecting and interpreting anomalies. Through experiments on 25 datasets, we show that ConQuest outperforms various state-of-the-art methods. We also demonstrate its benefits in terms of increased direct interpretability. © The Author(s) 2024.

Place, publisher, year, edition, pages
Heidelberg: Springer, 2024
Keywords
anomaly detection, contextual anomaly detection
National Category
Computer Sciences
Research subject
Smart Cities and Communities
Identifiers
urn:nbn:se:hh:diva-46402 (URN)10.1007/s41060-024-00586-x (DOI)001250244900003 ()2-s2.0-85196295899 (Scopus ID)
Funder
Knowledge Foundation, 20160103
Note

Som manuskript i avhandling / As manuscript in thesis.

Funding: Open access funding provided by Royal Institute of Technology.

Available from: 2022-02-22 Created: 2022-02-22 Last updated: 2025-01-17Bibliographically approved
Kanwal, S., Nowaczyk, S., Rahat, M., Lundström, J. & Khan, F. (2024). Deep Learning for Generating Synthetic Traffic Data. In: Xin-She Yang; Simon Sherratt; Nilanjan Dey; Amit Joshi (Ed.), Proceedings of Ninth International Congress on Information and Communication Technology: ICICT 2024, London, Volume 8. Paper presented at 9th International Congress on Information and Communication Technology, ICICT 2024, London, United Kingdom, 19-22 February, 2024 (pp. 431-454). Singapore: Springer, 1004 LNNS
Open this publication in new window or tab >>Deep Learning for Generating Synthetic Traffic Data
Show others...
2024 (English)In: Proceedings of Ninth International Congress on Information and Communication Technology: ICICT 2024, London, Volume 8 / [ed] Xin-She Yang; Simon Sherratt; Nilanjan Dey; Amit Joshi, Singapore: Springer, 2024, Vol. 1004 LNNS, p. 431-454Conference paper, Published paper (Refereed)
Abstract [en]

The purpose of the study is to demonstrate the feasibility of combining traffic simulator technology with machine learning (ML) methods to create realistic and comprehensive synthetic traffic data. Synthetic data alleviates many ethical and privacy concerns, significantly reduces the costs associated with data collection, and enables researchers to study scenarios and conditions that are difficult or impossible to replicate in real-world environments. Access to large amounts of diverse and controlled data is essential for developing and testing artificial intelligence (AI) models and leads to more reliable and robust results. Traffic simulators like SUMO have been successfully used for that purpose in the past, creating realistic vehicular traces. One drawback is that, without coupling them with complex physics emulators, they are not capable of generating internal vehicle parameters. Such parameters, on the other hand, are crucial for many purposes, from understanding energy efficiency and optimizing driver behavior to predictive maintenance and monitoring the degradation of key components, such as driveline batteries. In this paper, we propose Synthetic Traffic Data Generator (STDG) and demonstrate that an ML model that is trained on the internal parameters of a vehicle in one set of conditions (Sweden) can be used to generate synthetic data corresponding to another setting (Monaco). The proposed method promises to eliminate the need for an expensive collection of the original vehicle parameters across many different settings. Moreover, sharing the synthetic data with additional stakeholders is easier due to the reduced security and integrity risk of exposing the vehicle’s privacy-sensitive original parameters. This study compares several ML techniques, including deep learning (DL) based, for generating internal parameters of vehicles, such as fuel rate, engine speed, and wet tank air pressure. Using the actual bus data from a small city to train our ML models, we attempt to forecast the internal parameters of the buses in various scenarios. The proposed method first utilizes SUMO to generate synthetic waypoints for the bus and then predicts the other parameters using the trained model, thereby producing synthetic data with internal parameters for buses operating in a new urban environment. Our preliminary results indicated that our model is performing well within a 90% confidence interval. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Place, publisher, year, edition, pages
Singapore: Springer, 2024
Series
Lecture Notes in Networks and Systems, ISSN 2367-3370, E-ISSN 2367-3389 ; 1004
Keywords
Deep learning, Machine learning, Synthetic data, Traffic simulation
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54494 (URN)10.1007/978-981-97-3305-7_36 (DOI)001327002400036 ()2-s2.0-85201095610 (Scopus ID)
Conference
9th International Congress on Information and Communication Technology, ICICT 2024, London, United Kingdom, 19-22 February, 2024
Funder
Knowledge FoundationVinnova
Available from: 2024-08-26 Created: 2024-08-26 Last updated: 2024-11-20Bibliographically approved
Jamshidi, P., Nowaczyk, S. & Rahat, M. (2024). EcoShap: Save Computations by only Calculating Shapley Values for Relevant Features. In: Nowaczyk, Sławomir et al. (Ed.), Artificial Intelligence. ECAI 2023 International Workshops: XAI^3, TACTIFUL, XI-ML, SEDAMI, RAAIT, AI4S, HYDRA, AI4AI, Kraków, Poland, September 30 – October 4, 2023, Proceedings, Part I. Paper presented at International Workshops of the 26th European Conference on Artificial Intelligence (ECAI 2023), Kraków, Poland, 30 September-4 October, 2023 (pp. 24-42). Cham: Springer, 1947
Open this publication in new window or tab >>EcoShap: Save Computations by only Calculating Shapley Values for Relevant Features
2024 (English)In: Artificial Intelligence. ECAI 2023 International Workshops: XAI^3, TACTIFUL, XI-ML, SEDAMI, RAAIT, AI4S, HYDRA, AI4AI, Kraków, Poland, September 30 – October 4, 2023, Proceedings, Part I / [ed] Nowaczyk, Sławomir et al., Cham: Springer, 2024, Vol. 1947, p. 24-42Conference paper, Published paper (Refereed)
Abstract [en]

One of the most widely adopted approaches for eXplainable Artificial Intelligence (XAI) involves employing of Shapley values (SVs) to determine the relative importance of input features. While based on a solid mathematical foundation derived from cooperative game theory, SVs have a significant drawback: high computational cost. Calculating the exact SV is an NP-hard problem, necessitating the use of approximations, particularly when dealing with more than twenty features. On the other hand, determining SVs for all features is seldom necessary in practice; users are primarily interested in the most important ones only. This paper introduces the Economic Hierarchical Shapley values (ecoShap) method for calculating SVs for the most crucial features only, with reduced computational cost. EcoShap iteratively expands disjoint groups of features in a tree-like manner, avoiding the expensive computations for the majority of less important features. Our experimental results across eight datasets demonstrate that the proposed technique efficiently identifies top features; at a 50% reduction in computational costs, it can determine between three and seven of the most important features. © The Author(s) 2024.

Place, publisher, year, edition, pages
Cham: Springer, 2024
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 1947
Keywords
Explainable Artificial Intelligence (XAI), Feature Importance, Shapley Value
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-52744 (URN)10.1007/978-3-031-50396-2_2 (DOI)2-s2.0-85184111581 (Scopus ID)978-3-031-50395-5 (ISBN)978-3-031-50396-2 (ISBN)
Conference
International Workshops of the 26th European Conference on Artificial Intelligence (ECAI 2023), Kraków, Poland, 30 September-4 October, 2023
Funder
Swedish Research Council, CHIST-ERA-19-XAI-012
Available from: 2024-02-23 Created: 2024-02-23 Last updated: 2024-12-04Bibliographically approved
Shahbazi, Z. & Nowaczyk, S. (2024). Effective Elements of Climate Change Videos on the YouTube Platform. In: Panagiotis Fotaris (Ed.), Proceedings of the 11th European Conference on Social Media ECSM 2024: . Paper presented at 11th European Conference on Social Media (ECSM 2024): University of Brighton, Brighton, United Kingdom, 30-31 May, 2024 (pp. 243-250). Reading: Academic Conferences and Publishing International Limited, 11(1)
Open this publication in new window or tab >>Effective Elements of Climate Change Videos on the YouTube Platform
2024 (English)In: Proceedings of the 11th European Conference on Social Media ECSM 2024 / [ed] Panagiotis Fotaris, Reading: Academic Conferences and Publishing International Limited, 2024, Vol. 11, no 1, p. 243-250Conference paper, Published paper (Refereed)
Abstract [en]

In an era where visual communication is important, understanding the key components that make climate change videos effective is essential for improving awareness and driving meaningful actions. This research presents an overview of YouTube’s educational content on climate change, aiming to identify elements that contribute to the effectiveness of these videos. We used a questionnaire targeting bachelor’s and master’s students to learn about their preferences regarding the available videos and their beliefs concerning the use of YouTube as an educational platform. A curated list of videos was used to explore how students perceive their influence on personal interest and engagement in climate change. Accordingly, each student watched three videos related to climate change and provided information concerning their impressions. By reviewing various attributes of the videos related to climate change, such as the content structure, engagement, and similar, we extracted the essential characteristics that are associated with more positive reactions to these videos as significant educational tools.

Place, publisher, year, edition, pages
Reading: Academic Conferences and Publishing International Limited, 2024
Series
Proceedings of the European Conference on Social Media, ISSN 2055-7213, E-ISSN 2055-7221
Keywords
YouTube Platform, Climate Changes, Content-based Analysis
National Category
Media and Communications
Identifiers
urn:nbn:se:hh:diva-55260 (URN)10.34190/ecsm.11.1.2134 (DOI)978-1-917204-01-9 (ISBN)978-1-917204-00-2 (ISBN)
Conference
11th European Conference on Social Media (ECSM 2024): University of Brighton, Brighton, United Kingdom, 30-31 May, 2024
Available from: 2025-01-17 Created: 2025-01-17 Last updated: 2025-01-17Bibliographically approved
Shahbazi, Z., Shahbazi, Z. & Nowaczyk, S. (2024). Enhancing Air Quality Forecasting Using Machine Learning Techniques. IEEE Access, 12, 197290-197299
Open this publication in new window or tab >>Enhancing Air Quality Forecasting Using Machine Learning Techniques
2024 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 12, p. 197290-197299Article in journal (Refereed) Published
Abstract [en]

Urbanization is rapidly shaping our world, with more people than ever residing in cities. While cities offer numerous opportunities and conveniences, they also face critical challenges, including air pollution. Addressing these challenges is vital for creating healthier and more liveable urban environments. A transformative solution emerges, bridging the gap between sustainable urban mobility and air quality control through cutting-edge data-driven strategies. Finding a balance between efficient urban living and environmental stewardship is a pressing concern for cities worldwide. In envisioning a future where urban commuting becomes synonymous with eco-friendliness and air quality improvement, a comprehensive platform harnesses the power of data analytics and real-time information to empower commuters and city planners alike. Its intelligent algorithms continuously analyse air quality information, allowing it to predict and address poor air quality. This platform seamlessly integrates with existing urban infrastructure, making it accessible to commuters through user-friendly mobile applications and web interfaces. Commuters can receive personalized recommendations for eco-friendly commuting options. One standout feature is its ability to forecast air quality in urban areas, enabling users to make informed decisions that prioritize their health and environmental sustainability. Encouraging a sense of community among eco-conscious urban residents, it incentivizes sustainable behaviours and offers rewards for reducing emissions. By collecting data on commuting choices and air quality conditions, the platform contributes valuable insights to city authorities for urban planning and pollution control. Representing a paradigm shift in urban living, it aligns individual choices with broader sustainability and air quality goals. It is a testament to the power of technology, data, and community engagement in building smarter, greener, and healthier cities. The proposed approach presents the significance of the system as a transformative solution for sustainable urban living and air quality control, emphasizing the use of cutting-edge technology, data-driven insights, and community engagement to address pressing urban challenges. © 2024 The Authors.

Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2024
Keywords
Air Quality, Artificial Intelligence, Machine Learning, Smart City, Sustainability
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:hh:diva-55179 (URN)10.1109/ACCESS.2024.3516883 (DOI)2-s2.0-85212569013 (Scopus ID)
Available from: 2025-01-08 Created: 2025-01-08 Last updated: 2025-01-08Bibliographically approved
Fan, Y., Nowaczyk, S., Wang, Z. & Pashami, S. (2024). Evaluating Multi-task Curriculum Learning for Forecasting Energy Consumption in Electric Heavy-duty Vehicles. In: Nowaczyk S.; Spiliopoulou M.; Ragni M.; Fink O. (Ed.), Proceedings of Workshop on Embracing Human-Aware AI in Industry 5.0 (HAII5.0 2024): . Paper presented at 2024 Workshop on Embracing Human-Aware AI in Industry 5.0, HAII5.0 2024, Santiago de Compostela, Spain, 19 October, 2024. Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen, 3765
Open this publication in new window or tab >>Evaluating Multi-task Curriculum Learning for Forecasting Energy Consumption in Electric Heavy-duty Vehicles
2024 (English)In: Proceedings of Workshop on Embracing Human-Aware AI in Industry 5.0 (HAII5.0 2024) / [ed] Nowaczyk S.; Spiliopoulou M.; Ragni M.; Fink O., Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen , 2024, Vol. 3765Conference paper, Published paper (Refereed)
Abstract [en]

Accurate energy consumption prediction is crucial for optimising the operation of electric commercial heavy-duty vehicles, particularly for efficient route planning, refining charging strategies, and ensuring optimal truck configuration for specific tasks. This study investigates the application of multi-task curriculum learning to enhance machine learning models for forecasting the energy consumption of various onboard systems in electric vehicles. Multi-task learning, unlike traditional training approaches, leverages auxiliary tasks to provide additional training signals, which has been shown to enhance predictive performance in many domains. By further incorporating curriculum learning, where simpler tasks are learned before progressing to more complex ones, neural network training becomes more efficient and effective. We evaluate the suitability of these methodologies in the context of electric vehicle energy forecasting, examining whether the combination of multi-task learning and curriculum learning enhances algorithm generalisation, even with limited training data. We primarily focus on understanding the efficacy of different curriculum learning strategies, including sequential learning and progressive continual learning, using complex, real-world industrial data. Our research further explores a set of auxiliary tasks designed to facilitate the learning process by targeting key consumption characteristics projected into future time frames. The findings illustrate the potential of multi-task curriculum learning to advance energy consumption forecasting, significantly contributing to the optimisation of electric heavy-duty vehicle operations. This work offers a novel perspective on integrating advanced machine learning techniques to enhance energy efficiency in the exciting field of electromobility. © 2024 Copyright for this paper by its authors.

Place, publisher, year, edition, pages
Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen, 2024
Series
CEUR Workshop Proceedings, ISSN 1613-0073 ; 3765
Keywords
Curriculum Learning, Electric Vehicles, Energy Consumption Forecasting, Multi-task Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54807 (URN)2-s2.0-85206261149 (Scopus ID)
Conference
2024 Workshop on Embracing Human-Aware AI in Industry 5.0, HAII5.0 2024, Santiago de Compostela, Spain, 19 October, 2024
Note

12 sidor

Available from: 2024-11-06 Created: 2024-11-06 Last updated: 2024-11-06Bibliographically approved
Jamshidi, P., Nowaczyk, S., Rahat, M. & Taghiyarrenani, Z. (2024). Explainable Federated Learning by Incremental Decision Trees. In: Zahraa Abdallah; Fabian Fumagalli; Barbara Hammer; Eyke Hüllermeier; Matthias Jakobs; Emmanuel Müller; Maximilian Muschalik; Panagiotis Papapetrou; Amal Saadallah; George Tzagkarakis (Ed.), Explainable AI for Time Series and Data Streams 2024: Proceedings of the Workshop on Explainable AI for Time Series and Data Streams. Paper presented at 2024 Workshop on Explainable AI for Time Series and Data Streams, TempXAI 2024, Vilnius, Lithuania, 9 September, 2024 (pp. 58-69). Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen, 3761
Open this publication in new window or tab >>Explainable Federated Learning by Incremental Decision Trees
2024 (English)In: Explainable AI for Time Series and Data Streams 2024: Proceedings of the Workshop on Explainable AI for Time Series and Data Streams / [ed] Zahraa Abdallah; Fabian Fumagalli; Barbara Hammer; Eyke Hüllermeier; Matthias Jakobs; Emmanuel Müller; Maximilian Muschalik; Panagiotis Papapetrou; Amal Saadallah; George Tzagkarakis, Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen , 2024, Vol. 3761, p. 58-69Conference paper, Published paper (Refereed)
Abstract [en]

Explainable Artificial Intelligence (XAI) is crucial in ensuring transparency, accountability, and trust in machine learning models, especially in applications involving high-stakes decision-making. This paper focuses on addressing the research gap in federated learning (FL), specifically emphasizing the use of inherently interpretable underlying models. While most FL frameworks rely on complex, black-box models such as Artificial Neural Networks (ANNs), we propose using Decision Tree (DT) classifiers to maintain explainability. More specifically, we introduce a novel framework for horizontal federated learning using Extremely Fast Decision Trees (EFDTs) with streaming data on the client side. Our approach involves aggregating clients' EFDTs on the server side without centralizing raw data, and the training process occurs on the clients' sides. We outline three aggregation strategies and demonstrate that our methods outperform local models and achieve performance levels close to centralized models while retaining inherent explainability. © 2024 CEUR-WS. All rights reserved.

Place, publisher, year, edition, pages
Aachen: Rheinisch-Westfaelische Technische Hochschule Aachen, 2024
Series
CEUR Workshop Proceedings, ISSN 1613-0073 ; 3761
Keywords
Data Stream, eXplainable AI (XAI), Extremely Fast Decision Tree, Federated Learning, Incremental Decision Tree
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-54760 (URN)2-s2.0-85204974127 (Scopus ID)
Conference
2024 Workshop on Explainable AI for Time Series and Data Streams, TempXAI 2024, Vilnius, Lithuania, 9 September, 2024
Available from: 2024-10-29 Created: 2024-10-29 Last updated: 2024-12-04Bibliographically approved
Projects
iMedA: Improving MEDication Adherence through Person Centered Care and Adaptive Interventions [2017-04617_Vinnova]; Halmstad University; Publications
Galozy, A. (2021). Data-driven personalized healthcare: Towards personalized interventions via reinforcement learning for Mobile Health. (Licentiate dissertation). Halmstad: Halmstad University PressGalozy, A., Nowaczyk, S., Pinheiro Sant'Anna, A., Ohlsson, M. & Lingman, M. (2020). Pitfalls of medication adherence approximation through EHR and pharmacy records: Definitions, data and computation. International Journal of Medical Informatics, 136, Article ID 104092. Galozy, A. & Nowaczyk, S. (2020). Prediction and pattern analysis of medication refill adherence through electronic health records and dispensation data. Journal of Biomedical Informatics: X, 6-7, Article ID 100075. Galozy, A., Nowaczyk, S. & Ohlsson, M.Corrupted Contextual Bandits with Action Order Constraints.
eXplainable Predictive Maintenance [2020-00767_VR]; Halmstad University; Publications
Amirhossein, B., Taghiyarrenani, Z. & Nowaczyk, S. (2023). curr2vib: Modality Embedding Translation for Broken-Rotor Bar Detection. In: Irena Koprinska et al. (Ed.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part II. Paper presented at ECML PKDD: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Grenoble, France, September 19–23, 2022 (pp. 423-437). Cham: Springer Nature, 1753Berenji, A., Nowaczyk, S. & Taghiyarrenani, Z. (2023). Data-Centric Perspective on Explainability Versus Performance Trade-Off. In: Bruno Crémilleux, Sibylle Hess, Siegfried Nijssen (Ed.), Advances in Intelligent Data Analysis XXI: 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023, Proceedings. Paper presented at 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023 (pp. 42-54). Cham: Springer, 13876Alabdallah, A., Pashami, S., Rögnvaldsson, T. & Ohlsson, M. (2022). SurvSHAP: A Proxy-Based Algorithm for Explaining Survival Models with SHAP. In: Joshua Zhexue Huang; Yi Pan; Barbara Hammer; Muhammad Khurram Khan; Xing Xie; Laizhong Cui; Yulin He (Ed.), 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA): . Paper presented at The 9th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2022), Shenzhen, China, October 13-16, 2022. Piscataway, NJ: IEEE
Automatic Idea Detection: Implementing artificial intelligence in medical technology innovation (AID); Halmstad UniversityFrom Connected to Sustainable Mobility (FREEDOM) [2021-02548_Vinnova]; Halmstad UniversityAI-driven Automotive Service Market: Towards more Resource-Efficient and Sustainable Vehicle Maintenance [2023-02594_Vinnova]; Halmstad University
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-7796-5201

Search in DiVA

Show all publications