hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
SurvSHAP: A Proxy-Based Algorithm for Explaining Survival Models with SHAP
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).ORCID iD: 0000-0001-9416-5647
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR). RISE Research Institutes of Sweden.ORCID iD: 0000-0003-3272-4145
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR).ORCID iD: 0000-0001-5163-2997
Halmstad University, School of Information Technology, Center for Applied Intelligent Systems Research (CAISR). Lund University, Lund, Sweden.ORCID iD: 0000-0003-1145-4297
2022 (English)In: 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA) / [ed] Joshua Zhexue Huang; Yi Pan; Barbara Hammer; Muhammad Khurram Khan; Xing Xie; Laizhong Cui; Yulin He, Piscataway, NJ: IEEE, 2022Conference paper, Published paper (Refereed)
Abstract [en]

Survival Analysis models usually output functions (survival or hazard functions) rather than point predictions like regression and classification models. This makes the explanations of such models a challenging task, especially using the Shapley values. We propose SurvSHAP, a new model-agnostic algorithm to explain survival models that predict survival curves. The algorithm is based on discovering patterns in the predicted survival curves, the output of the survival model, that would identify significantly different survival behaviors, and utilizing a proxy model and SHAP method to explain these distinct survival behaviors. Experiments on synthetic and real datasets demonstrate that the SurvSHAP is able to capture the underlying factors of the survival patterns. Moreover, SurvSHAP results on the Cox Proportional Hazard model are compared with the weights of the model to show that we provide faithful overall explanations, with more fine-grained explanations of the sub-populations. We also illustrate the wrong model and explanations learned by a Cox model when applied to heterogeneous sub-populations. We show that a non-linear machine learning survival model with SurvSHAP can better model the data and provide better explanations than linear models.

Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2022.
Keywords [en]
SurvSHAP, Explainable AI, Survival Patterns, SHAP, Shapley values, Proxy Model, Survival Analysis, Machine Learning
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-49149DOI: 10.1109/DSAA54385.2022.10032392ISI: 000967751000099Scopus ID: 2-s2.0-85148538187ISBN: 978-1-6654-7330-9 (electronic)ISBN: 978-1-6654-7331-6 (print)OAI: oai:DiVA.org:hh-49149DiVA, id: diva2:1736098
Conference
The 9th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2022), Shenzhen, China, October 13-16, 2022
Part of project
eXplainable Predictive Maintenance, Swedish Research Council
Funder
Knowledge Foundation
Note

Funding: This research was funded by the CHIST-ERA grant CHIST-ERA-19-XAI-012 and CAISR+ project funded by the Swedish Knowledge Foundation.

Som manuscript i avhandling/As manuscript in thesis.

Available from: 2023-02-10 Created: 2023-02-10 Last updated: 2025-01-09Bibliographically approved
In thesis
1. Machine Learning Survival Models: Performance and Explainability
Open this publication in new window or tab >>Machine Learning Survival Models: Performance and Explainability
2023 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Survival analysis is an essential statistics and machine learning field in various critical applications like medical research and predictive maintenance. In these domains understanding models' predictions is paramount. While machine learning techniques are increasingly applied to enhance the predictive performance of survival models, they simultaneously sacrifice transparency and explainability. 

Survival models, in contrast to regular machine learning models, predict functions rather than point estimates like regression and classification models. This creates a challenge regarding explaining such models using the known off-the-shelf machine learning explanation techniques, like Shapley Values, Counterfactual examples, and others.   

Censoring is also a major issue in survival analysis where the target time variable is not fully observed for all subjects. Moreover, in predictive maintenance settings, recorded events do not always map to actual failures, where some components could be replaced because it is considered faulty or about to fail in the future based on an expert's opinion. Censoring and noisy labels create problems in terms of modeling and evaluation that require to be addressed during the development and evaluation of the survival models.

Considering the challenges in survival modeling and the differences from regular machine learning models, this thesis aims to bridge this gap by facilitating the use of machine learning explanation methods to produce plausible and actionable explanations for survival models. It also aims to enhance survival modeling and evaluation revealing a better insight into the differences among the compared survival models.

In this thesis, we propose two methods for explaining survival models which rely on discovering survival patterns in the model's predictions that group the studied subjects into significantly different survival groups. Each pattern reflects a specific survival behavior common to all the subjects in their respective group. We utilize these patterns to explain the predictions of the studied model in two ways. In the first, we employ a classification proxy model that can capture the relationship between the descriptive features of subjects and the learned survival patterns. Explaining such a proxy model using Shapley Values provides insights into the feature attribution of belonging to a specific survival pattern. In the second method, we addressed the "what if?" question by generating plausible and actionable counterfactual examples that would change the predicted pattern of the studied subject. Such counterfactual examples provide insights into actionable changes required to enhance the survivability of subjects.

We also propose a variational-inference-based generative model for estimating the time-to-event distribution. The model relies on a regression-based loss function with the ability to handle censored cases. It also relies on sampling for estimating the conditional probability of event times. Moreover, we propose a decomposition of the C-index into a weighted harmonic average of two quantities, the concordance among the observed events and the concordance between observed and censored cases. These two quantities, weighted by a factor representing the balance between the two, can reveal differences between survival models previously unseen using only the total Concordance index. This can give insight into the performances of different models and their relation to the characteristics of the studied data.

Finally, as part of enhancing survival modeling, we propose an algorithm that can correct erroneous event labels in predictive maintenance time-to-event data. we adopt an expectation-maximization-like approach utilizing a genetic algorithm to find better labels that would maximize the survival model's performance. Over iteration, the algorithm builds confidence about events' assignments which improves the search in the following iterations until convergence.

We performed experiments on real and synthetic data showing that our proposed methods enhance the performance in survival modeling and can reveal the underlying factors contributing to the explainability of survival models' behavior and performance.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2023. p. 25
Series
Halmstad University Dissertations ; 108
Keywords
Survival Analysis, Explainable Artificial Intelligence, Survival Patterns, Counterfactual Explanations, Evaluation Metrics, Concordance Index
National Category
Signal Processing
Identifiers
urn:nbn:se:hh:diva-52269 (URN)978-91-89587-30-4 (ISBN)978-91-89587-29-8 (ISBN)
Presentation
2024-01-18, Wigforss, Hus J, Kristan IV:s väg 3, Halmstad, 09:00 (English)
Opponent
Supervisors
Available from: 2023-12-19 Created: 2023-12-18 Last updated: 2024-02-01Bibliographically approved
2. Towards Trustworthy Survival Analysis with Machine Learning Models
Open this publication in new window or tab >>Towards Trustworthy Survival Analysis with Machine Learning Models
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Survival Analysis is a major sub-field of statistics that studies the time to an event, like a patient's death or a machine's failure. This makes survival analysis crucial in critical applications like medical studies and predictive maintenance. In such applications, safety is critical creating a demand for trustworthy models. Machine learning and deep learning techniques started to be used, spurred by the growing volume of collected data. While this direction holds promise for improving certain qualities, such as model performance, it also introduces new challenges in other areas, particularly model explainability. This challenge is general in machine learning due to the black-box nature of most machine learning models, especially deep neural networks (DNN). However, survival models usually output functions rather than point estimates like regression and classification models which makes their explainability even more challenging task. 

Other challenges also exist due to the nature of time-to-event data, such as censoring. This phenomenon happens due to several reasons, most commonly due to the limited study time, resulting in a considerable number of studied subjects not experiencing the event during the study. Moreover, in industrial settings, recorded events do not always correspond to actual failures. This is because companies tend to replace machine parts before their failure due to safety or cost considerations resulting in noisy event labels. Censoring and noisy labels create a challenge in building and evaluating survival models.    

This thesis addresses these challenges by following two tracks, one focusing on explainability and the other on improving performance. The two tracks eventually merge providing an explainable survival model while maintaining the performance of its black-box counterpart.

In the explainability track, we propose two post-hoc explanation methods based on what we define as Survival Patterns. These are patterns in the predictions of the survival model that represent distinct survival behaviors in the studied population. We propose an algorithm for discovering the survival patterns upon which the two post-hoc explanation methods rely. The first method, SurvSHAP, utilizes a proxy classification model that learns the relationship between the input space and the discovered survival patterns. The proxy model is then explained using the SHAP method resulting in per-pattern explanations. The second post-hoc method relies on finding counterfactual explanations that would change the decision of the survival model from one source survival pattern to another. The algorithm uses Particle Swarm Optimization (PSO) with a tailored objective function to guarantee certain explanation qualities in plausibility and actionability.

On the performance track, we propose a Variational Encoder-Decoder model for estimating the survival function using a sampling-based approach. The model is trained using a regression-based objective function that accounts for censored instances assisted with a differentiable lower bound of the concordance index (C-index). In the same work, we propose a decomposition of the C-index where we found out that it can be expressed as a weighted harmonic average of two quantities; one quantifies the concordance among the observed event cases and the other quantifies the concordance between observed events and censored cases. The two quantities are weighted by a factor that balances the contribution of event and censored cases to the total C-index. Such decomposition uncovers hidden differences among survival models that seem equivalent based on the C-index. We also used genetic programming to search for a regression-based loss function for survival analysis with an improved concordance ability. The search results uncovered an interesting phenomenon, upon which we propose the use of the continuously differentiable Softplus function instead of the sharp-cut Relu function for handling censored cases. Lastly in the performance track, we propose an algorithm for correcting erroneous observed event labels that can be caused by preventive maintenance activities. The algorithm adopts an iterative expectation-maximization-like approach utilizing a genetic algorithm to search for better event labels that can maximize a surrogate survival model's performance.

Finally, the two tracks merge and we propose CoxSE a Cox-based deep neural network model that provides inherent explanations while maintaining the performance of its black-box counterpart. The model relies on the Self-Explaining Neural Networks (SENN) and the Cox Proportional Hazard formulation. We also propose CoxSENAM, an enhancement to the Neural Additive Model (NAM) by adopting the NAM structure along with the SENN loss function and type of output. The CoxSENAM model demonstrated better explanations than the NAM-based model with enhanced robustness to noise.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2025. p. 29
Series
Halmstad University Dissertations ; 128
National Category
Computer Sciences Information Systems
Identifiers
urn:nbn:se:hh:diva-55202 (URN)978-91-89587-72-4 (ISBN)978-91-89587-73-1 (ISBN)
Public defence
2025-01-31, S3030, Högskolan i Halmstad, Kristian IV:s väg 3, Halmstad, 09:00 (English)
Opponent
Supervisors
Available from: 2025-01-10 Created: 2025-01-08 Last updated: 2025-01-10Bibliographically approved
3.
The record could not be found. The reason may be that the record is no longer available or you may have typed in a wrong id in the address field.

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Alabdallah, AbdallahPashami, SepidehRögnvaldsson, ThorsteinnOhlsson, Mattias

Search in DiVA

By author/editor
Alabdallah, AbdallahPashami, SepidehRögnvaldsson, ThorsteinnOhlsson, Mattias
By organisation
Center for Applied Intelligent Systems Research (CAISR)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 2179 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf