hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
KAFE: Knowledge and Frequency Adapted Embeddings
Halmstad University, School of Information Technology.ORCID iD: 0000-0001-5688-0156
Halland Hospital, Region Halland, Sweden.
Halmstad University, School of Information Technology.ORCID iD: 0000-0002-7796-5201
2022 (English)In: Machine Learning, Optimization, and Data Science: 7th International Conference, LOD 2021, Grasmere, UK, October 4–8, 2021, Revised Selected Papers, Part II / [ed] Giuseppe Nicosia; Varun Ojha; Emanuele La Malfa; Gabriele La Malfa; Giorgio Jansen; Panos M. Pardalos; Giovanni Giuffrida; Renato Umeton, Cham: Springer, 2022, Vol. 13164, p. 132-146Conference paper, Published paper (Refereed) [Artistic work]
Abstract [en]

Word embeddings are widely used in several Natural Language Processing (NLP) applications. The training process typically involves iterative gradient updates of each word vector. This makes word frequency a major factor in the quality of embedding, and in general the embedding of words with few training occurrences end up being of poor quality. This is problematic since rare and frequent words, albeit semantically similar, might end up far from each other in the embedding space.

In this study, we develop KAFE (Knowledge And Frequency adapted Embeddings) which combines adversarial principles and knowledge graph to efficiently represent both frequent and rare words. The goal of adversarial training in KAFE is to minimize the spatial distinguishability (separability) of frequent and rare words in the embedding space. The knowledge graph encourages the embedding to follow the structure of the domain-specific hierarchy, providing an informative prior that is particularly important for words with low amount of training data. We demonstrate the performance of KAFE in representing clinical diagnoses using real-world Electronic Health Records (EHR) data coupled with a knowledge graph. EHRs are notorious for including ever-increasing numbers of rare concepts that are important to consider when defining the state of the patient for various downstream applications. Our experiments demonstrate better intelligibility through visualisation, as well as higher prediction and stability scores of KAFE over state-of-the-art. © Springer Nature Switzerland AG 2022

Place, publisher, year, edition, pages
Cham: Springer, 2022. Vol. 13164, p. 132-146
Series
Forskning i Halmstad, ISSN 1400-5409
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 13164
Keywords [en]
Word embeddings, Knowledge graphs, Adversarial learning
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:hh:diva-46333DOI: 10.1007/978-3-030-95470-3_10ISI: 000772650800010Scopus ID: 2-s2.0-85125483733ISBN: 978-3-030-95469-7 (print)ISBN: 978-3-030-95470-3 (electronic)OAI: oai:DiVA.org:hh-46333DiVA, id: diva2:1636927
Conference
The 7th International Online & Onsite Conference on Machine Learning, Optimization, and Data Science, Grasmere, Lake District, England, United Kingdom, October 4 – 8, 2021
Available from: 2022-02-11 Created: 2022-02-11 Last updated: 2023-10-05Bibliographically approved
In thesis
1. Deep Evidential Doctor
Open this publication in new window or tab >>Deep Evidential Doctor
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Recent years have witnessed an unparalleled surge in deep neural networks (DNNs) research, surpassing traditional machine learning (ML) and statistical methods on benchmark datasets in computer vision, audio processing and natural language processing (NLP). Much of this success can be attributed to the availability of numerous open-source datasets, advanced computational resources and algorithms. These algorithms learn multiple levels of simple to complex abstractions (or representations) of data resulting in superior performances on downstream applications. This has led to an increasing interest in reaping the potential of DNNs in real-life safety-critical domains such as autonomous driving, security systems and healthcare. Each of them comes with their own set of complexities and requirements, thereby necessitating the development of new approaches to address domain-specific problems, even if building on common foundations.

In this thesis, we address data science related challenges involved in learning effective prediction models from structured electronic health records (EHRs). In particular, questions related to numerical representation of complex and heterogeneous clinical concepts, modelling the sequential structure of EHRs and quantifying prediction uncertainties are studied. From a clinical perspective, the question of predicting onset of adverse outcomes for individual patients is considered to enable early interventions, improve patient outcomes, curb unnecessary expenditures and expand clinical knowledge.

This is a compilation thesis including five articles. It begins by describing a healthcare information platform that encapsulates clinical, operational and financial data of patients across all public care delivery units in Halland, Sweden. Thus, the platform overcomes the technical and legislative data-related challenges inherent to the modern era's complex and fragmented healthcare sector. The thesis presents evidence that expert clinical features are powerful predictors of adverse patient outcomes. However, they are well complemented by clinical concept embeddings; gleaned via NLP inspired language models. In particular, a novel representation learning framework (KAFE: Knowledge And Frequency adapted Embeddings) has been proposed that leverages medical knowledge schema and adversarial principles to learn high quality embeddings of both frequent and rare clinical concepts. In the context of sequential EHR modelling, benchmark experiments on cost-sensitive recurrent nets have shown significant improvements compared to non-sequential networks. In particular, an attention based hierarchical recurrent net is proposed that represents individual patients as weighted sums of ordered visits, where visits are, in turn, represented as weighted sums of unordered clinical concepts. In the context of uncertainty quantification and building trust in models, the field of deep evidential learning has been extended. In particular for multi-label tasks, simple extensions to current neural network architecture are proposed, coupled with a novel loss criterion to infer prediction uncertainties without compromising on accuracy. Moreover, a qualitative assessment of the model behaviour has also been an important part of the research articles, to analyse the correlations learned by the model in relation to established clinical science.

Put together, we develop DEep Evidential Doctor (DEED). DEED is a generic predictive model that learns efficient representations of patients and clinical concepts from EHRs and quantifies its confidence in individual predictions. It is also equipped to infer unseen labels.

Overall, this thesis presents a few small steps towards solving the bigger goal of artificial intelligence (AI) in healthcare. The research has consistently shown impressive prediction performance for multiple adverse outcomes. However, we believe that there are numerous emerging challenges to be addressed in order to reap the full benefits of data and AI in healthcare. For future works, we aim to extend the DEED framework to incorporate wider data modalities such as clinical notes, signals and daily lifestyle information. We will also work to equip DEED with explainability features.

Place, publisher, year, edition, pages
Halmstad: Halmstad University Press, 2022. p. 21
Series
Halmstad University Dissertations ; 88
National Category
Computer Sciences
Identifiers
urn:nbn:se:hh:diva-46347 (URN)978-91-88749-85-7 (ISBN)978-91-88749-86-4 (ISBN)
Public defence
2022-03-15, J102 (Wigforss), Visionen, Kristian IV:s väg 3, Halmstad, 13:00 (English)
Opponent
Supervisors
Available from: 2022-02-15 Created: 2022-02-15 Last updated: 2022-05-12Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ashfaq, AwaisNowaczyk, Sławomir

Search in DiVA

By author/editor
Ashfaq, AwaisNowaczyk, Sławomir
By organisation
School of Information Technology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 211 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf