hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Synthetic electronic health records generated with variational graph autoencoders
Ecole Polytechnique, Palaiseau, France.ORCID iD: 0000-0002-0336-5879
Ecole Polytechnique, Palaiseau, France; Royal Institute of Technology, Stockholm, Sweden.
Ecole Polytechnique, Palaiseau, France.
Halmstad University, School of Information Technology. Sahlgrenska Academy, Gothenburg, Sweden.
Show others and affiliations
2023 (English)In: npj Digital Medicine, E-ISSN 2398-6352, Vol. 6, no 1, article id 83Article in journal (Refereed) Published
Abstract [en]

Data-driven medical care delivery must always respect patient privacy—a requirement that is not easily met. This issue has impeded improvements to healthcare software and has delayed the long-predicted prevalence of artificial intelligence in healthcare. Until now, it has been very difficult to share data between healthcare organizations, resulting in poor statistical models due to unrepresentative patient cohorts. Synthetic data, i.e., artificial but realistic electronic health records, could overcome the drought that is troubling the healthcare sector. Deep neural network architectures, in particular, have shown an incredible ability to learn from complex data sets and generate large amounts of unseen data points with the same statistical properties as the training data. Here, we present a generative neural network model that can create synthetic health records with realistic timelines. These clinical trajectories are generated on a per-patient basis and are represented as linear-sequence graphs of clinical events over time. We use a variational graph autoencoder (VGAE) to generate synthetic samples from real-world electronic health records. Our approach generates health records not seen in the training data. We show that these artificial patient trajectories are realistic and preserve patient privacy and can therefore support the safe sharing of data across organizations. © 2023, The Author(s).

Place, publisher, year, edition, pages
London: Nature Publishing Group, 2023. Vol. 6, no 1, article id 83
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hh:diva-51399DOI: 10.1038/s41746-023-00822-xISI: 000978596300002Scopus ID: 2-s2.0-85156276935OAI: oai:DiVA.org:hh-51399DiVA, id: diva2:1788035
Funder
Swedish Research Council, 2019-00198Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Funding: M.V. is partially supported by the “Wallenberg AI, Autonomous Systems and Software Program” (WASP). M.L. is partially supported by AIR Lund (Artificially Intelligent use of Registers at Lund University) research environment and received funding from the Swedish Research Council (VR; grant No. 2019-00198). G.N. is supported by the French National research agency via the AML-HELAS (ANR-19-CHIA-0020) project.

Available from: 2023-08-15 Created: 2023-08-15 Last updated: 2023-08-16Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Lingman, Markus

Search in DiVA

By author/editor
Nikolentzos, GiannisLingman, Markus
By organisation
School of Information Technology
In the same journal
npj Digital Medicine
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 22 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf