hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Visual Transformers for 3D Medical Images Classification: Use-Case Neurodegenerative Disorders
Halmstad University, School of Information Technology.
2022 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

A Neurodegenerative Disease (ND) is progressive damage to brain neurons, which the human body cannot repair or replace. The well-known examples of such conditions are Dementia and Alzheimer’s Disease (AD), which affect millions of lives each year. Although conducting numerous researches, there are no effective treatments for the mentioned diseases today. However, early diagnosis is crucial in disease management.

Diagnosing NDs is challenging for neurologists and requires years of training and experience. So, there has been a trend to harness the power of deep learning, including state-of-the-art Convolutional Neural Network (CNN), to assist doctors in diagnosing such conditions using brain scans. The CNN models lead to promising results comparable to experienced neurologists in their diagnosis. But, the advent of transformers in the Natural Language Processing (NLP) domain and their outstanding performance persuaded Computer Vision (CV) researchers to adapt them to solve various CV tasks in multiple areas, including the medical field.

This research aims to develop Vision Transformer (ViT) models using Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset to classify NDs. More specifically, the models can classify three categories (Cognitively Normal (CN), Mild Cognitive Impairment (MCI), Alzheimer’s Disease (AD)) using brain Fluorodeoxyglucose (18F-FDG) Positron Emission Tomography (PET) scans. Also, we take advantage of Automated Anatomical Labeling (AAL) brain atlas and attention maps to develop explainable models.

We propose three ViTs, the best of which obtains an accuracy of 82% on the test dataset with the help of transfer learning. Also, we encode the AAL brain atlas information into the best performing ViT, so the model outputs the predicted label, the most critical region in its prediction, and overlaid attention map on the input scan with the crucial areas highlighted. Furthermore, we develop two CNN models with 2D and 3D convolutional kernels as baselines to classify NDs, which achieve accuracy of 77% and 73%, respectively, on the test dataset.

We also conduct a study to find out the importance of brain regions and their combinations in classifying NDs using ViTs and the AAL brain atlas.

Place, publisher, year, edition, pages
2022. , p. 68
Keywords [en]
Artificial intelligence, Explainable AI, Machine learning, Deep learning, Computer vision, Vision Transformer, Visual Transformer, ViT, Convolutional Neural Network, CNN, Neurodegenerative disorder, Mild cognitive impairment, Alzheimer's disease, Fluorodeoxyglucose, 18F-FDG, Positron Emission Tomography, PET, Brain, Brain scan
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:hh:diva-47250OAI: oai:DiVA.org:hh-47250DiVA, id: diva2:1673452
Subject / course
Computer science and engineering
Educational program
Master's Programme in Embedded and Intelligent Systems, 120 credits
Presentation
2022-06-02, Halmstad University, Halmstad, 09:45
Supervisors
Examiners
Note

This thesis was awarded a prize of 50,000 SEK by Getinge Sterilization for projects within Health Innovation.

Available from: 2022-06-21 Created: 2022-06-21 Last updated: 2022-06-23Bibliographically approved

Open Access in DiVA

fulltext(17177 kB)1130 downloads
File information
File name FULLTEXT02.pdfFile size 17177 kBChecksum SHA-512
04300f2f3c807a2ab4af72fa48b0d401fc7c64e446826d67d4fa5908e9c453a46cf36cde0bec3a44432c8da517ac624a2aef85f5cbc09e16f7092b106771dd06
Type fulltextMimetype application/pdf

By organisation
School of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1132 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 3098 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf