hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Detecting Parkinson's disease from sustained phonation and speech signals
Kaunas University of Technology, Kaunas, Lithuania.
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research. Kaunas University of Technology, Kaunas, Lithuania.ORCID iD: 0000-0003-2185-8973
Kaunas University of Technology, Kaunas, Lithuania.
Kaunas University of Technology, Kaunas, Lithuania.
2017 (English)In: PLOS ONE, E-ISSN 1932-6203, Vol. 12, no 10, article id e0185613Article in journal (Refereed) Published
Abstract [en]

This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels simultaneously, namely, acoustic cardioid (AC) and smart phone (SP) microphones. Additional modalities were obtained by splitting speech recording into voiced and unvoiced parts. Information in each modality is summarized by 18 well-known audio feature sets. Random forest (RF) is used as a machine learning algorithm, both for individual feature sets and for decision-level fusion. Detection performance is measured by the out-of-bag equal error rate (EER) and the cost of log-likelihood-ratio. Essentia audio feature set was the best using the AC speech modality and YAAFE audio feature set was the best using the SP unvoiced modality, achieving EER of 20.30% and 25.57%, respectively. Fusion of all feature sets and modalities resulted in EER of 19.27% for the AC and 23.00% for the SP channel. Non-linear projection of a RF-based proximity matrix into the 2D space enriched medical decision support by visualization. © 2017 Vaiciukynas et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Place, publisher, year, edition, pages
San Francisco, CA: Public Library of Science , 2017. Vol. 12, no 10, article id e0185613
Keywords [en]
Speech analysis, Pathology detection, Parkinson's disease
National Category
Signal Processing
Identifiers
URN: urn:nbn:se:hh:diva-35229DOI: 10.1371/journal.pone.0185613ISI: 000412360300047PubMedID: 28982171Scopus ID: 2-s2.0-85030766664OAI: oai:DiVA.org:hh-35229DiVA, id: diva2:1150426
Note

Funding: The Research Council of Lithuania (No. MIP-075/2015)

Available from: 2017-10-19 Created: 2017-10-19 Last updated: 2021-06-14Bibliographically approved

Open Access in DiVA

fulltext(3884 kB)372 downloads
File information
File name FULLTEXT01.pdfFile size 3884 kBChecksum SHA-512
339218341561640d8dc96ecce66de0992760c281ec4e6a5827e00be3439e2ebd65e6bf7e175ef827c87839f5db0f3b645ba8088c728d84f6eb96ddc30f02a8b2
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Verikas, Antanas

Search in DiVA

By author/editor
Verikas, Antanas
By organisation
CAISR - Center for Applied Intelligent Systems Research
In the same journal
PLOS ONE
Signal Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 372 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 369 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf