hh.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Fusing Various Audio Feature Sets for Detection of Parkinson’s Disease from Sustained Voice and Speech Recordings
Kaunas University of Technology, Kaunas, Lithuania.
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), CAISR - Center for Applied Intelligent Systems Research. Kaunas University of Technology, Kaunas, Lithuania.ORCID iD: 0000-0003-2185-8973
Kaunas University of Technology, Kaunas, Lithuania.
Kaunas University of Technology, Kaunas, Lithuania.
Show others and affiliations
2016 (English)In: Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349, Vol. 9811, p. 328-337Article in journal (Refereed) Published
Abstract [en]

The aim of this study is the analysis of voice and speech recordings for the task of Parkinson’s disease detection. Voice modality corresponds to sustained phonation /a/ and speech modality to a short sentence in Lithuanian language. Diverse information from recordings is extracted by 22 well-known audio feature sets. Random forest is used as a learner, both for individual feature sets and for decision-level fusion. Essentia descriptors were found as the best individual feature set, achieving equal error rate of 16.3 % for voice and 13.3 % for speech. Fusion of feature sets and modalities improved detection and achieved equal error rate of 10.8 %. Variable importance in fusion revealed speech modality as more important than voice. © Springer International Publishing Switzerland 2016

Place, publisher, year, edition, pages
Heidelberg: Springer Berlin/Heidelberg, 2016. Vol. 9811, p. 328-337
Keywords [en]
Parkinson’s disease, Audio signal processing, OpenSMILE, Essentia, MPEG-7, jAudio, YAAFE, Random forest, Information fusion
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:hh:diva-31872DOI: 10.1007/978-3-319-43958-7_39ISI: 000389335600039Scopus ID: 2-s2.0-84984851988OAI: oai:DiVA.org:hh-31872DiVA, id: diva2:955931
Conference
18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016
Note

Funding: Research Council of Lithuania (No. MIP-075/2015)

Available from: 2016-08-27 Created: 2016-08-27 Last updated: 2018-01-10Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Verikas, Antanas

Search in DiVA

By author/editor
Verikas, Antanas
By organisation
CAISR - Center for Applied Intelligent Systems Research
In the same journal
Lecture Notes in Computer Science
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 743 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf