dc.contributor.author | Niemelä, Marko | |
dc.contributor.author | von Bonsdorff, Mikaela | |
dc.contributor.author | Äyrämö, Sami | |
dc.contributor.author | Kärkkäinen, Tommi | |
dc.date.accessioned | 2024-08-16T12:26:19Z | |
dc.date.available | 2024-08-16T12:26:19Z | |
dc.date.issued | 2024 | |
dc.identifier.citation | Niemelä, M., von Bonsdorff, M., Äyrämö, S., & Kärkkäinen, T. (2024). Classification of dementia from spoken speech using feature selection and the bag of acoustic words model. Applied Computing and Intelligence, 4(1), 45-65. https://doi.org/10.3934/aci.2024004 | |
dc.identifier.other | CONVID_233334862 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/96655 | |
dc.description.abstract | Memory disorders and dementia are a central factor in the decline of functioning and daily activities in older individuals. The workload related to standardized speech tests in clinical settings has led to a growing emphasis on developing automatic machine learning techniques for analyzing naturally spoken speech. This study presented a bag of acoustic words approach for distinguishing dementia patients from control individuals based on audio speech recordings. In this approach, each individual's speech was segmented into voiced periods, and these segments were characterized by acoustic features using the open-source openSMILE library. Word histogram representations were formed from the characterized speech segments of each speaker, which were used for classifying subjects. The formation of word histograms involved a clustering phase where feature vectors were quantized. It is well-known that partitional clustering involves instability in clustering results due to the selection of starting points, which can cause variability in classification outcomes. This study aimed to address instability by utilizing robust K-spatial-medians clustering, efficient K-means clustering initialization, and selecting the smallest clustering error from repeated clusterings. Additionally, the study employed feature selection based on the Wilcoxon signed-rank test to achieve computational efficiency in the methods. The results showed that it is possible to achieve a consistent 75% classification accuracy using only twenty-five features, both with the external ADReSS 2020 test data and through leave-one-subject-out cross-validation of the entire dataset. The results rank at the top compared to international research, where the same dataset and only acoustic features have been used to diagnose patients. | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | American Institute of Mathematical Sciences (AIMS) | |
dc.relation.ispartofseries | Applied Computing and Intelligence | |
dc.rights | CC BY 4.0 | |
dc.subject.other | Alzheimer | |
dc.subject.other | classification | |
dc.subject.other | spontaneous speech | |
dc.subject.other | acoustic features | |
dc.subject.other | bag of acoustic words | |
dc.title | Classification of dementia from spoken speech using feature selection and the bag of acoustic words model | |
dc.type | article | |
dc.identifier.urn | URN:NBN:fi:jyu-202408165539 | |
dc.contributor.laitos | Informaatioteknologian tiedekunta | fi |
dc.contributor.laitos | Liikuntatieteellinen tiedekunta | fi |
dc.contributor.laitos | Faculty of Information Technology | en |
dc.contributor.laitos | Faculty of Sport and Health Sciences | en |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | |
dc.type.coar | http://purl.org/coar/resource_type/c_2df8fbb1 | |
dc.description.reviewstatus | peerReviewed | |
dc.format.pagerange | 45-65 | |
dc.relation.issn | 2771-392X | |
dc.relation.numberinseries | 1 | |
dc.relation.volume | 4 | |
dc.type.version | publishedVersion | |
dc.rights.copyright | © 2024 the Authors | |
dc.rights.accesslevel | openAccess | fi |
dc.relation.grantnumber | 349336 | |
dc.subject.yso | ageing | |
dc.subject.yso | memory disorders | |
dc.subject.yso | dementia | |
dc.subject.yso | Alzheimer's disease | |
dc.subject.yso | speech (speaking) | |
dc.subject.yso | older people | |
dc.format.content | fulltext | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p5056 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p22037 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p1711 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p8412 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p2492 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p2433 | |
dc.rights.url | https://creativecommons.org/licenses/by/4.0/ | |
dc.relation.doi | 10.3934/aci.2024004 | |
dc.relation.funder | Research Council of Finland | en |
dc.relation.funder | Suomen Akatemia | fi |
jyx.fundingprogram | Academy Project, AoF | en |
jyx.fundingprogram | Akatemiahanke, SA | fi |
jyx.fundinginformation | The work of the first author (MN) was supported by the Finnish Cultural Foundation (Grant Number 30231766). The work of the second author (MvB) was supported by Samfundet Folkhälsan and the Research Council of Finland (Grant Number 349336). | |
dc.type.okm | A1 | |
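
The abstract above describes quantizing per-segment acoustic feature vectors into a codebook and representing each speaker as a word histogram. The following is a minimal sketch of that idea only, not the authors' implementation. It assumes the feature vectors have already been extracted (the openSMILE step is not shown), and it substitutes plain K-means with random restarts for the K-spatial-medians clustering and the initialization named in the abstract. All function names (`kmeans_once`, `best_codebook`, `word_histogram`) are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vec, codebook):
    """Index of the codeword closest to vec."""
    return min(range(len(codebook)), key=lambda k: sq_dist(vec, codebook[k]))


def kmeans_once(vectors, k, rng, iters=50):
    """One plain K-means run from a random start (a stand-in, see lead-in)."""
    centers = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        labels = [nearest(v, centers) for v in vectors]
        new_centers = []
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    error = sum(sq_dist(v, centers[nearest(v, centers)]) for v in vectors)
    return centers, error


def best_codebook(vectors, k, restarts=10, seed=0):
    """Repeat the clustering and keep the run with the smallest clustering error."""
    rng = random.Random(seed)
    runs = [kmeans_once(vectors, k, rng) for _ in range(restarts)]
    return min(runs, key=lambda run: run[1])[0]


def word_histogram(segments, codebook):
    """Normalized count of how often each codeword is the nearest one."""
    counts = [0] * len(codebook)
    for seg in segments:
        counts[nearest(seg, codebook)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else counts


# Toy 2-D "feature vectors"; real inputs would be per-segment acoustic features.
training = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
codebook = best_codebook(training, k=2)
speaker = [[0.2, 0.1], [0.5, 0.5], [0.9, 0.3], [10.5, 10.2]]
histogram = word_histogram(speaker, codebook)
```

Each speaker's histogram would then serve as the input vector for a classifier; keeping the lowest-error run out of several restarts is the simplest of the stabilizing measures the abstract mentions.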