On the inter-dataset generalization of machine learning approaches to Parkinson's disease detection from voice.

Affiliation

Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001 Kosice, Slovakia.

Publication information

Int J Med Inform. 2023 Nov;179:105237. doi: 10.1016/j.ijmedinf.2023.105237. Epub 2023 Sep 29.

DOI:10.1016/j.ijmedinf.2023.105237

PMID:37801807

Abstract

BACKGROUND AND OBJECTIVE

Parkinson's disease is the second most common neurodegenerative disorder; it affects motor skills, cognitive processes, mood, and everyday tasks such as speaking and walking. The voices of people with Parkinson's disease may become weak, breathy, or hoarse and may sound emotionless, with slurred words and mumbling. Algorithms for computerized voice analysis have been proposed and have shown highly accurate results. However, these algorithms were developed on single, limited datasets whose participants had similar demographics. Such models are prone to overfitting and generalize poorly, yet generalization is essential in real-world applications.

METHODS

We evaluated the computerized Parkinson's disease diagnosis performance of various machine learning models and showed that their performance degraded rapidly when they were applied to datasets other than the one used for training. We evaluated two mainstream state-of-the-art approaches: one based on deep convolutional neural networks, and another based on voice feature extraction followed by a shallow classifier, specifically extreme gradient boosting (XGBoost).
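The cross-dataset protocol described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic, the corpus names `corpus_A` and `corpus_B` are hypothetical, and a simple nearest-centroid classifier stands in for the CNN and XGBoost models so that the example needs only the Python standard library. The `shift` parameter is an assumed stand-in for dataset-specific recording conditions.

```python
import random

def make_dataset(n, shift, seed):
    """Synthetic stand-in for one corpus: a list of (features, label) pairs.
    `shift` offsets every feature to mimic corpus-specific conditions."""
    rng = random.Random(seed)
    data = []
    for i in range(n):
        label = i % 2  # 1 = patient, 0 = control (balanced by construction)
        # The class signal lives on feature 0; feature 1 is pure noise.
        x = [rng.gauss(2.0 * label + shift, 1.0), rng.gauss(shift, 1.0)]
        data.append((x, label))
    return data

def split(data, frac=0.7):
    """Deterministic train/test split; labels alternate, so both parts stay balanced."""
    k = int(len(data) * frac)
    return data[:k], data[k:]

def fit_centroids(data):
    """Nearest-centroid 'training': compute the mean feature vector per class."""
    sums = {0: [0.0, 0.0], 1: [0.0, 0.0]}
    counts = {0: 0, 1: 0}
    for x, y in data:
        counts[y] += 1
        for j, value in enumerate(x):
            sums[y][j] += value
    return {y: [s / counts[y] for s in sums[y]] for y in sums}

def predict(centroids, x):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: dist2(centroids[y]))

def accuracy(centroids, data):
    return sum(predict(centroids, x) == y for x, y in data) / len(data)

def cross_dataset_matrix(datasets):
    """Train on each corpus's training split and test on every corpus's held-out split."""
    splits = {name: split(data) for name, data in datasets.items()}
    results = {}
    for train_name, (train, _) in splits.items():
        model = fit_centroids(train)
        for test_name, (_, test) in splits.items():
            results[(train_name, test_name)] = accuracy(model, test)
    return results

datasets = {
    "corpus_A": make_dataset(400, shift=0.0, seed=0),
    "corpus_B": make_dataset(400, shift=3.0, seed=1),
}
results = cross_dataset_matrix(datasets)
for (train_name, test_name), acc in sorted(results.items()):
    print(f"train={train_name} test={test_name} accuracy={acc:.2f}")
```

In this toy setup the diagonal entries (same corpus for training and testing) should be clearly higher than the off-diagonal ones, because the decision boundary learned on one corpus sits in the wrong place for the shifted one. The same matrix layout would apply to real corpora and real classifiers, with the held-out split kept disjoint from training at the speaker level.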

RESULTS

An investigation with four datasets (CzechPD, PC-GITA, ITA, and RMIT-PD) showed that even when the algorithms yielded excellent performance on a single dataset, the results obtained on new data, or even on a mix of datasets, were very unsatisfactory.

CONCLUSIONS

More work needs to be done to make computerized voice analysis methods for Parkinson's disease diagnosis suitable for real-world applications.

Similar articles

Gradient boosting for Parkinson's disease diagnosis from voice recordings.

BMC Med Inform Decis Mak. 2020 Sep 15;20(1):228. doi: 10.1186/s12911-020-01250-7.

EEG-based emotion charting for Parkinson's disease patients using Convolutional Recurrent Neural Networks and cross dataset learning.

Comput Biol Med. 2022 May;144:105327. doi: 10.1016/j.compbiomed.2022.105327. Epub 2022 Mar 11.

Convolutional neural network ensemble for Parkinson's disease detection from voice recordings.

Comput Biol Med. 2022 Feb;141:105021. doi: 10.1016/j.compbiomed.2021.105021. Epub 2021 Nov 9.

Late feature fusion using neural network with voting classifier for Parkinson's disease detection.

BMC Med Inform Decis Mak. 2024 Sep 27;24(1):269. doi: 10.1186/s12911-024-02683-0.

Diagnosis of Parkinson's disease based on voice signals using SHAP and hard voting ensemble method.

Comput Methods Biomech Biomed Engin. 2024 Oct;27(13):1858-1874. doi: 10.1080/10255842.2023.2263125. Epub 2023 Sep 28.

Automated Parkinson's disease recognition based on statistical pooling method using acoustic features.

Med Hypotheses. 2020 Feb;135:109483. doi: 10.1016/j.mehy.2019.109483. Epub 2019 Nov 11.

Phonemes based detection of parkinson's disease for telehealth applications.

Sci Rep. 2022 Jun 11;12(1):9687. doi: 10.1038/s41598-022-13865-z.

A mobile-assisted voice condition analysis system for Parkinson's disease: assessment of usability conditions.

Biomed Eng Online. 2021 Nov 21;20(1):114. doi: 10.1186/s12938-021-00951-y.

Deep Learning Approach to Parkinson's Disease Detection Using Voice Recordings and Convolutional Neural Network Dedicated to Image Classification.

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:717-720. doi: 10.1109/EMBC.2019.8856972.

Cited by

Motor symptoms of Parkinson's disease: critical markers for early AI-assisted diagnosis.

Front Aging Neurosci. 2025 Jul 18;17:1602426. doi: 10.3389/fnagi.2025.1602426. eCollection 2025.

Speech-Based Parkinson's Detection Using Pre-Trained Self-Supervised Automatic Speech Recognition (ASR) Models and Supervised Contrastive Learning.

Bioengineering (Basel). 2025 Jul 1;12(7):728. doi: 10.3390/bioengineering12070728.

Listening to the Mind: Integrating Vocal Biomarkers into Digital Health.

Brain Sci. 2025 Jul 18;15(7):762. doi: 10.3390/brainsci15070762.

Machine learning for Parkinson's disease: a comprehensive review of datasets, algorithms, and challenges.

NPJ Parkinsons Dis. 2025 Jul 1;11(1):187. doi: 10.1038/s41531-025-01025-9.

Ranking pre-trained speech embeddings in Parkinson's disease detection: Does Wav2Vec 2.0 outperform its 1.0 version across speech modes and languages?

Comput Struct Biotechnol J. 2025 Jun 7;27:2584-2601. doi: 10.1016/j.csbj.2025.06.022. eCollection 2025.

Analyzing Wav2Vec 1.0 Embeddings for Cross-Database Parkinson's Disease Detection and Speech Features Extraction.

Sensors (Basel). 2024 Aug 26;24(17):5520. doi: 10.3390/s24175520.

Leveraging Deep Learning for Fine-Grained Categorization of Parkinson's Disease Progression Levels through Analysis of Vocal Acoustic Patterns.

Bioengineering (Basel). 2024 Mar 21;11(3):295. doi: 10.3390/bioengineering11030295.
