Suppr超能文献

基于机器学习的帕金森病语音检测方法在跨数据集上的泛化能力。

On the inter-dataset generalization of machine learning approaches to Parkinson's disease detection from voice.

机构信息

Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001 Kosice, Slovakia.

Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001 Kosice, Slovakia.

出版信息

Int J Med Inform. 2023 Nov;179:105237. doi: 10.1016/j.ijmedinf.2023.105237. Epub 2023 Sep 29.

Abstract

BACKGROUND AND OBJECTIVE

Parkinson's disease is the second-most-common neurodegenerative disorder that affects motor skills, cognitive processes, mood, and everyday tasks such as speaking and walking. The voices of people with Parkinson's disease may become weak, breathy, or hoarse and may sound emotionless, with slurred words and mumbling. Algorithms for computerized voice analysis have been proposed and have shown highly accurate results. However, these algorithms were developed on single, limited datasets, with participants possessing similar demographics. Such models are prone to overfitting and are unsuitable for generalization, which is essential in real-world applications.

METHODS

We evaluated the computerized Parkinson's disease diagnosis performance of various machine learning models and showed that these models degraded rapidly when used on different datasets. We evaluated two mainstream state-of-the-art approaches, one based on deep convolutional neural networks and another based on voice feature extraction followed by a shallow classifier (i.e., extreme gradient boosting (XGBoost)).

RESULTS

An investigation with four datasets (CzechPD, PC-GITA, ITA, and RMIT-PD) proved that even if the algorithms yielded excellent performance on a single dataset, the results obtained on new data or even a mix of datasets were very unsatisfactory.

CONCLUSIONS

More work needs to be done to make computerized voice analysis methods for Parkinson's disease diagnosis suitable for real-world applications.

摘要

背景与目的

帕金森病是第二常见的神经退行性疾病,它会影响运动技能、认知过程、情绪以及说话和行走等日常活动。帕金森病患者的声音可能变得微弱、气喘或嘶哑,听起来可能没有感情,言语含糊不清且含糊不清。已经提出了用于计算机语音分析的算法,并取得了非常准确的结果。但是,这些算法是在单一的,有限的数据集上开发的,参与者具有相似的人口统计学特征。这样的模型容易过度拟合,不适合推广,而这在实际应用中是至关重要的。

方法

我们评估了各种机器学习模型在计算机化帕金森氏病诊断中的性能,并表明这些模型在使用不同数据集时会迅速降级。我们评估了两种主流的最先进方法,一种基于深度卷积神经网络,另一种基于语音特征提取和浅层分类器(即极端梯度提升(XGBoost))。

结果

对四个数据集(捷克 PD、PC-GITA、ITA 和 RMIT-PD)的调查表明,即使算法在单个数据集上表现出色,在新数据甚至混合数据集上获得的结果也非常不理想。

结论

需要做更多的工作,使用于帕金森氏病诊断的计算机语音分析方法适用于实际应用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验