Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001 Kosice, Slovakia.
Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001 Kosice, Slovakia.
Int J Med Inform. 2023 Nov;179:105237. doi: 10.1016/j.ijmedinf.2023.105237. Epub 2023 Sep 29.
Parkinson's disease is the second-most-common neurodegenerative disorder that affects motor skills, cognitive processes, mood, and everyday tasks such as speaking and walking. The voices of people with Parkinson's disease may become weak, breathy, or hoarse and may sound emotionless, with slurred words and mumbling. Algorithms for computerized voice analysis have been proposed and have shown highly accurate results. However, these algorithms were developed on single, limited datasets, with participants possessing similar demographics. Such models are prone to overfitting and are unsuitable for generalization, which is essential in real-world applications.
We evaluated the computerized Parkinson's disease diagnosis performance of various machine learning models and showed that these models degraded rapidly when used on different datasets. We evaluated two mainstream state-of-the-art approaches, one based on deep convolutional neural networks and another based on voice feature extraction followed by a shallow classifier (i.e., extreme gradient boosting (XGBoost)).
An investigation with four datasets (CzechPD, PC-GITA, ITA, and RMIT-PD) proved that even if the algorithms yielded excellent performance on a single dataset, the results obtained on new data or even a mix of datasets were very unsatisfactory.
More work needs to be done to make computerized voice analysis methods for Parkinson's disease diagnosis suitable for real-world applications.
帕金森病是第二常见的神经退行性疾病,它会影响运动技能、认知过程、情绪以及说话和行走等日常活动。帕金森病患者的声音可能变得微弱、气喘或嘶哑,听起来可能没有感情,言语含糊不清且含糊不清。已经提出了用于计算机语音分析的算法,并取得了非常准确的结果。但是,这些算法是在单一的,有限的数据集上开发的,参与者具有相似的人口统计学特征。这样的模型容易过度拟合,不适合推广,而这在实际应用中是至关重要的。
我们评估了各种机器学习模型在计算机化帕金森氏病诊断中的性能,并表明这些模型在使用不同数据集时会迅速降级。我们评估了两种主流的最先进方法,一种基于深度卷积神经网络,另一种基于语音特征提取和浅层分类器(即极端梯度提升(XGBoost))。
对四个数据集(捷克 PD、PC-GITA、ITA 和 RMIT-PD)的调查表明,即使算法在单个数据集上表现出色,在新数据甚至混合数据集上获得的结果也非常不理想。
需要做更多的工作,使用于帕金森氏病诊断的计算机语音分析方法适用于实际应用。