Department of Computer Engineering, Bahcesehir University, 34353, Istanbul, Turkey.
Department of Computer and Information Science, University of Konstanz, Konstanz, Germany.
Med Biol Eng Comput. 2020 Nov;58(11):2757-2773. doi: 10.1007/s11517-020-02250-5. Epub 2020 Sep 10.
In recent years, there is an increasing interest in building e-health systems. The systems built to deliver the health services with the use of internet and communication technologies aim to reduce the costs arising from outpatient visits of patients. Some of the related recent studies propose machine learning-based telediagnosis and telemonitoring systems for Parkinson's disease (PD). Motivated from the studies showing the potential of speech disorders in PD telemonitoring systems, in this study, we aim to estimate the severity of PD from voice recordings of the patients using motor Unified Parkinson's Disease Rating Scale (UPDRS) as the evaluation metric. For this purpose, we apply various speech processing algorithms to the voice signals of the patients and then use these features as input to a two-stage estimation model. The first step is to apply a wrapper-based feature selection algorithm, called Boruta, and select the most informative speech features. The second step is to feed the selected set of features to a decision tree-based boosting algorithm, extreme gradient boosting, which has been recently applied successfully in many machine learning tasks due to its generalization ability and speed. The feature selection analysis showed that the vibration pattern of the vocal fold is an important indicator of PD severity. Besides, we also investigate the effectiveness of using age and years passed since diagnosis as covariates together with speech features. The lowest mean absolute error with 3.87 was obtained by combining these covariates and speech features with prediction level fusion. Graphical Abstract Framework for the proposed UPDRS estimation model.
近年来,人们对构建电子健康系统越来越感兴趣。这些系统利用互联网和通信技术提供医疗服务,旨在降低患者门诊就诊的成本。一些相关的最新研究提出了基于机器学习的帕金森病(PD)远程诊断和远程监测系统。受研究表明语音障碍在 PD 远程监测系统中具有潜力的启发,在这项研究中,我们旨在使用运动统一帕金森病评定量表(UPDRS)作为评估指标,通过患者的语音记录来估计 PD 的严重程度。为此,我们将各种语音处理算法应用于患者的语音信号,然后将这些特征作为输入到一个两阶段估计模型中。第一步是应用基于包装器的特征选择算法 Boruta,选择最具信息量的语音特征。第二步是将选定的特征集输入到基于决策树的提升算法极端梯度提升中,由于其泛化能力和速度,该算法最近在许多机器学习任务中得到了成功应用。特征选择分析表明,声带的振动模式是 PD 严重程度的一个重要指标。此外,我们还研究了同时使用年龄和诊断后经过的年数作为协变量与语音特征一起使用的效果。通过将这些协变量与语音特征相结合并采用预测水平融合,获得了最低的平均绝对误差 3.87。