Rios-Urrego Cristian David, Rusz Jan, Orozco-Arroyave Juan Rafael
GITA Lab, Faculty of Engineering, University of Antioquia, Medellín, Colombia.
Department of Circuit Theory, Czech Technical University in Prague, Prague, Czech Republic.
NPJ Digit Med. 2024 Feb 17;7(1):37. doi: 10.1038/s41746-024-01027-6.
Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns-hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET-the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
帕金森病(PD)和特发性震颤(ET)是常见的运动障碍,主要影响老年人,由于临床特征相似,给诊断带来挑战。虽然这两种疾病都表现出不同的言语模式——帕金森病为运动减少型构音障碍,特发性震颤为运动增多型构音障碍,但言语评估在鉴别诊断中的有效性仍未得到探索。开发自动鉴别技术可以实现早期诊断和持续监测。然而,缺乏用于研究这些患者言语行为的数据阻碍了诊断支持框架的发展。此外,不同语言之间的语音变异性给建立通用的言语评估系统带来了实际挑战。因此,有必要开发对全球不同语言中存在的语音变异性具有鲁棒性的模型。我们提出了一种基于高斯混合模型的方法,以评估从在德语和西班牙语中训练的模型到捷克语中对帕金森病和特发性震颤患者进行分类的域适应情况。我们对三种不同的言语维度进行建模:发音、发声和韵律,并在二分类和三分类场景(增加健康对照)中评估了模型的性能。我们的结果表明,在二分类中,三种言语维度的融合取得了最佳结果,在独白和 /pa-ta-ka/ 任务中的准确率分别高达81.4%和86.2%。在三分类场景中,纳入健康语音信号后,独白和 /pa-ta-ka/ 任务的准确率分别为63.3%和71.6%。我们的研究结果表明,结合机器学习的自动语音分析具有鲁棒性、准确性,并且可以适应不同语言以区分帕金森病和特发性震颤患者。