Gosztolya Gabor, Svindt Veronika, Bona Judit, Hoffmann Ildiko
IEEE Trans Neural Syst Rehabil Eng. 2023;31:3234-3244. doi: 10.1109/TNSRE.2023.3300532. Epub 2023 Aug 14.
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system which, in addition to affecting motor and cognitive functions, may also lead to specific changes in the speech of patients. Speech production, comprehension, repetition and naming tasks, as well as structural and content changes in narratives, might indicate a limitation of executive functions. In this study we present a speech-based machine learning technique to distinguish speakers with relapsing-remitting subtype MS and healthy controls (HC). We exploit the fact that MS might cause a motor speech disorder similar to dysarthria, which, with our hypothesis, might affect the phonetic posterior estimates supplied by a Deep Neural Network acoustic model. From our experimental results, the proposed posterior posteriorgram-based feature extraction approach is useful for detecting MS: depending on the actual speech task, we obtained Equal Error Rate values as low as 13.3%, and AUC scores up to 0.891, indicating a competitive and more consistent classification performance compared to both the x-vector and the openSMILE 'ComParE functionals' attributes. Besides this discrimination performance, the interpretable nature of the phonetic posterior features might also make our method suitable for automatic MS screening or monitoring the progression of the disease. Furthermore, by examining which specific phonetic groups are the most useful for this feature extraction process, the potential utility of the proposed phonetic features could also be utilized in the speech therapy of MS patients.
多发性硬化症(MS)是一种中枢神经系统的慢性炎症性疾病,除了影响运动和认知功能外,还可能导致患者言语出现特定变化。言语产生、理解、重复和命名任务,以及叙述中的结构和内容变化,可能表明执行功能存在局限性。在本研究中,我们提出了一种基于语音的机器学习技术,以区分复发缓解型多发性硬化症患者和健康对照者(HC)。我们利用多发性硬化症可能导致类似于构音障碍的运动性言语障碍这一事实,根据我们的假设,这可能会影响深度神经网络声学模型提供的语音后验估计。从我们的实验结果来看,所提出的基于后验图的特征提取方法对于检测多发性硬化症很有用:根据实际的语音任务,我们获得了低至13.3%的等错误率值和高达0.891的AUC分数,表明与x向量和openSMILE的“ComParE功能”属性相比,具有竞争性且更一致的分类性能。除了这种区分性能外,语音后验特征的可解释性也可能使我们的方法适用于自动多发性硬化症筛查或监测疾病进展。此外,通过检查哪些特定的语音组对该特征提取过程最有用,所提出的语音特征的潜在效用也可用于多发性硬化症患者的言语治疗。