Department of Translation and Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain.
Laboratoire de Linguistique Formelle (LLF), CNRS, Université Paris Cité, France.
Am J Speech Lang Pathol. 2023 Sep 11;32(5):2075-2086. doi: 10.1044/2023_AJSLP-22-00403. Epub 2023 Jul 24.
Decline in language has emerged as a new potential biomarker for the early detection of Alzheimer's disease (AD). It remains unclear how sensitive language measures are across different tasks, language domains, and languages, and to what extent changes can be reliably detected in early stages such as subjective cognitive decline (SCD) and mild cognitive impairment (MCI).
Using a scene construction task for speech elicitation in a new Spanish/Catalan speaking cohort ( = 119), we automatically extracted features across seven domains, three acoustic (spectral, cepstral, and voice quality), one prosodic, and three from text (morpholexical, semantic, and syntactic). They were forwarded to a random forest classifier to evaluate the discriminability of participants with probable AD dementia, amnestic and nonamnestic MCI, SCD, and cognitively healthy controls. Repeated-measures analyses of variance and paired-samples Wilcoxon signed-ranks test were used to assess whether and how performance differs significantly across groups and linguistic domains.
The performance scores of the machine learning classifier were generally satisfactorily high, with the highest scores over .9. Model performance was significantly different for linguistic domains ( < .001), and speech versus text ( = .043), with speech features outperforming textual features, and voice quality performing best. High diagnostic classification accuracies were seen even within both cognitively healthy (controls vs. SCD) and MCI (amnestic and nonamnestic) groups.
Speech-based machine learning is powerful in detecting cognitive decline and probable AD dementia across a range of different feature domains, though important differences exist between these domains as well.
语言能力下降已成为阿尔茨海默病(AD)早期检测的一个新的潜在生物标志物。目前尚不清楚不同任务、语言领域和语言中语言测量的敏感性如何,以及在主观认知下降(SCD)和轻度认知障碍(MCI)等早期阶段,变化可以在多大程度上被可靠地检测到。
我们在一个新的讲西班牙语/加泰罗尼亚语的队列中使用言语诱发的场景构建任务(n=119),自动提取了七个领域的特征,包括三个声学(光谱、倒谱和语音质量)、一个韵律和三个文本特征(形态、语义和句法)。这些特征被转发给随机森林分类器,以评估可能患有 AD 痴呆、遗忘型和非遗忘型 MCI、SCD 以及认知健康对照者的参与者的可区分性。重复测量方差分析和配对样本 Wilcoxon 符号秩检验用于评估性能是否以及如何在组和语言领域之间显著不同。
机器学习分类器的性能评分通常相当高,最高评分超过.9。语言领域的模型性能差异显著(<.001),言语与文本之间存在差异(=.043),言语特征的性能优于文本特征,而语音质量的表现最佳。即使在认知健康(对照与 SCD)和 MCI(遗忘型和非遗忘型)组内,也可以看到较高的诊断分类准确率。
基于语音的机器学习在检测认知能力下降和可能的 AD 痴呆方面具有强大的能力,尽管这些领域之间存在着重要的差异。