Wang Rumi, Kuang Chen, Guo Chengyu, Chen Yong, Li Canyang, Matsumura Yoshihiro, Ishimaru Masashi, Van Pelt Alice J, Chen Fei
Rehabilitation Medicine Department, Speech and Language Pathology Therapy Section, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China.
School of Foreign Languages, Hunan University, Hunan, China.
J Alzheimers Dis. 2023;95(3):901-914. doi: 10.3233/JAD-230373.
To date, the reliable detection of mild cognitive impairment (MCI) remains a significant challenge for clinicians. Very few studies investigated the sensitivity of acoustic features in detecting Mandarin-speaking elders at risk for MCI, defined as "putative MCI" (pMCI).
This study sought to investigate the possibility of using automatically extracted speech acoustic features to detect elderly people with pMCI and reveal the potential acoustic markers of cognitive decline at an early stage.
Forty-one older adults with pMCI and 41 healthy elderly controls completed four reading tasks (syllable utterance, tongue twister, diadochokinesis, and short sentence reading), from which acoustic features were extracted automatically to train machine learning classifiers. Correlation analysis was employed to evaluate the relationship between classifier predictions and participants' cognitive ability measured by Mini-Mental State Examination 2.
Classification results revealed that some temporal features (e.g., speech rate, utterance duration, and the number of silent pauses), spectral features (e.g., variability of F1 and F2), and energy features (e.g., SD of peak intensity and SD of intensity range) were effective predictors of pMCI. The best classification result was achieved in the Random Forest classifier (accuracy = 0.81, AUC = 0.81). Correlation analysis uncovered a strong negative correlation between participants' cognitive test scores and the probability estimates of pMCI in the Random Forest classifier, and a modest negative correlation in the Support Vector Machine classifier.
The automatic acoustic analysis of speech could provide a promising non-invasive way to assess and monitor the early cognitive decline in Mandarin-speaking elders.
迄今为止,对临床医生而言,可靠检测轻度认知障碍(MCI)仍是一项重大挑战。极少有研究调查声学特征在检测有MCI风险的讲普通话老年人(定义为“疑似MCI”,即pMCI)方面的敏感性。
本研究旨在探讨使用自动提取的语音声学特征检测患有pMCI的老年人的可能性,并揭示认知衰退早期潜在的声学标志物。
41名患有pMCI的老年人和41名健康老年对照者完成了四项阅读任务(音节发音、绕口令、连续语言和短句阅读),从中自动提取声学特征以训练机器学习分类器。采用相关性分析来评估分类器预测结果与通过简易精神状态检查表2测量的参与者认知能力之间的关系。
分类结果显示,一些时间特征(如语速、发声时长和无声停顿次数)、频谱特征(如F1和F2的变异性)和能量特征(如峰值强度标准差和强度范围标准差)是pMCI的有效预测指标。随机森林分类器取得了最佳分类结果(准确率 = 0.81,曲线下面积 = 0.81)。相关性分析发现,参与者的认知测试分数与随机森林分类器中pMCI的概率估计值之间存在强负相关,在支持向量机分类器中存在中等程度的负相关。
语音自动声学分析可为评估和监测讲普通话老年人的早期认知衰退提供一种有前景的非侵入性方法。