School of Foreign Studies, Tongji University, Shanghai, China.
Research Center for Ageing, Language and Care, Tongji University, Shanghai, China.
Front Public Health. 2024 Aug 8;12:1417966. doi: 10.3389/fpubh.2024.1417966. eCollection 2024.
Speech analysis has been expected to help as a screening tool for early detection of Alzheimer's disease (AD) and mild-cognitively impairment (MCI). Acoustic features and linguistic features are usually used in speech analysis. However, no studies have yet determined which type of features provides better screening effectiveness, especially in the large aging population of China.
Firstly, to compare the screening effectiveness of acoustic features, linguistic features, and their combination using the same dataset. Secondly, to develop Chinese automated diagnosis model using self-collected natural discourse data obtained from native Chinese speakers.
A total of 92 participants from communities in Shanghai, completed MoCA-B and a picture description task based on the Cookie Theft under the guidance of trained operators, and were divided into three groups including AD, MCI, and heathy control (HC) based on their MoCA-B score. Acoustic features (Pitches, Jitter, Shimmer, MFCCs, Formants) and linguistic features (part-of-speech, type-token ratio, information words, information units) are extracted. The machine algorithms used in this study included logistic regression, random forest (RF), support vector machines (SVM), Gaussian Naive Bayesian (GNB), and k-Nearest neighbor (kNN). The validation accuracies of the same ML model using acoustic features, linguistic features, and their combination were compared.
The accuracy with linguistic features is generally higher than acoustic features in training. The highest accuracy to differentiate HC and AD is 80.77% achieved by SVM, based on all the features extracted from the speech data, while the highest accuracy to differentiate HC and AD or MCI is 80.43% achieved by RF, based only on linguistic features.
Our results suggest the utility and validity of linguistic features in the automated diagnosis of cognitive impairment, and validated the applicability of automated diagnosis for Chinese language data.
语音分析有望作为一种筛查工具,用于阿尔茨海默病(AD)和轻度认知障碍(MCI)的早期检测。语音分析通常使用声学特征和语言特征。然而,目前还没有研究确定哪种类型的特征提供更好的筛查效果,尤其是在中国庞大的老龄化人口中。
首先,使用相同的数据集比较声学特征、语言特征及其组合的筛查效果。其次,使用从母语为汉语的人那里收集的自然话语数据,开发中文自动诊断模型。
共有 92 名来自上海社区的参与者在经过培训的操作人员的指导下完成了 MoCA-B 和基于 Cookie 盗窃的图片描述任务,并根据 MoCA-B 得分分为 AD、MCI 和健康对照组(HC)三组。提取声学特征(音高、抖动、颤动、MFCC、共振峰)和语言特征(词性、词类比、信息词、信息单位)。本研究中使用的机器算法包括逻辑回归、随机森林(RF)、支持向量机(SVM)、高斯朴素贝叶斯(GNB)和 k-最近邻(kNN)。比较了使用声学特征、语言特征及其组合的相同 ML 模型的验证精度。
在训练中,语言特征的准确性通常高于声学特征。SVM 基于从语音数据中提取的所有特征,区分 HC 和 AD 的准确率最高为 80.77%,而 RF 仅基于语言特征,区分 HC 和 AD 或 MCI 的准确率最高为 80.43%。
我们的结果表明语言特征在认知障碍的自动诊断中具有实用性和有效性,并验证了自动诊断对中文语言数据的适用性。