Lee Jiho, Kim Nayeon, Ha Ji-Wan, Kang Kyunghun, Park Eunhee, Yoon Janghyeok, Park Ki-Su
Neopons Inc., Daegu 41260, Republic of Korea.
Department of Speech-Language Pathology, Daegu University, Gyeongsan 38453, Republic of Korea.
Diagnostics (Basel). 2024 Dec 17;14(24):2837. doi: 10.3390/diagnostics14242837.
: To develop a non-invasive cognitive impairment detection system using speech data analysis, addressing the growing global dementia crisis and enabling accessible early screening through daily health monitoring. : Speech data from 223 Korean patients were collected across eight tasks. Patients were classified based on Korean Mini-Mental State Examination scores. Four machine learning models were tested for three binary classification tasks. Voice acoustic features were extracted and analyzed. : The Deep Neural Network model performed best in two classification tasks, with Precision-Recall Area Under the Curve scores of 0.737 for severe vs. no impairment and 0.726 for mild vs. no impairment, while Random Forest achieved 0.715 for severe + mild vs. no impairment. Several acoustic features emerged as potentially important indicators, with DDA shimmer from the /i/ task and stdevF0 from the /puh-tuh-kuh/ task showing consistent patterns across classification tasks. : This preliminary study suggests that certain acoustic features may be associated with cognitive status, though demographic factors significantly influence these relationships. Further research with demographically matched populations is needed to validate these findings.
开发一种使用语音数据分析的非侵入性认知障碍检测系统,以应对全球日益严重的痴呆危机,并通过日常健康监测实现可及的早期筛查。收集了223名韩国患者在八项任务中的语音数据。根据韩国简易精神状态检查表得分对患者进行分类。针对三项二分类任务测试了四种机器学习模型。提取并分析了语音声学特征。深度神经网络模型在两项分类任务中表现最佳,重度与无损伤的精确召回曲线下面积得分为0.737,轻度与无损伤的得分为0.726,而随机森林模型在重度+轻度与无损伤的分类中得分为0.715。几种声学特征成为潜在的重要指标,/i/任务中的DDA闪烁和/puh-tuh-kuh/任务中的stdevF0在各项分类任务中呈现出一致的模式。这项初步研究表明,某些声学特征可能与认知状态相关,尽管人口统计学因素会显著影响这些关系。需要对人口统计学匹配的人群进行进一步研究以验证这些发现。