Centre for Research in Psychology and Human Well-being, Faculty of Social Sciences and Humanities, Universiti Kebangsaan Malaysia, Bangi, Malaysia.
Department of Psychology, Chengde Medical University, Chengde, China.
Int J Public Health. 2023 Jan 19;68:1605322. doi: 10.3389/ijph.2023.1605322. eCollection 2023.
To explore the predictive value of machine learning in cognitive impairment, and identify important factors for cognitive impairment. A total of 2,326 middle-aged and elderly people completed questionnaire, and physical examination evaluation at baseline, Year 2, and Year 4 follow-ups. A random forest machine learning (ML) model was used to predict the cognitive impairment at Year 2 and Year 4 longitudinally. Based on Year 4 cross-sectional data, the same method was applied to establish a prediction model and verify its longitudinal prediction accuracy for cognitive impairment. Meanwhile, the ability of random forest and traditional logistic regression model to longitudinally predict 2-year and 4-year cognitive impairment was compared. Random forest models showed high accuracy for all outcomes at Year 2, Year 4, and cross-sectional Year 4 [AUC = 0.81, 0.79, 0.80] compared with logistic regression [AUC = 0.61, 0.62, 0.70]. Baseline physical examination (e.g., BMI, Blood pressure), biomarkers (e.g., cholesterol), functioning (e.g., functional limitations), demography (e.g., age), and emotional status (e.g., depression) characteristics were identified as the top ten important predictors of cognitive impairment. ML algorithms could enhance the prediction of cognitive impairment among the middle-aged and older Chinese for 4 years and identify essential risk markers.
为了探究机器学习在认知障碍预测中的价值,并确定认知障碍的重要因素。共有 2326 名中老年人群在基线、第 2 年和第 4 年随访时完成了问卷调查和体格检查评估。采用随机森林机器学习(ML)模型对第 2 年和第 4 年的认知障碍进行纵向预测。基于第 4 年的横断面数据,应用相同方法建立预测模型,并验证其对认知障碍的纵向预测准确性。同时,比较了随机森林和传统逻辑回归模型对 2 年和 4 年认知障碍的纵向预测能力。随机森林模型在第 2 年、第 4 年和横断面第 4 年的所有结果中均显示出较高的准确性(AUC = 0.81、0.79、0.80),而逻辑回归模型的 AUC 值分别为 0.61、0.62 和 0.70。基线体检(如 BMI、血压)、生物标志物(如胆固醇)、功能(如功能障碍)、人口统计学(如年龄)和情绪状态(如抑郁)特征被确定为认知障碍的前 10 个重要预测因素。ML 算法可以提高对中国中老年人群 4 年认知障碍的预测能力,并确定重要的风险标志物。