Yuan Shiqi, Liu Qing, Huang Xiaxuan, Tan Shanyuan, Bai Zihong, Yu Juan, Lei Fazhen, Le Huan, Ye Qingqing, Peng Xiaoxue, Yang Juying, Ling Yitong, Lyu Jun
Department of Neurology, The First Affiliated Hospital of Jinan University, No.613, Huangpu Road West, Guangzhou, Guangdong Province, 510630, China.
Department of Neurology, The Second People's Hospital of Guiyang (The Affiliated Jinyang Hospital of Guizhou Medical University), Guiyang, Guizhou Province, 550000, China.
Alzheimers Res Ther. 2024 Dec 30;16(1):278. doi: 10.1186/s13195-024-01663-w.
Dementia is a major public health challenge in modern society. Early detection of high-risk dementia patients and timely intervention or treatment are of significant clinical importance. Neural network survival analysis represents the most advanced technology for survival analysis to date. However, there is a lack of deep learning-based survival analysis models that integrate both genetic and clinical factors to develop and validate individualized dynamic dementia risk prediction models.
This study is based on a large prospective cohort from the UK Biobank, which includes a total of 41,484 participants with an average follow-up period of 12.6 years. Initially, 364 candidate features (predictor variables) were screened. The top 30 key features were then identified by ranking the importance of each predictor variable using the Gradient Boosting Machine (GBM) model. A multi-model comparison strategy was employed to evaluate the predictive performance of four survival analysis models: DeepSurv, DeepHit, Kaplan-Meier estimation, and the Cox proportional hazards model (CoxPH). The results showed that the average Harrell's C-index for the DeepSurv model was 0.743, for the DeepHit model it was 0.633, for the CoxPH model it was 0.749, and for the Kaplan-Meier estimator model it was 0.500. In addition, the average D-Calibration Survival Measure was 6.014, 4408.086, 32274.743, and 1.508, respectively. The Brier score (BS) was used to assess the importance of features for the DeepSurv dementia prediction model, and the relationship between features and dementia was visualized using a partial dependence plot (PDP). To facilitate further research, the team deployed the DeepSurv dementia prediction model on AliCloud servers and designated it as the UKB-DementiaPre Tool.
This study successfully developed and validated the DeepSurv dementia prediction model for individuals aged 60 years and above, integrating both genetic and clinical data. The model was then deployed on AliCloud servers to promote its clinical translation. It is anticipated that this prediction model will provide more accurate decision support for clinical treatment and will serve as a valuable tool for the primary prevention of dementia.
痴呆是现代社会面临的一项重大公共卫生挑战。早期发现高危痴呆患者并及时进行干预或治疗具有重要的临床意义。神经网络生存分析是迄今为止生存分析领域最先进的技术。然而,目前缺乏基于深度学习的生存分析模型,该模型能整合遗传和临床因素来开发并验证个性化的动态痴呆风险预测模型。
本研究基于英国生物银行的一个大型前瞻性队列,该队列共有41484名参与者,平均随访期为12.6年。最初,筛选出364个候选特征(预测变量)。然后使用梯度提升机(GBM)模型对每个预测变量的重要性进行排序,从而确定前30个关键特征。采用多模型比较策略来评估四种生存分析模型的预测性能:深度生存模型(DeepSurv)、深度命中模型(DeepHit)、卡普兰 - 迈耶估计法以及考克斯比例风险模型(CoxPH)。结果显示,DeepSurv模型的平均哈雷尔C指数为0.743,DeepHit模型为0.633,CoxPH模型为0.749,卡普兰 - 迈耶估计模型为0.500。此外,平均D - 校准生存度量分别为6.014、4408.086、32274.743和1.508。使用布里尔分数(BS)评估特征对DeepSurv痴呆预测模型的重要性,并使用部分依赖图(PDP)直观展示特征与痴呆之间的关系。为便于进一步研究,该团队将DeepSurv痴呆预测模型部署在阿里云服务器上,并将其指定为UKB - DementiaPre工具。
本研究成功开发并验证了针对60岁及以上人群的DeepSurv痴呆预测模型,该模型整合了遗传和临床数据。随后将该模型部署在阿里云服务器上以促进其临床转化。预计该预测模型将为临床治疗提供更准确的决策支持,并成为痴呆一级预防的宝贵工具。