Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.
Janssen Research and Development, Raritan, NJ, USA.
BMC Med. 2024 Jul 29;22(1):308. doi: 10.1186/s12916-024-03530-9.
A prediction model can be a useful tool to quantify the risk of a patient developing dementia in the next years and take risk-factor-targeted intervention. Numerous dementia prediction models have been developed, but few have been externally validated, likely limiting their clinical uptake. In our previous work, we had limited success in externally validating some of these existing models due to inadequate reporting. As a result, we are compelled to develop and externally validate novel models to predict dementia in the general population across a network of observational databases. We assess regularization methods to obtain parsimonious models that are of lower complexity and easier to implement.
Logistic regression models were developed across a network of five observational databases with electronic health records (EHRs) and claims data to predict 5-year dementia risk in persons aged 55-84. The regularization methods L1 and Broken Adaptive Ridge (BAR) as well as three candidate predictor sets to optimize prediction performance were assessed. The predictor sets include a baseline set using only age and sex, a full set including all available candidate predictors, and a phenotype set which includes a limited number of clinically relevant predictors.
BAR can be used for variable selection, outperforming L1 when a parsimonious model is desired. Adding candidate predictors for disease diagnosis and drug exposure generally improves the performance of baseline models using only age and sex. While a model trained on German EHR data saw an increase in AUROC from 0.74 to 0.83 with additional predictors, a model trained on US EHR data showed only minimal improvement from 0.79 to 0.81 AUROC. Nevertheless, the latter model developed using BAR regularization on the clinically relevant predictor set was ultimately chosen as best performing model as it demonstrated more consistent external validation performance and improved calibration.
We developed and externally validated patient-level models to predict dementia. Our results show that although dementia prediction is highly driven by demographic age, adding predictors based on condition diagnoses and drug exposures further improves prediction performance. BAR regularization outperforms L1 regularization to yield the most parsimonious yet still well-performing prediction model for dementia.
预测模型可以是一种有用的工具,用于量化患者在未来几年内发展为痴呆的风险,并采取针对风险因素的干预措施。已经开发了许多痴呆预测模型,但很少有经过外部验证的,这可能限制了它们在临床上的应用。在我们之前的工作中,由于报告不充分,我们在对一些现有模型进行外部验证方面取得的成功有限。因此,我们必须开发和验证新的模型,以在一个包含观察性数据库的网络中预测一般人群的痴呆症。我们评估了正则化方法,以获得更简单、更易于实施的简约模型。
使用包含电子健康记录(EHR)和索赔数据的五个观察性数据库网络开发逻辑回归模型,以预测 55-84 岁人群 5 年内痴呆风险。评估了正则化方法 L1 和破碎自适应岭(BAR)以及三个候选预测器集,以优化预测性能。预测器集包括仅使用年龄和性别构建的基本集、包含所有可用候选预测器的完整集以及包含有限数量临床相关预测器的表型集。
BAR 可用于变量选择,当需要简约模型时,其性能优于 L1。添加疾病诊断和药物暴露的候选预测器通常可以提高仅使用年龄和性别构建的基线模型的性能。尽管使用德国 EHR 数据训练的模型的 AUC 从 0.74 增加到 0.83,但使用美国 EHR 数据训练的模型的 AUC 仅从 0.79 增加到 0.81,增幅很小。尽管如此,使用 BAR 正则化在临床相关预测器集上开发的后者模型最终被选为表现最佳的模型,因为它表现出更一致的外部验证性能和改进的校准。
我们开发并验证了预测痴呆症的患者水平模型。我们的结果表明,尽管痴呆症预测主要由人口统计学年龄驱动,但添加基于疾病诊断和药物暴露的预测器可以进一步提高预测性能。BAR 正则化优于 L1 正则化,可产生最简约但性能仍然良好的痴呆症预测模型。