Austin Peter C, Putter Hein, Giardiello Daniele, van Klaveren David
ICES, G106, 2075 Bayview Avenue, Toronto, Ontario, M4N 3M5, Canada.
Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada.
Diagn Progn Res. 2022 Jan 17;6(1):2. doi: 10.1186/s41512-021-00114-6.
Assessing calibration (the agreement between estimated risks and observed proportions) is an important component of deriving and validating clinical prediction models. Methods for assessing the calibration of prognostic models for use with competing risk data have received little attention.
We propose a method for graphically assessing the calibration of competing risk regression models. The proposed method can be used to assess the calibration of any model that estimates incidence in the presence of competing risks (e.g., a Fine-Gray subdistribution hazard model, a combination of cause-specific hazard functions, or a random survival forest). The method uses a Fine-Gray subdistribution hazard model to regress the cumulative incidence function of the cause-specific outcome of interest on the risk predicted by the model whose calibration is being assessed. We also modify three numerical calibration metrics, the integrated calibration index (ICI), E50, and E90, for use with competing risk data. We conducted a series of Monte Carlo simulations to evaluate the performance of these calibration measures when the underlying model was correctly specified, when it was mis-specified, and when the incidence of the cause-specific outcome differed between the derivation and validation samples. We illustrated the usefulness of the calibration curves and the numerical calibration metrics by comparing the calibration of a Fine-Gray subdistribution hazard regression model with that of random survival forests for predicting cardiovascular mortality in patients hospitalized with heart failure.
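The abstract does not include code; as a minimal illustrative sketch (not the authors' implementation), the three numerical metrics reduce to summaries of the absolute difference between each subject's predicted risk and the "observed" risk estimated by the flexible calibration model at the time point of interest. The function name and inputs below are hypothetical, and obtaining the observed risks would first require fitting the Fine-Gray calibration regression described above, which is not shown here.

```python
import numpy as np

def calibration_metrics(predicted_risk, observed_risk):
    """Numerical calibration metrics for competing risk predictions.

    predicted_risk : predicted cumulative incidences at a fixed time point,
        from the model being validated.
    observed_risk  : "observed" cumulative incidences for the same subjects,
        obtained from a flexible calibration model (in the paper, a Fine-Gray
        regression of the CIF on the predicted risks).
    """
    abs_diff = np.abs(np.asarray(predicted_risk) - np.asarray(observed_risk))
    return {
        "ICI": abs_diff.mean(),              # mean absolute calibration error
        "E50": np.percentile(abs_diff, 50),  # median absolute calibration error
        "E90": np.percentile(abs_diff, 90),  # 90th percentile of the error
    }

# Example with made-up values
print(calibration_metrics([0.10, 0.25, 0.40], [0.12, 0.22, 0.47]))
```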
The simulations indicated that the method for constructing graphical calibration curves and the associated calibration metrics performed as desired. We also demonstrated that the numerical calibration metrics can be used as optimization criteria when tuning machine learning methods for competing risk outcomes.
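The abstract does not show how the metrics serve as tuning criteria; the purely synthetic sketch below (all data and candidate node sizes are invented, and no competing risk model is actually fit) only illustrates the selection logic: compute the ICI for each candidate tuning value on validation data and keep the value that minimizes it.

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in for predictions produced by models fit under different tuning
# values; in practice these would come from, e.g., random survival forests
# with different minimum node sizes. Everything here is synthetic.
observed = rng.uniform(0.05, 0.60, size=500)  # smoothed "observed" risks
candidate_predictions = {
    5:  np.clip(observed + rng.normal(0, 0.08, 500), 0, 1),
    20: np.clip(observed + rng.normal(0, 0.04, 500), 0, 1),
    50: np.clip(observed + rng.normal(0, 0.12, 500), 0, 1),
}

# Select the tuning value whose validation-sample predictions minimize the ICI.
ici = {k: np.mean(np.abs(p - observed)) for k, p in candidate_predictions.items()}
best = min(ici, key=ici.get)
print(ici, "-> selected node size:", best)
```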
The calibration curves and numerical calibration metrics permit a comprehensive comparison of the calibration of different competing risk models.