Observational Health Data Sciences and Informatics Community, New York, New York, USA.
Epidemiology, Janssen Research and Development LLC, Raritan, New Jersey, USA.
BMJ Open. 2021 Dec 24;11(12):e050146. doi: 10.1136/bmjopen-2021-050146.
Objectives: Internal validation of prediction models aims to quantify a model's generalisability. We aim to determine the impact, if any, that the choice of development and internal validation design has on internal performance bias and model generalisability in big data (n~500 000).
Design: Retrospective cohort.
Setting: Primary and secondary care; three US claims databases.
Participants: 1 200 769 patients pharmaceutically treated for their first occurrence of depression.
Methods: We investigated the impact of the development/validation design across 21 real-world prediction questions. Model discrimination and calibration were assessed. We trained LASSO logistic regression models using US claims data and internally validated the models using eight different designs: 'no test/validation set', 'test/validation set', and 3-fold, 5-fold or 10-fold cross-validation, each with and without a test set. We then externally validated each model in two new US claims databases. We estimated the internal validation bias per design by empirically comparing the differences between the estimated internal performance and the external performance.
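To make the design contrast concrete, the sketch below (a minimal Python/scikit-learn illustration on synthetic data, not the authors' pipeline) implements two of the eight designs: 'no test/validation set', where the hyperparameter is tuned and performance estimated on the same data, and 'test/validation set', where a validation split selects the L1 penalty strength and a held-out test split gives the internal performance estimate.

```python
# Illustrative sketch of two internal validation designs on synthetic data.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=20_000, n_features=50,
                           n_informative=10, random_state=0)
cs = [0.001, 0.01, 0.1, 1.0]  # candidate inverse regularisation strengths

def fit_lasso_logistic(X, y, C):
    # LASSO logistic regression = L1-penalised logistic regression.
    return LogisticRegression(penalty="l1", C=C, solver="liblinear").fit(X, y)

# Design 1: 'no test/validation set' -- hyperparameter tuned and
# performance estimated on the same data (optimistically biased).
best = max(cs, key=lambda c: roc_auc_score(
    y, fit_lasso_logistic(X, y, c).predict_proba(X)[:, 1]))
auc_apparent = roc_auc_score(
    y, fit_lasso_logistic(X, y, best).predict_proba(X)[:, 1])

# Design 2: 'test/validation set' -- a validation split picks the
# hyperparameter; a held-out test split gives a fair internal estimate.
X_tr, X_tmp, y_tr, y_tmp = train_test_split(X, y, test_size=0.4, random_state=0)
X_val, X_te, y_val, y_te = train_test_split(X_tmp, y_tmp, test_size=0.5,
                                            random_state=0)
best = max(cs, key=lambda c: roc_auc_score(
    y_val, fit_lasso_logistic(X_tr, y_tr, c).predict_proba(X_val)[:, 1]))
model = fit_lasso_logistic(np.vstack([X_tr, X_val]),
                           np.concatenate([y_tr, y_val]), best)
auc_test = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])

print(f"apparent AUC (no test/validation set): {auc_apparent:.3f}")
print(f"internal AUC (held-out test set):      {auc_test:.3f}")
```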
Results: The difference between a model's internal estimated performance and its external performance was largest for the 'no test/validation set' design, indicating that, even with large data, this design causes models to overfit. The seven alternative designs all included some validation process to select the hyperparameters and a fair testing process to estimate internal performance. These designs produced similar internal performance estimates and performed similarly when externally validated in the two external databases.
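As an illustration of how the per-design bias can be quantified, the sketch below subtracts an externally observed performance from each design's internal estimate. All AUC values here are hypothetical placeholders, not results from the study.

```python
# Hypothetical illustration of the bias estimate: internal performance
# estimate minus externally observed performance, per design.
internal_auc = {
    "no test/validation set": 0.78,  # apparent (optimistic) estimate
    "test/validation set": 0.72,
    "5-fold CV + test set": 0.72,
}
external_auc = 0.70  # AUC observed when applying the model to a new database

for design, auc in internal_auc.items():
    bias = auc - external_auc
    print(f"{design:24s} internal validation bias = {bias:+.3f}")
```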
Conclusions: Even with big data, it is important to use a validation process to select the optimal hyperparameters and to estimate internal performance fairly using a test set or cross-validation.
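For completeness, here is a sketch of one recommended design, k-fold cross-validation for hyperparameter selection plus a held-out test set for a fair internal estimate, using scikit-learn's GridSearchCV on synthetic data (again illustrative, not the authors' pipeline).

```python
# Sketch: 5-fold cross-validation picks the L1 penalty strength; a
# held-out test set, never touched during tuning, estimates performance.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = make_classification(n_samples=20_000, n_features=50, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=1)

search = GridSearchCV(
    LogisticRegression(penalty="l1", solver="liblinear"),
    param_grid={"C": [0.001, 0.01, 0.1, 1.0]},
    cv=5,  # 5-fold cross-validation selects the hyperparameter
    scoring="roc_auc",
).fit(X_tr, y_tr)

auc = roc_auc_score(y_te, search.best_estimator_.predict_proba(X_te)[:, 1])
print(f"best C = {search.best_params_['C']}, internal test AUC = {auc:.3f}")
```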