• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用美国大型观察性医疗保健数据中的回顾性队列来训练预后模型时,研究开发和内部验证设计的影响。

Investigating the impact of development and internal validation design when training prognostic models using a retrospective cohort in big US observational healthcare data.

机构信息

Observational Health Data Sciences and Informatics Community, New York, New York, USA

Epidemiology, Janssen Research and Development LLC, Raritan, New Jersey, USA.

出版信息

BMJ Open. 2021 Dec 24;11(12):e050146. doi: 10.1136/bmjopen-2021-050146.

DOI:10.1136/bmjopen-2021-050146
PMID:34952871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8710861/
Abstract

OBJECTIVE

The internal validation of prediction models aims to quantify the generalisability of a model. We aim to determine the impact, if any, that the choice of development and internal validation design has on the internal performance bias and model generalisability in big data (n~500 000).

DESIGN

Retrospective cohort.

SETTING

Primary and secondary care; three US claims databases.

PARTICIPANTS

1 200 769 patients pharmaceutically treated for their first occurrence of depression.

METHODS

We investigated the impact of the development/validation design across 21 real-world prediction questions. Model discrimination and calibration were assessed. We trained LASSO logistic regression models using US claims data and internally validated the models using eight different designs: 'no test/validation set', 'test/validation set' and cross validation with 3-fold, 5-fold or 10-fold with and without a test set. We then externally validated each model in two new US claims databases. We estimated the internal validation bias per design by empirically comparing the differences between the estimated internal performance and external performance.

RESULTS

The differences between the models' internal estimated performances and external performances were largest for the 'no test/validation set' design. This indicates even with large data the 'no test/validation set' design causes models to overfit. The seven alternative designs included some validation process to select the hyperparameters and a fair testing process to estimate internal performance. These designs had similar internal performance estimates and performed similarly when externally validated in the two external databases.

CONCLUSIONS

Even with big data, it is important to use some validation process to select the optimal hyperparameters and fairly assess internal validation using a test set or cross-validation.

摘要

目的

预测模型的内部验证旨在量化模型的通用性。我们旨在确定开发和内部验证设计的选择对大数据(n~500000)中模型内部性能偏差和通用性的影响。

设计

回顾性队列。

设置

初级和二级保健;三个美国索赔数据库。

参与者

1200769 名接受药物治疗首次出现抑郁症的患者。

方法

我们调查了 21 个真实世界预测问题的开发/验证设计的影响。评估了模型的区分度和校准度。我们使用美国索赔数据训练了 LASSO 逻辑回归模型,并使用 8 种不同的设计进行了内部验证:“无测试/验证集”、“测试/验证集”和交叉验证,采用 3 折、5 折或 10 折,有无测试集。然后,我们在两个新的美国索赔数据库中对每个模型进行了外部验证。我们通过经验比较估计的内部性能和外部性能之间的差异,来估计每个设计的内部验证偏差。

结果

“无测试/验证集”设计的模型内部估计性能与外部性能之间的差异最大。这表明,即使有大量数据,“无测试/验证集”设计也会导致模型过度拟合。其他 7 种设计包括一些验证过程来选择超参数,以及公平的测试过程来估计内部性能。这些设计具有相似的内部性能估计值,并且在两个外部数据库中进行外部验证时表现相似。

结论

即使有大数据,使用一些验证过程来选择最优的超参数,并使用测试集或交叉验证公平地评估内部验证仍然很重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/b0e76aa87d81/bmjopen-2021-050146f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/18bd1e6700ac/bmjopen-2021-050146f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/c2eaac1c8716/bmjopen-2021-050146f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/b0e76aa87d81/bmjopen-2021-050146f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/18bd1e6700ac/bmjopen-2021-050146f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/c2eaac1c8716/bmjopen-2021-050146f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7b/8710861/b0e76aa87d81/bmjopen-2021-050146f03.jpg

相似文献

1
Investigating the impact of development and internal validation design when training prognostic models using a retrospective cohort in big US observational healthcare data.利用美国大型观察性医疗保健数据中的回顾性队列来训练预后模型时,研究开发和内部验证设计的影响。
BMJ Open. 2021 Dec 24;11(12):e050146. doi: 10.1136/bmjopen-2021-050146.
2
Estimating real-world performance of a predictive model: a case-study in predicting mortality.评估预测模型的实际性能:预测死亡率的案例研究
JAMIA Open. 2020 Apr 26;3(2):243-251. doi: 10.1093/jamiaopen/ooaa008. eCollection 2020 Jul.
3
External validation: a simulation study to compare cross-validation versus holdout or external testing to assess the performance of clinical prediction models using PET data from DLBCL patients.外部验证:一项模拟研究,比较交叉验证与留出法或外部测试,以使用弥漫性大B细胞淋巴瘤(DLBCL)患者的PET数据评估临床预测模型的性能。
EJNMMI Res. 2022 Sep 11;12(1):58. doi: 10.1186/s13550-022-00931-w.
4
Prognostic models for identifying risk of poor outcome in people with acute ankle sprains: the SPRAINED development and external validation study.用于识别急性踝关节扭伤患者不良结局风险的预测模型:SPRAINED 研究的开发和外部验证。
Health Technol Assess. 2018 Nov;22(64):1-112. doi: 10.3310/hta22640.
5
Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources.利用迭代成对外部验证来情境化预测模型性能:一个预测糖尿病患者 1 年心力衰竭风险的案例,涉及五个数据源。
Drug Saf. 2022 May;45(5):563-570. doi: 10.1007/s40264-022-01161-8. Epub 2022 May 17.
6
Risk models to predict late-onset seizures after stroke: A systematic review.卒中后迟发性癫痫的风险预测模型:系统评价。
Epilepsy Behav. 2021 Aug;121(Pt A):108003. doi: 10.1016/j.yebeh.2021.108003. Epub 2021 May 21.
7
Learning patient-level prediction models across multiple healthcare databases: evaluation of ensembles for increasing model transportability.跨多个医疗保健数据库学习患者级预测模型:评估集成模型以提高模型可转移性。
BMC Med Inform Decis Mak. 2022 May 25;22(1):142. doi: 10.1186/s12911-022-01879-6.
8
Performance of prediction models for nephropathy in people with type 2 diabetes: systematic review and external validation study.2 型糖尿病患者肾病预测模型的性能:系统评价和外部验证研究。
BMJ. 2021 Sep 28;374:n2134. doi: 10.1136/bmj.n2134.
9
External Validation and Optimization of the SPRING Model for Prediction of Survival After Surgical Treatment of Bone Metastases of the Extremities.四肢骨转移手术治疗后生存预测的 SPRING 模型的外部验证和优化。
Clin Orthop Relat Res. 2018 Aug;476(8):1591-1599. doi: 10.1097/01.blo.0000534678.44152.ee.
10
Overview of the epidemiology methods and applications: strengths and limitations of observational study designs.流行病学方法与应用概述:观察性研究设计的优势与局限性。
Crit Rev Food Sci Nutr. 2010;50 Suppl 1(s1):10-2. doi: 10.1080/10408398.2010.526838.

引用本文的文献

1
Integrating tumor location into artificial intelligence-based prognostic models in cancer.将肿瘤位置纳入基于人工智能的癌症预后模型。
World J Clin Oncol. 2025 Aug 24;16(8):109934. doi: 10.5306/wjco.v16.i8.109934.
2
Identification and validation of HOXC6 as a diagnostic biomarker for Ewing sarcoma: insights from machine learning algorithms and experiments.鉴定和验证HOXC6作为尤因肉瘤的诊断生物标志物:来自机器学习算法和实验的见解
Front Immunol. 2025 Apr 4;16:1449355. doi: 10.3389/fimmu.2025.1449355. eCollection 2025.
3
Can we develop real-world prognostic models using observational healthcare data? Large-scale experiment to investigate model sensitivity to database and phenotypes.

本文引用的文献

1
Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data.利用观察性医疗保健数据生成和评估患者水平预测模型的标准化框架的设计与实现。
J Am Med Inform Assoc. 2018 Aug 1;25(8):969-975. doi: 10.1093/jamia/ocy032.
2
Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review.利用电子健康记录数据开发风险预测模型的机遇与挑战:一项系统综述
J Am Med Inform Assoc. 2017 Jan;24(1):198-208. doi: 10.1093/jamia/ocw042. Epub 2016 May 17.
3
Prediction models need appropriate internal, internal-external, and external validation.
我们能否利用观察性医疗保健数据开发真实世界的预后模型?调查模型对数据库和表型敏感性的大规模实验。
Diagn Progn Res. 2025 Apr 17;9(1):10. doi: 10.1186/s41512-025-00191-x.
4
Evaluation of machine learning approach for surgical results of Ahmed valve implantation in patients with glaucoma.青光眼患者 Ahmed 阀植入手术结果的机器学习方法评估。
BMC Ophthalmol. 2024 Jun 11;24(1):248. doi: 10.1186/s12886-024-03510-w.
5
Comparing penalization methods for linear models on large observational health data.比较大型观测性健康数据中线性模型的惩罚方法。
J Am Med Inform Assoc. 2024 Jun 20;31(7):1514-1521. doi: 10.1093/jamia/ocae109.
6
Moderate-to-Heavy Alcohol Consumption May Cause a Significant Decrease in Serum High-Density Lipoprotein Cholesterol in Middle-Aged Women: A Cohort Study of the National Database Study in the Kanto 7 Prefectures-4.中度至重度饮酒可能导致中年女性血清高密度脂蛋白胆固醇显著下降:关东7县全国数据库队列研究-4
Cureus. 2024 Mar 4;16(3):e55467. doi: 10.7759/cureus.55467. eCollection 2024 Mar.
7
Discordant and Converting Receptor Expressions in Brain Metastases from Breast Cancer: MRI-Based Non-Invasive Receptor Status Tracking.乳腺癌脑转移中不一致和转变的受体表达:基于MRI的非侵入性受体状态追踪
Cancers (Basel). 2023 May 23;15(11):2880. doi: 10.3390/cancers15112880.
8
Machine Learning and Real-World Data to Predict Lung Cancer Risk in Routine Care.机器学习和真实世界数据在常规护理中预测肺癌风险。
Cancer Epidemiol Biomarkers Prev. 2023 Mar 6;32(3):337-343. doi: 10.1158/1055-9965.EPI-22-0873.
9
Machine learning in point-of-care automated classification of oral potentially malignant and malignant disorders: a systematic review and meta-analysis.基于机器学习的即时检测中口腔潜在恶性和恶性疾病的自动分类:系统评价和荟萃分析。
Sci Rep. 2022 Aug 13;12(1):13797. doi: 10.1038/s41598-022-17489-1.
10
Elevated All-Cause Mortality among Overweight Older People: AI Predicts a High Normal Weight Is Optimal.超重老年人全因死亡率升高:人工智能预测高正常体重最为适宜。
Geriatrics (Basel). 2022 Jun 16;7(3):68. doi: 10.3390/geriatrics7030068.
预测模型需要进行适当的内部验证、内部-外部联合验证以及外部验证。
J Clin Epidemiol. 2016 Jan;69:245-7. doi: 10.1016/j.jclinepi.2015.04.005. Epub 2015 Apr 18.
4
Massive parallelization of serial inference algorithms for a complex generalized linear model.用于复杂广义线性模型的串行推理算法的大规模并行化。
ACM Trans Model Comput Simul. 2013 Jan;23(1). doi: 10.1145/2414416.2414791.
5
Prognosis Research Strategy (PROGRESS) 3: prognostic model research.预后研究策略(PROGRESS)3:预后模型研究。
PLoS Med. 2013;10(2):e1001381. doi: 10.1371/journal.pmed.1001381. Epub 2013 Feb 5.
6
Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.预测模型的内部验证:逻辑回归分析中一些程序的效率
J Clin Epidemiol. 2001 Aug;54(8):774-81. doi: 10.1016/s0895-4356(01)00341-9.