Internal-external cross-validation helped to evaluate the generalizability of prediction models in large clustered datasets.

Affiliations

Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.

Health Data Research UK and Institute of Health Informatics, University College London, Gibbs Building, 215 Euston Road, London, NW1 2BE, United Kingdom; The Alan Turing Institute, British Library, 96 Euston Road, London, NW1 2DB, United Kingdom; The National Institute for Health Research University College London Hospitals Biomedical Research Centre, University College London, Suite A, 1(st) floor, Maple House, 149 Tottenham Court Road, London, W1T 7DN, United Kingdom; British Heart Foundation Research Accelerator, University College London, Gower Street, London, WC1E 6BT, United Kingdom.

Publication information

J Clin Epidemiol. 2021 Sep;137:83-91. doi: 10.1016/j.jclinepi.2021.03.025. Epub 2021 Apr 6.

DOI: 10.1016/j.jclinepi.2021.03.025
PMID: 33836256

Abstract

OBJECTIVE

To illustrate how to evaluate the need of complex strategies for developing generalizable prediction models in large clustered datasets.

STUDY DESIGN AND SETTING

We developed eight Cox regression models to estimate the risk of heart failure using a large population-level dataset. These models differed in the number of predictors, the functional form of the predictor effects (non-linear effects and interaction) and the estimation method (maximum likelihood and penalization). Internal-external cross-validation was used to evaluate the models' generalizability across the included general practices.

RESULTS

Among 871,687 individuals from 225 general practices, 43,987 (5.5%) developed heart failure during a median follow-up time of 5.8 years. For discrimination, the simplest prediction model yielded a good concordance statistic, which was not much improved by adopting complex strategies. Between-practice heterogeneity in discrimination was similar in all models. For calibration, the simplest model performed satisfactorily. Although accounting for non-linear effects and interaction slightly improved the calibration slope, it also led to more heterogeneity in the observed/expected ratio. Similar results were found in a second case study involving patients with stroke.

CONCLUSION

In large clustered datasets, prediction model studies may adopt internal-external cross-validation to evaluate the generalizability of competing models, and to identify promising modelling strategies.
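
The internal-external cross-validation procedure named in the abstract can be sketched as a leave-one-cluster-out loop: each general practice is held out in turn, the model is fitted on the remaining practices, and discrimination and calibration are computed in the held-out practice. The sketch below is illustrative only and is not the authors' code. It assumes a binary outcome and a toy two-band age model in place of the Cox regression models and censored follow-up data used in the study; the function names, the age cutoff, and the example rows are all hypothetical.

```python
from collections import defaultdict


def concordance(scores, outcomes):
    """C-statistic for a binary outcome: the share of (event, non-event)
    pairs in which the event has the higher predicted risk; ties count 0.5."""
    events = [s for s, y in zip(scores, outcomes) if y == 1]
    nonevents = [s for s, y in zip(scores, outcomes) if y == 0]
    if not events or not nonevents:
        return None  # undefined when a cluster has no events or no non-events
    total = 0.0
    for e in events:
        for n in nonevents:
            total += 1.0 if e > n else 0.5 if e == n else 0.0
    return total / (len(events) * len(nonevents))


def fit_age_band_model(train, cutoff=60):
    """Toy model (assumption): event rate below and at/above an age cutoff."""
    def rate(group):
        return sum(y for _, y in group) / len(group) if group else 0.0
    older = [(x, y) for x, y in train if x >= cutoff]
    younger = [(x, y) for x, y in train if x < cutoff]
    return {"cutoff": cutoff, "older": rate(older), "younger": rate(younger)}


def predict_risk(model, x):
    """Predicted risk for one individual under the toy model."""
    return model["older"] if x >= model["cutoff"] else model["younger"]


def internal_external_cv(rows, fit, predict):
    """rows: iterable of (cluster_id, predictor, outcome).
    Hold out each cluster once; return per-cluster performance."""
    by_cluster = defaultdict(list)
    for cluster, x, y in rows:
        by_cluster[cluster].append((x, y))

    results = {}
    for held_out, test in by_cluster.items():
        # Development data: every cluster except the held-out one.
        train = [xy for c, items in by_cluster.items() if c != held_out
                 for xy in items]
        model = fit(train)
        preds = [predict(model, x) for x, _ in test]
        ys = [y for _, y in test]
        expected = sum(preds)
        results[held_out] = {
            "c_statistic": concordance(preds, ys),
            # Observed/expected ratio: 1.0 means calibrated on average.
            "oe_ratio": sum(ys) / expected if expected else None,
        }
    return results


# Hypothetical data: (practice, age, developed outcome).
rows = [
    ("A", 50, 0), ("A", 70, 1), ("A", 65, 0), ("A", 40, 0),
    ("B", 55, 0), ("B", 75, 1), ("B", 80, 1), ("B", 45, 0),
    ("C", 30, 0), ("C", 62, 1), ("C", 68, 0), ("C", 58, 1),
]
results = internal_external_cv(rows, fit_age_band_model, predict_risk)
for practice, metrics in sorted(results.items()):
    print(practice, metrics)
```

The spread of the per-practice values is what indicates generalizability: competing modelling strategies can be passed in as different `fit` and `predict` functions and compared on both their average performance and their between-practice heterogeneity.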
