• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于逻辑回归模型进行外部验证评分系统的样本量计算。

Sample size calculation to externally validate scoring systems based on logistic regression models.

作者信息

Palazón-Bru Antonio, Folgado-de la Rosa David Manuel, Cortés-Castell Ernesto, López-Cascales María Teresa, Gil-Guillén Vicente Francisco

机构信息

Department of Clinical Medicine, Miguel Hernández University, San Juan de Alicante, Alicante, Spain.

Department of Pharmacology, Pediatrics and Organic Chemistry, Miguel Hernández University, San Juan de Alicante, Alicante, Spain.

出版信息

PLoS One. 2017 May 1;12(5):e0176726. doi: 10.1371/journal.pone.0176726. eCollection 2017.

DOI:10.1371/journal.pone.0176726
PMID:28459847
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5411086/
Abstract

BACKGROUND

A sample size containing at least 100 events and 100 non-events has been suggested to validate a predictive model, regardless of the model being validated and that certain factors can influence calibration of the predictive model (discrimination, parameterization and incidence). Scoring systems based on binary logistic regression models are a specific type of predictive model.

OBJECTIVE

The aim of this study was to develop an algorithm to determine the sample size for validating a scoring system based on a binary logistic regression model and to apply it to a case study.

METHODS

The algorithm was based on bootstrap samples in which the area under the ROC curve, the observed event probabilities through smooth curves, and a measure to determine the lack of calibration (estimated calibration index) were calculated. To illustrate its use for interested researchers, the algorithm was applied to a scoring system, based on a binary logistic regression model, to determine mortality in intensive care units.

RESULTS

In the case study provided, the algorithm obtained a sample size with 69 events, which is lower than the value suggested in the literature.

CONCLUSION

An algorithm is provided for finding the appropriate sample size to validate scoring systems based on binary logistic regression models. This could be applied to determine the sample size in other similar cases.

摘要

背景

有人建议采用一个包含至少100个事件和100个非事件的样本量来验证一个预测模型,无论该模型是否正在被验证,且某些因素会影响预测模型的校准(区分度、参数化和发生率)。基于二元逻辑回归模型的评分系统是一种特定类型的预测模型。

目的

本研究的目的是开发一种算法,以确定用于验证基于二元逻辑回归模型的评分系统的样本量,并将其应用于一个案例研究。

方法

该算法基于自助抽样,其中计算了ROC曲线下面积、通过平滑曲线得到的观察到的事件概率以及一种用于确定校准不足的度量(估计校准指数)。为了向感兴趣的研究人员说明其用法,该算法被应用于一个基于二元逻辑回归模型的评分系统,以确定重症监护病房的死亡率。

结果

在所提供的案例研究中,该算法得到了一个包含69个事件的样本量,这低于文献中建议的值。

结论

提供了一种算法,用于找到合适的样本量来验证基于二元逻辑回归模型的评分系统。这可应用于确定其他类似案例中的样本量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/3da2ff4eea27/pone.0176726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/060e609b4894/pone.0176726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/1e854f6c15ed/pone.0176726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/3da2ff4eea27/pone.0176726.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/060e609b4894/pone.0176726.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/1e854f6c15ed/pone.0176726.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb99/5411086/3da2ff4eea27/pone.0176726.g003.jpg

相似文献

1
Sample size calculation to externally validate scoring systems based on logistic regression models.基于逻辑回归模型进行外部验证评分系统的样本量计算。
PLoS One. 2017 May 1;12(5):e0176726. doi: 10.1371/journal.pone.0176726. eCollection 2017.
2
A method to validate scoring systems based on logistic regression models to predict binary outcomes via a mobile application for Android with an example of a real case.一种通过适用于安卓系统的移动应用程序,基于逻辑回归模型验证评分系统以预测二元结果的方法,并给出一个实际案例示例。
Comput Methods Programs Biomed. 2020 Nov;196:105570. doi: 10.1016/j.cmpb.2020.105570. Epub 2020 Jun 3.
3
Mortality prediction in intensive care units with the Super ICU Learner Algorithm (SICULA): a population-based study.重症监护病房死亡率预测的超级 ICU 学习者算法(SICULA):一项基于人群的研究。
Lancet Respir Med. 2015 Jan;3(1):42-52. doi: 10.1016/S2213-2600(14)70239-5. Epub 2014 Nov 24.
4
How Does the Skeletal Oncology Research Group Algorithm's Prediction of 5-year Survival in Patients with Chondrosarcoma Perform on International Validation?骨肿瘤研究组算法对软骨肉瘤患者 5 年生存率的预测在国际验证中的表现如何?
Clin Orthop Relat Res. 2020 Oct;478(10):2300-2308. doi: 10.1097/CORR.0000000000001305.
5
Calibration and discrimination by daily Logistic Organ Dysfunction scoring comparatively with daily Sequential Organ Failure Assessment scoring for predicting hospital mortality in critically ill patients.通过每日逻辑器官功能障碍评分与每日序贯器官衰竭评估评分比较进行校准和鉴别,以预测危重症患者的医院死亡率。
Crit Care Med. 2002 Sep;30(9):2003-13. doi: 10.1097/00003246-200209000-00009.
6
Sample size for binary logistic prediction models: Beyond events per variable criteria.二项逻辑预测模型的样本量:超越变量标准的事件数。
Stat Methods Med Res. 2019 Aug;28(8):2455-2474. doi: 10.1177/0962280218784726. Epub 2018 Jul 3.
7
A discussion of calibration techniques for evaluating binary and categorical predictive models.关于评估二元和分类预测模型的校准技术的讨论。
Prev Vet Med. 2018 Jan 1;149:107-114. doi: 10.1016/j.prevetmed.2017.11.018. Epub 2017 Nov 24.
8
Factors affecting the performance of the models in the Mortality Probability Model II system and strategies of customization: a simulation study.影响死亡率概率模型II系统中模型性能的因素及定制策略:一项模拟研究。
Crit Care Med. 1996 Jan;24(1):57-63. doi: 10.1097/00003246-199601000-00011.
9
The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models.综合校准指数(ICI)及其相关指标,用于量化逻辑回归模型的校准。
Stat Med. 2019 Sep 20;38(21):4051-4065. doi: 10.1002/sim.8281. Epub 2019 Jul 3.
10
Improving calibration of logistic regression models by local estimates.通过局部估计改进逻辑回归模型的校准
AMIA Annu Symp Proc. 2008 Nov 6;2008:535-9.

引用本文的文献

1
Simple severity scale for perforated peptic ulcer with generalized peritonitis: a derivation and internal validation study.伴有弥漫性腹膜炎的穿孔性消化性溃疡简易严重程度量表:一项推导与内部验证研究
Int J Surg. 2024 Nov 1;110(11):7134-7141. doi: 10.1097/JS9.0000000000002037.
2
Early symptoms preceding post-infectious irritable bowel syndrome following COVID-19: a retrospective observational study incorporating daily gastrointestinal symptoms.新冠病毒感染后出现肠易激综合征的早期前驱症状:一项纳入每日胃肠道症状的回顾性观察研究。
BMC Gastroenterol. 2023 Apr 5;23(1):108. doi: 10.1186/s12876-023-02746-y.
3
Validation of the Dutch version of the Hip Outcome Score; validity, reliability, and responsiveness in patients with femoroacetabular impingement syndrome.

本文引用的文献

1
Construction and internal validation of a new mortality risk score for patients admitted to the intensive care unit.重症监护病房收治患者新型死亡风险评分的构建与内部验证
Int J Clin Pract. 2016 Nov;70(11):916-922. doi: 10.1111/ijcp.12851. Epub 2016 Aug 3.
2
A calibration hierarchy for risk models was defined: from utopia to empirical data.定义了风险模型的校准层次结构:从理想状态到经验数据。
J Clin Epidemiol. 2016 Jun;74:167-76. doi: 10.1016/j.jclinepi.2015.12.005. Epub 2016 Jan 6.
3
Sample size considerations for the external validation of a multivariable prognostic model: a resampling study.
荷兰版髋关节结局评分的验证;股骨髋臼撞击综合征患者的有效性、可靠性和反应性
J Hip Preserv Surg. 2021 Oct 7;8(3):298-304. doi: 10.1093/jhps/hnab073. eCollection 2021 Aug.
4
Cardiovascular Risk According to Body Mass Index in Women of Reproductive Age With Polycystic Ovary Syndrome: A Systematic Review and Meta-Analysis.多囊卵巢综合征育龄女性中根据体重指数评估的心血管风险:一项系统评价和荟萃分析
Front Cardiovasc Med. 2022 Feb 16;9:822079. doi: 10.3389/fcvm.2022.822079. eCollection 2022.
5
Factors affecting the performance of brain arteriovenous malformation rupture prediction models.影响脑动静脉畸形破裂预测模型性能的因素。
BMC Med Inform Decis Mak. 2021 May 3;21(1):142. doi: 10.1186/s12911-021-01511-z.
6
External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb.临床预测模型的外部验证:基于模拟的样本量计算比经验法则更可靠。
J Clin Epidemiol. 2021 Jul;135:79-89. doi: 10.1016/j.jclinepi.2021.02.011. Epub 2021 Feb 14.
7
Deep learning-based computer-aided diagnosis in screening breast ultrasound to reduce false-positive diagnoses.基于深度学习的计算机辅助诊断在乳腺超声筛查中减少假阳性诊断。
Sci Rep. 2021 Jan 11;11(1):395. doi: 10.1038/s41598-020-79880-0.
8
A novel scoring system to predict the requirement for surgical intervention in victims of motor vehicle crashes: Development and validation using independent cohorts.一种用于预测机动车事故受害者手术干预需求的新型评分系统:使用独立队列进行开发和验证。
PLoS One. 2019 Dec 10;14(12):e0226282. doi: 10.1371/journal.pone.0226282. eCollection 2019.
9
Ability of Fibrin Monomers to Predict Progressive Hemorrhagic Injury in Patients with Severe Traumatic Brain Injury.纤维蛋白单体预测严重创伤性脑损伤患者进行性出血性损伤的能力。
Neurocrit Care. 2020 Aug;33(1):182-195. doi: 10.1007/s12028-019-00882-6.
10
Detection of frailty in older patients using a mobile app: cross-sectional observational study in primary care.使用移动应用程序检测老年患者的虚弱状况:初级保健中的横断面观察性研究。
Br J Gen Pract. 2019 Dec 26;70(690):e29-e35. doi: 10.3399/bjgp19X706577. Print 2020 Jan.
多变量预后模型外部验证的样本量考量:一项重抽样研究
Stat Med. 2016 Jan 30;35(2):214-26. doi: 10.1002/sim.6787. Epub 2015 Nov 9.
4
A spline-based tool to assess and visualize the calibration of multiclass risk predictions.一种基于样条的工具,用于评估和可视化多类别风险预测的校准情况。
J Biomed Inform. 2015 Apr;54:283-93. doi: 10.1016/j.jbi.2014.12.016. Epub 2015 Jan 9.
5
Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement.透明报告个体预后或诊断的多变量预测模型(TRIPOD):TRIPOD 声明。
Ann Intern Med. 2015 Jan 6;162(1):55-63. doi: 10.7326/M14-0697.
6
Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers.使用局部加权回归平滑法对逻辑回归模型的内部和外部校准进行图形评估。
Stat Med. 2014 Feb 10;33(3):517-35. doi: 10.1002/sim.5941. Epub 2013 Aug 23.
7
Presentation of multivariate data for clinical use: The Framingham Study risk score functions.用于临床的多变量数据呈现:弗雷明汉研究风险评分函数。
Stat Med. 2004 May 30;23(10):1631-60. doi: 10.1002/sim.1742.
8
The meaning and use of the area under a receiver operating characteristic (ROC) curve.接受者操作特征(ROC)曲线下面积的意义及应用。
Radiology. 1982 Apr;143(1):29-36. doi: 10.1148/radiology.143.1.7063747.
9
Basic principles of ROC analysis.ROC分析的基本原理。
Semin Nucl Med. 1978 Oct;8(4):283-98. doi: 10.1016/s0001-2998(78)80014-2.