• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越歧视:再入院风险预测模型的校准方法和临床实用性比较。

Beyond discrimination: A comparison of calibration methods and clinical usefulness of predictive models of readmission risk.

机构信息

Department of Biomedical Informatics, Vanderbilt University Medical Center, United States; Department of Medicine, Vanderbilt University Medical Center, United States; Department of Psychiatry, Vanderbilt University Medical Center, United States.

Department of Biomedical Informatics, Vanderbilt University Medical Center, United States.

出版信息

J Biomed Inform. 2017 Dec;76:9-18. doi: 10.1016/j.jbi.2017.10.008. Epub 2017 Oct 24.

DOI:10.1016/j.jbi.2017.10.008
PMID:29079501
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5716927/
Abstract

BACKGROUND

Prior to implementing predictive models in novel settings, analyses of calibration and clinical usefulness remain as important as discrimination, but they are not frequently discussed. Calibration is a model's reflection of actual outcome prevalence in its predictions. Clinical usefulness refers to the utilities, costs, and harms of using a predictive model in practice. A decision analytic approach to calibrating and selecting an optimal intervention threshold may help maximize the impact of readmission risk and other preventive interventions.

OBJECTIVES

To select a pragmatic means of calibrating predictive models that requires a minimum amount of validation data and that performs well in practice. To evaluate the impact of miscalibration on utility and cost via clinical usefulness analyses.

MATERIALS AND METHODS

Observational, retrospective cohort study with electronic health record data from 120,000 inpatient admissions at an urban, academic center in Manhattan. The primary outcome was thirty-day readmission for three causes: all-cause, congestive heart failure, and chronic coronary atherosclerotic disease. Predictive modeling was performed via L1-regularized logistic regression. Calibration methods were compared including Platt Scaling, Logistic Calibration, and Prevalence Adjustment. Performance of predictive modeling and calibration was assessed via discrimination (c-statistic), calibration (Spiegelhalter Z-statistic, Root Mean Square Error [RMSE] of binned predictions, Sanders and Murphy Resolutions of the Brier Score, Calibration Slope and Intercept), and clinical usefulness (utility terms represented as costs). The amount of validation data necessary to apply each calibration algorithm was also assessed.

RESULTS

C-statistics by diagnosis ranged from 0.7 for all-cause readmission to 0.86 (0.78-0.93) for congestive heart failure. Logistic Calibration and Platt Scaling performed best and this difference required analyzing multiple metrics of calibration simultaneously, in particular Calibration Slopes and Intercepts. Clinical usefulness analyses provided optimal risk thresholds, which varied by reason for readmission, outcome prevalence, and calibration algorithm. Utility analyses also suggested maximum tolerable intervention costs, e.g., $1720 for all-cause readmissions based on a published cost of readmission of $11,862.

CONCLUSIONS

Choice of calibration method depends on availability of validation data and on performance. Improperly calibrated models may contribute to higher costs of intervention as measured via clinical usefulness. Decision-makers must understand underlying utilities or costs inherent in the use-case at hand to assess usefulness and will obtain the optimal risk threshold to trigger intervention with intervention cost limits as a result.

摘要

背景

在将预测模型应用于新环境之前,校准和临床实用性的分析与区分度同样重要,但目前对此讨论较少。校准是模型对预测中实际结果发生率的反映。临床实用性是指在实践中使用预测模型的效用、成本和危害。通过决策分析方法对校准和选择最佳干预阈值进行分析,可能有助于最大限度地提高再入院风险和其他预防干预措施的效果。

目的

选择一种实用的校准预测模型的方法,该方法只需最少的验证数据,并且在实践中表现良好。通过临床实用性分析评估校准不当对效用和成本的影响。

材料和方法

这是一项基于电子病历数据的观察性、回顾性队列研究,数据来自曼哈顿市一所城市学术中心的 12 万例住院患者。主要结局指标是 30 天内因三种原因(全因、充血性心力衰竭和慢性冠状动脉粥样硬化性疾病)的再入院。通过 L1-正则化逻辑回归进行预测建模。比较了几种校准方法,包括 Platt 缩放、逻辑校准和患病率调整。通过区分度(c 统计量)、校准(Spiegelhalter Z 统计量、分箱预测的均方根误差 [RMSE]、桑德斯和墨菲的 Brier 评分分辨率、校准斜率和截距)和临床实用性(效用项表示为成本)评估预测模型和校准的性能。还评估了应用每种校准算法所需的验证数据量。

结果

根据诊断,c 统计量范围从全因再入院的 0.7 到充血性心力衰竭的 0.86(0.78-0.93)。逻辑校准和 Platt 缩放表现最好,这一差异需要同时分析校准的多个指标,特别是校准斜率和截距。临床实用性分析提供了最佳风险阈值,这些阈值因再入院原因、结果发生率和校准算法而异。效用分析还表明了最大可容忍干预成本,例如,基于发表的再入院成本 11862 美元,全因再入院的最高可容忍干预成本为 1720 美元。

结论

校准方法的选择取决于验证数据的可用性和性能。校准不当的模型可能会导致干预成本增加,这可以通过临床实用性来衡量。决策者必须了解手头用例中固有的效用或成本,以评估效用,并根据干预成本限制获得最佳风险阈值,以触发干预。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/0a1f81089af0/nihms916149f4a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/410cdd923551/nihms916149f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/5a074c68d5cd/nihms916149f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/bfb9489ff154/nihms916149f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/0a1f81089af0/nihms916149f4a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/410cdd923551/nihms916149f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/5a074c68d5cd/nihms916149f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/bfb9489ff154/nihms916149f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8095/5716927/0a1f81089af0/nihms916149f4a.jpg

相似文献

1
Beyond discrimination: A comparison of calibration methods and clinical usefulness of predictive models of readmission risk.超越歧视:再入院风险预测模型的校准方法和临床实用性比较。
J Biomed Inform. 2017 Dec;76:9-18. doi: 10.1016/j.jbi.2017.10.008. Epub 2017 Oct 24.
2
The effects of data sources, cohort selection, and outcome definition on a predictive model of risk of thirty-day hospital readmissions.数据来源、队列选择和结局定义对30天再入院风险预测模型的影响。
J Biomed Inform. 2014 Dec;52:418-26. doi: 10.1016/j.jbi.2014.08.006. Epub 2014 Aug 23.
3
Prediction model for outcome after low-back surgery: individualized likelihood of complication, hospital readmission, return to work, and 12-month improvement in functional disability.腰椎手术后预后的预测模型:并发症、再次入院、恢复工作的个体化可能性以及功能障碍12个月内的改善情况。
Neurosurg Focus. 2015 Dec;39(6):E13. doi: 10.3171/2015.8.FOCUS15338.
4
PREDICTIVE MODELING OF HOSPITAL READMISSION RATES USING ELECTRONIC MEDICAL RECORD-WIDE MACHINE LEARNING: A CASE-STUDY USING MOUNT SINAI HEART FAILURE COHORT.使用全电子病历机器学习对医院再入院率进行预测建模:以西奈山心力衰竭队列为例的研究
Pac Symp Biocomput. 2017;22:276-287. doi: 10.1142/9789813207813_0027.
5
Identifying Potentially Avoidable Readmissions: A Medication-Based 15-Day Readmission Risk Stratification Algorithm.识别潜在可避免的再入院:一种基于药物治疗的15天再入院风险分层算法。
Pharmacotherapy. 2017 Mar;37(3):268-277. doi: 10.1002/phar.1896. Epub 2017 Feb 20.
6
7
Validation of the LACE readmission and mortality prediction model in a large surgical cohort: Comparison of performance at preoperative assessment and discharge time points.在一个大型外科队列中验证 LACE 再入院和死亡率预测模型:术前评估和出院时间点的性能比较。
J Clin Anesth. 2019 Dec;58:22-26. doi: 10.1016/j.jclinane.2019.04.039. Epub 2019 May 2.
8
An automated model to identify heart failure patients at risk for 30-day readmission or death using electronic medical record data.利用电子病历数据建立自动模型识别 30 天内再入院或死亡风险的心力衰竭患者。
Med Care. 2010 Nov;48(11):981-8. doi: 10.1097/MLR.0b013e3181ef60d9.
9
10
Postpartum readmission for hypertension and pre-eclampsia: development and validation of a predictive model.产后因高血压和子痫前期再次入院:预测模型的建立与验证。
BJOG. 2023 Nov;130(12):1531-1540. doi: 10.1111/1471-0528.17572. Epub 2023 Jun 14.

引用本文的文献

1
External Validation of the Veterans Affairs Women Cardiovascular Disease Risk Score to Nonveteran Women.退伍军人事务部女性心血管疾病风险评分在非退伍军人女性中的外部验证
JACC Adv. 2025 Aug 12;4(9):102060. doi: 10.1016/j.jacadv.2025.102060.
2
Zoonotic diseases in China: epidemiological trends, incidence forecasting, and comparative analysis between real-world surveillance data and Global Burden of Disease 2021 estimates.中国的人畜共患病:流行病学趋势、发病率预测以及实际监测数据与《2021年全球疾病负担》估计值之间的比较分析
Infect Dis Poverty. 2025 Jul 4;14(1):60. doi: 10.1186/s40249-025-01335-3.
3
Association of initial national early warning score with clinical deterioration in pulmonary embolism.

本文引用的文献

1
Calibration drift in regression and machine learning models for acute kidney injury.急性肾损伤回归模型和机器学习模型中的校准漂移
J Am Med Inform Assoc. 2017 Nov 1;24(6):1052-1061. doi: 10.1093/jamia/ocx030.
2
Utility of models to predict 28-day or 30-day unplanned hospital readmissions: an updated systematic review.预测28天或30天非计划住院再入院的模型效用:一项更新的系统评价
BMJ Open. 2016 Jun 27;6(6):e011060. doi: 10.1136/bmjopen-2016-011060.
3
The effects of data sources, cohort selection, and outcome definition on a predictive model of risk of thirty-day hospital readmissions.
初始国家早期预警评分与肺栓塞临床恶化的相关性
Thromb J. 2025 May 16;23(1):49. doi: 10.1186/s12959-025-00735-7.
4
A novel method for screening malignant hematological diseases by constructing an optimal machine learning model based on blood cell parameters.一种基于血细胞参数构建最优机器学习模型来筛查恶性血液病的新方法。
BMC Med Inform Decis Mak. 2025 Feb 11;25(1):72. doi: 10.1186/s12911-025-02892-1.
5
Firearm Injury Risk Prediction Among Children Transported by 9-1-1 Emergency Medical Services: A Machine Learning Analysis.911紧急医疗服务运送儿童时的枪支伤害风险预测:一项机器学习分析
Pediatr Emerg Care. 2025 Mar 1;41(3):195-202. doi: 10.1097/PEC.0000000000003314. Epub 2024 Dec 12.
6
Large language model uncertainty proxies: discrimination and calibration for medical diagnosis and treatment.大语言模型不确定性代理:医学诊断与治疗中的辨别与校准
J Am Med Inform Assoc. 2025 Jan 1;32(1):139-149. doi: 10.1093/jamia/ocae254.
7
The construction of a nomogram to predict the prognosis and recurrence risks of UPJO.用于预测肾盂输尿管连接部梗阻(UPJO)预后和复发风险的列线图构建。
Front Pediatr. 2024 Apr 2;12:1376196. doi: 10.3389/fped.2024.1376196. eCollection 2024.
8
An AdaBoost-based algorithm to detect hospital-acquired pressure injury in the presence of conflicting annotations.一种基于 AdaBoost 的算法,用于在存在冲突注释的情况下检测医院获得性压力性损伤。
Comput Biol Med. 2024 Jan;168:107754. doi: 10.1016/j.compbiomed.2023.107754. Epub 2023 Nov 22.
9
Evaluation of available risk scores to predict multiple cardiovascular complications for patients with type 2 diabetes mellitus using electronic health records.利用电子健康记录评估现有风险评分以预测2型糖尿病患者的多种心血管并发症
Comput Methods Programs Biomed Update. 2023;3. doi: 10.1016/j.cmpbup.2022.100087. Epub 2022 Dec 19.
10
Deep Learning With Chest Radiographs for Making Prognoses in Patients With COVID-19: Retrospective Cohort Study.深度学习结合胸部 X 光片对 COVID-19 患者预后的评估:回顾性队列研究。
J Med Internet Res. 2023 Feb 16;25:e42717. doi: 10.2196/42717.
数据来源、队列选择和结局定义对30天再入院风险预测模型的影响。
J Biomed Inform. 2014 Dec;52:418-26. doi: 10.1016/j.jbi.2014.08.006. Epub 2014 Aug 23.
4
Towards better clinical prediction models: seven steps for development and an ABCD for validation.迈向更好的临床预测模型:开发的七个步骤及验证的ABCD法
Eur Heart J. 2014 Aug 1;35(29):1925-31. doi: 10.1093/eurheartj/ehu207. Epub 2014 Jun 4.
5
Combining test statistics and models in bootstrapped model rejection: it is a balancing act.在自举模型拒绝中结合检验统计量和模型:这是一种平衡行为。
BMC Syst Biol. 2014 Apr 17;8:46. doi: 10.1186/1752-0509-8-46.
6
Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician's guide.净重新分类改善:计算、解释和争议:文献综述及临床医生指南。
Ann Intern Med. 2014 Jan 21;160(2):122-31. doi: 10.7326/M13-1522.
7
Medicare beneficiaries most likely to be readmitted.最有可能再次入院的医疗保险受益人。
J Hosp Med. 2013 Nov;8(11):639-41. doi: 10.1002/jhm.2074. Epub 2013 Aug 28.
8
Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers.使用局部加权回归平滑法对逻辑回归模型的内部和外部校准进行图形评估。
Stat Med. 2014 Feb 10;33(3):517-35. doi: 10.1002/sim.5941. Epub 2013 Aug 23.
9
Predicting who will fail early discharge after laparoscopic colorectal surgery with an established enhanced recovery pathway.预测采用既定加速康复路径的腹腔镜结直肠手术后早期出院失败的患者。
Surg Endosc. 2014 Jan;28(1):74-9. doi: 10.1007/s00464-013-3158-2. Epub 2013 Aug 27.
10
Prediction of pneumonia 30-day readmissions: a single-center attempt to increase model performance.预测肺炎 30 天再入院率:单中心尝试提高模型性能。
Respir Care. 2014 Feb;59(2):199-208. doi: 10.4187/respcare.02563. Epub 2013 Aug 13.