• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于评估逻辑回归模型拟合优度的新测试方法和图形工具。

A new test and graphical tool to assess the goodness of fit of logistic regression models.

作者信息

Nattino Giovanni, Finazzi Stefano, Bertolini Guido

机构信息

GiViTI Coordinating Center, Laboratory of Clinical Epidemiology, IRCCS - Istituto di Ricerche Farmacologiche 'Mario Negri', Villa Camozzi, Ranica (BG), Italy.

Laboratoire Matériaux et Phénomènes Quantiques, Université Paris Diderot-Paris 7, Paris, France.

出版信息

Stat Med. 2016 Feb 28;35(5):709-20. doi: 10.1002/sim.6744. Epub 2015 Oct 5.

DOI:10.1002/sim.6744
PMID:26439593
Abstract

A prognostic model is well calibrated when it accurately predicts event rates. This is first determined by testing for goodness of fit with the development dataset. All existing tests and graphic tools designed for the purpose suffer several drawbacks, related mainly to the subgrouping of observations or to heavy dependence on arbitrary parameters. We propose a statistical test and a graphical method to assess the goodness of fit of logistic regression models, obtained through an extension of similar techniques developed for external validation. We analytically computed and numerically verified the distribution of the underlying statistic. Simulations on a set of realistic scenarios show that this test and the well-known Hosmer-Lemeshow approach have similar type I error rates. The main advantage of this new approach is that the relationship between model predictions and outcome rates across the range of probabilities can be represented in the calibration belt plot, together with its statistical confidence. By readily spotting any deviations from the perfect fit, this new graphical tool is designed to identify, during the process of model development, poorly modeled variables that call for further investigation. This is illustrated through an example based on real data.

摘要

当预后模型能够准确预测事件发生率时,它就得到了良好的校准。这首先要通过对开发数据集进行拟合优度检验来确定。为此目的设计的所有现有检验和图形工具都存在几个缺点,主要与观测值的分组或对任意参数的严重依赖有关。我们提出了一种统计检验和一种图形方法,以评估逻辑回归模型的拟合优度,这是通过扩展为外部验证而开发的类似技术获得的。我们对基础统计量的分布进行了解析计算和数值验证。在一组现实场景上的模拟表明,该检验与著名的霍斯默 - 莱梅肖方法具有相似的第一类错误率。这种新方法的主要优点是,在校准带图中可以表示模型预测与概率范围内的结果率之间的关系及其统计置信度。通过容易地发现与完美拟合的任何偏差,这种新的图形工具旨在在模型开发过程中识别需要进一步研究的建模不佳的变量。这通过一个基于实际数据的例子进行了说明。

相似文献

1
A new test and graphical tool to assess the goodness of fit of logistic regression models.一种用于评估逻辑回归模型拟合优度的新测试方法和图形工具。
Stat Med. 2016 Feb 28;35(5):709-20. doi: 10.1002/sim.6744. Epub 2015 Oct 5.
2
A new calibration test and a reappraisal of the calibration belt for the assessment of prediction models based on dichotomous outcomes.一种用于基于二分结果评估预测模型的新校准测试及校准带重新评估。
Stat Med. 2014 Jun 30;33(14):2390-407. doi: 10.1002/sim.6100. Epub 2014 Feb 4.
3
Assessing the goodness of fit of logistic regression models in large samples: A modification of the Hosmer-Lemeshow test.评估大样本中逻辑回归模型的拟合优度:Hosmer-Lemeshow 检验的改进。
Biometrics. 2020 Jun;76(2):549-560. doi: 10.1111/biom.13249. Epub 2020 Apr 6.
4
Limited-information goodness-of-fit testing of diagnostic classification item response models.诊断分类项目反应模型的有限信息拟合优度检验
Br J Math Stat Psychol. 2016 Nov;69(3):225-252. doi: 10.1111/bmsp.12074.
5
Global goodness-of-fit tests in logistic regression with sparse data.稀疏数据逻辑回归中的全局拟合优度检验。
Stat Med. 2002 Dec 30;21(24):3789-801. doi: 10.1002/sim.1421.
6
Standardizing the power of the Hosmer-Lemeshow goodness of fit test in large data sets.在大数据集中标准化 Hosmer-Lemeshow 拟合优度检验的功效。
Stat Med. 2013 Jan 15;32(1):67-80. doi: 10.1002/sim.5525. Epub 2012 Jul 26.
7
A discussion of calibration techniques for evaluating binary and categorical predictive models.关于评估二元和分类预测模型的校准技术的讨论。
Prev Vet Med. 2018 Jan 1;149:107-114. doi: 10.1016/j.prevetmed.2017.11.018. Epub 2017 Nov 24.
8
Assessing the calibration of mortality benchmarks in critical care: The Hosmer-Lemeshow test revisited.评估重症监护中死亡率基准的校准:重新审视霍斯默-莱梅肖检验。
Crit Care Med. 2007 Sep;35(9):2052-6. doi: 10.1097/01.CCM.0000275267.64078.B0.
9
A goodness-of-fit test for the proportional odds regression model.比例优势回归模型的拟合优度检验。
Stat Med. 2013 Jun 15;32(13):2235-49. doi: 10.1002/sim.5645. Epub 2012 Oct 4.
10
One model, several results: the paradox of the Hosmer-Lemeshow goodness-of-fit test for the logistic regression model.一个模型,多个结果:逻辑回归模型的霍斯默-莱梅肖拟合优度检验的悖论。
J Epidemiol Biostat. 2000;5(4):251-3.

引用本文的文献

1
External validation of the Malaria Scoring System in a non-endemic emergency department.疟疾评分系统在非流行区急诊科的外部验证
Intern Emerg Med. 2025 Jul 17. doi: 10.1007/s11739-025-04044-9.
2
A nomogram for predicting tongue collapse in DISE based on UACT findings in OSA patients.基于阻塞性睡眠呼吸暂停患者上气道塌陷时间(UACT)结果预测动态喉镜检查(DISE)中舌塌陷的列线图。
Eur Arch Otorhinolaryngol. 2025 May 5. doi: 10.1007/s00405-025-09420-x.
3
Performance characteristics and complications of an Inter-Changeable Operator Model for intubation in an Australian helicopter emergency medical service.
澳大利亚直升机紧急医疗服务中用于插管的可互换操作模式的性能特征及并发症
Emerg Med Australas. 2025 Jun;37(3):e70052. doi: 10.1111/1742-6723.70052.
4
Performance of Pediatric Risk of Mortality IV in Brazilian PICUs: A Multicenter Prospective Study.巴西重症监护病房中儿童死亡风险IV评分的表现:一项多中心前瞻性研究。
Crit Care Explor. 2025 Mar 28;7(4):e1243. doi: 10.1097/CCE.0000000000001243. eCollection 2025 Apr 1.
5
Development and Validation of a Quantitative Score for the Criteria Clinical Control in Stable COPD Proposed in the Spanish COPD Guidelines (GesEPOC): Results of the EPOCONSUL Audit.西班牙慢性阻塞性肺疾病指南(GesEPOC)中提出的稳定期慢性阻塞性肺疾病临床控制标准定量评分的制定与验证:EPOCONSUL审计结果
J Clin Med. 2025 Jan 22;14(3):707. doi: 10.3390/jcm14030707.
6
Development and internal validation of a prognostic model for predicting tonic-clonic seizures during pregnancy in women with epilepsy.癫痫女性孕期强直阵挛发作预测模型的开发与内部验证
BMC Pregnancy Childbirth. 2024 Dec 31;24(1):887. doi: 10.1186/s12884-024-07112-8.
7
External validation of ID-BactER and Shapiro scores for predicting bacteraemia in the emergency department.ID-BactER和Shapiro评分在急诊科预测菌血症的外部验证
Ther Adv Infect Dis. 2024 Dec 7;11:20499361241304508. doi: 10.1177/20499361241304508. eCollection 2024 Jan-Dec.
8
A novel nomogram to predict the risk of requiring mechanical ventilation in patients with sepsis within 48 hours of admission: a retrospective analysis.一种预测脓毒症患者入院后 48 小时内需要机械通气风险的新列线图:一项回顾性分析。
PeerJ. 2024 Nov 1;12:e18500. doi: 10.7717/peerj.18500. eCollection 2024.
9
Optimizing anesthesia management based on early identification of electroencephalogram burst suppression risk in non-cardiac surgery patients: a visualized dynamic nomogram.基于非心脏手术患者脑电爆发抑制风险的早期识别优化麻醉管理:可视化动态列线图。
Ann Med. 2024 Dec;56(1):2407067. doi: 10.1080/07853890.2024.2407067. Epub 2024 Sep 24.
10
Clinical utility of an artificial intelligence radiomics-based tool for risk stratification of pulmonary nodules.人工智能影像组学工具在肺结节风险分层中的临床应用。
JNCI Cancer Spectr. 2024 Sep 2;8(5). doi: 10.1093/jncics/pkae086.