• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估重症监护中死亡率基准的校准:重新审视霍斯默-莱梅肖检验。

Assessing the calibration of mortality benchmarks in critical care: The Hosmer-Lemeshow test revisited.

作者信息

Kramer Andrew A, Zimmerman Jack E

机构信息

Cerner Corporation, Vienna, VA, USA.

出版信息

Crit Care Med. 2007 Sep;35(9):2052-6. doi: 10.1097/01.CCM.0000275267.64078.B0.

DOI:10.1097/01.CCM.0000275267.64078.B0
PMID:17568333
Abstract

OBJECTIVE

To examine the Hosmer-Lemeshow test's sensitivity in evaluating the calibration of models predicting hospital mortality in large critical care populations.

DESIGN

Simulation study.

SETTING

Intensive care unit databases used for predictive modeling.

PATIENTS

Data sets were simulated representing the approximate number of patients used in earlier versions of critical care predictive models (n = 5,000 and 10,000) and more recent predictive models (n = 50,000). Each patient had a hospital mortality probability generated as a function of 23 risk variables.

INTERVENTIONS

None.

MEASUREMENTS AND MAIN RESULTS

Data sets of 5,000, 10,000, and 50,000 patients were replicated 1,000 times. Logistic regression models were evaluated for each simulated data set. This process was initially carried out under conditions of perfect fit (observed mortality = predicted mortality; standardized mortality ratio = 1.000) and repeated with an observed mortality that differed slightly (0.4%) from predicted mortality. Under conditions of perfect fit, the Hosmer-Lemeshow test was not influenced by the number of patients in the data set. In situations where there was a slight deviation from perfect fit, the Hosmer-Lemeshow test was sensitive to sample size. For populations of 5,000 patients, 10% of the Hosmer-Lemeshow tests were significant at p < .05, whereas for 10,000 patients 34% of the Hosmer-Lemeshow tests were significant at p < .05. When the number of patients matched contemporary studies (i.e., 50,000 patients), the Hosmer-Lemeshow test was statistically significant in 100% of the models.

CONCLUSIONS

Caution should be used in interpreting the calibration of predictive models developed using a smaller data set when applied to larger numbers of patients. A significant Hosmer-Lemeshow test does not necessarily mean that a predictive model is not useful or suspect. While decisions concerning a mortality model's suitability should include the Hosmer-Lemeshow test, additional information needs to be taken into consideration. This includes the overall number of patients, the observed and predicted probabilities within each decile, and adjunct measures of model calibration.

摘要

目的

检验霍斯默-莱梅肖检验在评估预测大型重症监护人群医院死亡率模型的校准方面的敏感性。

设计

模拟研究。

设置

用于预测建模的重症监护病房数据库。

患者

模拟数据集,代表早期重症监护预测模型(n = 5000和10000)及近期预测模型(n = 50000)中使用的患者大致数量。每位患者的医院死亡概率根据23个风险变量生成。

干预措施

无。

测量指标及主要结果

对5000、10000和50000例患者的数据集进行1000次重复。对每个模拟数据集评估逻辑回归模型。此过程最初在完美拟合条件下(观察到的死亡率 = 预测的死亡率;标准化死亡率比 = 1.000)进行,并在观察到的死亡率与预测死亡率略有差异(0.4%)时重复进行。在完美拟合条件下,霍斯默-莱梅肖检验不受数据集中患者数量的影响。在与完美拟合略有偏差的情况下,霍斯默-莱梅肖检验对样本量敏感。对于5000例患者的群体,10%的霍斯默-莱梅肖检验在p < 0.05时具有显著性,而对于10000例患者,则有34%的霍斯默-莱梅肖检验在p < 0.05时具有显著性。当患者数量与当代研究匹配(即50000例患者)时,100%的模型中霍斯默-莱梅肖检验具有统计学显著性。

结论

在将使用较小数据集开发的预测模型应用于更多患者时,解释其校准情况时应谨慎。霍斯默-莱梅肖检验具有显著性并不一定意味着预测模型无用或可疑。虽然关于死亡率模型适用性的决策应包括霍斯默-莱梅肖检验,但还需要考虑其他信息。这包括患者总数、每个十分位数内的观察到的和预测的概率,以及模型校准的辅助指标。

相似文献

1
Assessing the calibration of mortality benchmarks in critical care: The Hosmer-Lemeshow test revisited.评估重症监护中死亡率基准的校准:重新审视霍斯默-莱梅肖检验。
Crit Care Med. 2007 Sep;35(9):2052-6. doi: 10.1097/01.CCM.0000275267.64078.B0.
2
Assessing contemporary intensive care unit outcome: an updated Mortality Probability Admission Model (MPM0-III).评估当代重症监护病房的预后:更新的死亡概率入院模型(MPM0-III)。
Crit Care Med. 2007 Mar;35(3):827-35. doi: 10.1097/01.CCM.0000257337.63529.9F.
3
Subgroup mortality probability models: are they necessary for specialized intensive care units?亚组死亡概率模型:它们对专科重症监护病房来说是必要的吗?
Crit Care Med. 2009 Aug;37(8):2375-86. doi: 10.1097/CCM.0b013e3181a12851.
4
Acute Physiology and Chronic Health Evaluation (APACHE) IV: hospital mortality assessment for today's critically ill patients.急性生理学与慢性健康状况评估(APACHE)IV:当今危重症患者的医院死亡率评估
Crit Care Med. 2006 May;34(5):1297-310. doi: 10.1097/01.CCM.0000215112.84523.F0.
5
One model, several results: the paradox of the Hosmer-Lemeshow goodness-of-fit test for the logistic regression model.一个模型,多个结果:逻辑回归模型的霍斯默-莱梅肖拟合优度检验的悖论。
J Epidemiol Biostat. 2000;5(4):251-3.
6
Mortality and length-of-stay outcomes, 1993-2003, in the binational Australian and New Zealand intensive care adult patient database.1993年至2003年,澳大利亚和新西兰成人重症监护患者双边数据库中的死亡率和住院时间结果。
Crit Care Med. 2008 Jan;36(1):46-61. doi: 10.1097/01.CCM.0000295313.08084.58.
7
Veterans Affairs intensive care unit risk adjustment model: validation, updating, recalibration.退伍军人事务部重症监护病房风险调整模型:验证、更新、重新校准
Crit Care Med. 2008 Apr;36(4):1031-42. doi: 10.1097/CCM.0b013e318169f290.
8
The acute physiology and chronic health evaluation III outcome prediction in patients admitted to the intensive care unit after pneumonectomy.肺切除术后入住重症监护病房患者的急性生理学与慢性健康状况评估III结局预测
J Cardiothorac Vasc Anesth. 2007 Dec;21(6):832-7. doi: 10.1053/j.jvca.2006.12.005. Epub 2007 Mar 6.
9
Validation of pediatric index of mortality 2 (PIM2) in a single pediatric intensive care unit of Argentina.阿根廷一家儿科重症监护病房中儿童死亡指数2(PIM2)的验证
Pediatr Crit Care Med. 2007 Jan;8(1):54-7. doi: 10.1097/01.pcc.0000256619.78382.93.
10
[Validation of the EuroSCORE probabilistic model in patients undergoing coronary bypass grafting].[欧洲心脏手术风险评估系统(EuroSCORE)概率模型在冠状动脉搭桥手术患者中的验证]
Rev Esp Cardiol. 2008 Jun;61(6):589-94.

引用本文的文献

1
Predicting prognosis of patients with hepatitis B virus-related acute-on-chronic liver failure from longitudinal ultrasound images using a multi-task deep learning approach.使用多任务深度学习方法从纵向超声图像预测乙型肝炎病毒相关慢加急性肝衰竭患者的预后
Ann Med. 2025 Dec;57(1):2551819. doi: 10.1080/07853890.2025.2551819. Epub 2025 Aug 26.
2
Harnessing Radiomics and Explainable AI for the Classification of Usual and Nonspecific Interstitial Pneumonia.利用放射组学和可解释人工智能对普通型和非特异性间质性肺炎进行分类。
J Clin Med. 2025 Jul 11;14(14):4934. doi: 10.3390/jcm14144934.
3
A novel nomogram for predicting prolonged disorders of consciousness in severe supratentorial hypertensive intracerebral hemorrhage patients.
一种用于预测重症幕上高血压脑出血患者意识障碍持续时间的新型列线图。
Sci Rep. 2025 Jul 17;15(1):25911. doi: 10.1038/s41598-025-11798-x.
4
Development and validation of a clinical model to predict low-grade intraepithelial neoplasia in chronic atrophic gastritis patients: a retrospective observational multicenter analysis.预测慢性萎缩性胃炎患者低级别上皮内瘤变的临床模型的开发与验证:一项回顾性观察性多中心分析
Front Oncol. 2025 Jun 4;15:1597099. doi: 10.3389/fonc.2025.1597099. eCollection 2025.
5
CT-based radiomics for prediction of response to neoadjuvant immunochemotherapy in patients with esophageal carcinoma.基于CT的影像组学在预测食管癌患者新辅助免疫化疗反应中的应用
Front Oncol. 2025 May 12;15:1511691. doi: 10.3389/fonc.2025.1511691. eCollection 2025.
6
Performance of PREVENT and pooled cohort equations for predicting 10-Year ASCVD risk in the UK Biobank.PREVENT及汇总队列方程在英国生物银行中预测10年动脉粥样硬化性心血管疾病(ASCVD)风险的性能。
Am J Prev Cardiol. 2025 May 18;22:101009. doi: 10.1016/j.ajpc.2025.101009. eCollection 2025 Jun.
7
Predicting risk of treatment non-adherence in patients receiving intense light therapy: development and evaluation of a new predictive nomogram.预测接受强光治疗患者的治疗不依从风险:一种新的预测列线图的开发与评估
Lasers Med Sci. 2025 Jun 11;40(1):268. doi: 10.1007/s10103-025-04524-6.
8
Prediction of Ki-67 expression in hepatocellular carcinoma with machine learning models based on intratumoral and peritumoral radiomic features.基于肿瘤内和肿瘤周围放射组学特征的机器学习模型预测肝细胞癌中Ki-67的表达
World J Gastrointest Oncol. 2025 May 15;17(5):104172. doi: 10.4251/wjgo.v17.i5.104172.
9
Construction of a predictive model for relapse of primary autoimmune hemolytic anemia: a retrospective cohort study.原发性自身免疫性溶血性贫血复发预测模型的构建:一项回顾性队列研究。
Ann Med. 2025 Dec;57(1):2506482. doi: 10.1080/07853890.2025.2506482. Epub 2025 May 22.
10
Nomogram model for identifying portal vein thrombosis in patients with decompensated cirrhosis.用于识别失代偿期肝硬化患者门静脉血栓形成的列线图模型
Eur J Gastroenterol Hepatol. 2025 Aug 1;37(8):935-942. doi: 10.1097/MEG.0000000000002968. Epub 2025 Mar 26.