Visualizing and assessing discrimination in the logistic regression model.

Affiliation

MRC Clinical Trials Unit, 222 Euston Road, London NW1 2DA, UK.

Publication information

Stat Med. 2010 Oct 30;29(24):2508-20. doi: 10.1002/sim.3994.

PMID: 20641144

Abstract

Logistic regression models are widely used in medicine for predicting patient outcome (prognosis) and constructing diagnostic tests (diagnosis). Multivariable logistic models yield an (approximately) continuous risk score, a transformation of which gives the estimated event probability for an individual. A key aspect of model performance is discrimination, that is, the model's ability to distinguish between patients who have (or will have) an event of interest and those who do not (or will not). Graphical aids are important in understanding a logistic model. The receiver-operating characteristic (ROC) curve is familiar, but not necessarily easy to interpret. We advocate a simple graphic that provides further insight into discrimination, namely a histogram or dot plot of the risk score in the outcome groups. The most popular performance measure for the logistic model is the c-index, numerically equivalent to the area under the ROC curve. We discuss the comparative merits of the c-index and the (standardized) mean difference in risk score between the outcome groups. The latter statistic, sometimes known generically as the effect size, has been computed in slightly different ways by several different authors, including Glass, Cohen and Hedges. An alternative measure is the overlap between the distributions in the outcome groups, defined as the area under the minimum of the two density functions. The larger the overlap, the weaker the discrimination. Under certain assumptions about the distribution of the risk score, the c-index, effect size and overlap are functionally related. We illustrate the ideas with simulated and real data sets.
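The three measures named in the abstract can be illustrated on simulated risk scores. The sketch below is not the authors' code. It assumes the risk score is normally distributed with equal variance in both outcome groups, which is one concrete choice for the "certain assumptions" the abstract mentions. Under that assumption the c-index equals Φ(d/√2) and the overlap equals 2Φ(−|d|/2), where d is the standardized mean difference and Φ is the standard normal distribution function. The effect size here uses the pooled standard deviation (the Cohen-style version); the function names, sample sizes and seed are all illustrative.

```python
import math
import random
from statistics import NormalDist, mean, variance


def c_index(events, non_events):
    """Share of (event, non-event) pairs ranked correctly; ties count 1/2."""
    concordant = 0.0
    for x in events:
        for y in non_events:
            if x > y:
                concordant += 1.0
            elif x == y:
                concordant += 0.5
    return concordant / (len(events) * len(non_events))


def effect_size(events, non_events):
    """Mean difference in risk score divided by the pooled standard deviation."""
    n1, n0 = len(events), len(non_events)
    pooled_var = (
        (n1 - 1) * variance(events) + (n0 - 1) * variance(non_events)
    ) / (n1 + n0 - 2)
    return (mean(events) - mean(non_events)) / math.sqrt(pooled_var)


def c_from_d(d):
    """c-index implied by d when both groups are normal with equal variance."""
    return NormalDist().cdf(d / math.sqrt(2))


def overlap_from_d(d):
    """Area under the minimum of two equal-variance normal densities."""
    return 2 * NormalDist().cdf(-abs(d) / 2)


# Simulated risk scores (assumption: unit variance, true mean difference of 1).
rng = random.Random(0)
non_events = [rng.gauss(0.0, 1.0) for _ in range(500)]
events = [rng.gauss(1.0, 1.0) for _ in range(500)]

d_hat = effect_size(events, non_events)
c_hat = c_index(events, non_events)

print(f"effect size d:          {d_hat:.3f}")
print(f"c-index (empirical):    {c_hat:.3f}")
print(f"c-index (implied by d): {c_from_d(d_hat):.3f}")
print(f"overlap (implied by d): {overlap_from_d(d_hat):.3f}")
```

When the normality or equal-variance assumption fails, the empirical c-index remains valid as a rank-based measure, but the two conversion formulas should be treated only as approximations.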

Similar articles

A discussion of calibration techniques for evaluating binary and categorical predictive models.

Prev Vet Med. 2018 Jan 1;149:107-114. doi: 10.1016/j.prevetmed.2017.11.018. Epub 2017 Nov 24.

Improved ischemic stroke outcome prediction using model estimation of outcome probability: the THRIVE-c calculation.

Int J Stroke. 2015 Aug;10(6):815-21. doi: 10.1111/ijs.12529. Epub 2015 Jun 4.

Sternal wound infection after coronary artery bypass graft surgery: validation of existing risk scores.

J Thorac Cardiovasc Surg. 2007 Feb;133(2):397-403. doi: 10.1016/j.jtcvs.2006.10.012.

Risk scoring system and predictor for clinically relevant pancreatic fistula after pancreaticoduodenectomy.

World J Gastroenterol. 2015 May 21;21(19):5926-33. doi: 10.3748/wjg.v21.i19.5926.

A global goodness-of-fit test for receiver operating characteristic curve analysis via the bootstrap method.

J Biomed Inform. 2005 Oct;38(5):395-403. doi: 10.1016/j.jbi.2005.02.004. Epub 2005 Mar 9.

A practical scoring system for predicting cirrhosis in patients with chronic viral hepatitis.

Hepatogastroenterology. 2012 Nov-Dec;59(120):2592-7. doi: 10.5754/hge10157.

New metrics for assessing diagnostic potential of candidate biomarkers.

Clin J Am Soc Nephrol. 2012 Aug;7(8):1355-64. doi: 10.2215/CJN.09590911. Epub 2012 Jun 7.

Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study.

J Clin Oncol. 2015 Oct 1;33(28):3144-51. doi: 10.1200/JCO.2014.59.1339. Epub 2015 Aug 3.

The Abdominal Aortic Aneurysm Statistically Corrected Operative Risk Evaluation (AAA SCORE) for predicting mortality after open and endovascular interventions.

J Vasc Surg. 2015 Jan;61(1):35-43. doi: 10.1016/j.jvs.2014.06.002. Epub 2014 Jun 28.

Cited by

Machine Learning-Based Explainable Automated Nonlinear Computation Scoring System for Health Score and an Application for Prediction of Perioperative Stroke: Retrospective Study.

J Med Internet Res. 2025 Mar 19;27:e58021. doi: 10.2196/58021.

From one size fits all to a tailored approach: integrating precision medicine into medical education.

BMC Med Educ. 2025 Jan 18;25(1):90. doi: 10.1186/s12909-024-06138-y.

Predicting periprosthetic joint infection: external validation of preoperative prediction models.

J Bone Jt Infect. 2024 Oct 25;9(5):231-239. doi: 10.5194/jbji-9-231-2024. eCollection 2024.

Factors associated with COVID-19 infection in pregnant women: Focusing on maternal anxiety.

PLoS One. 2024 Oct 24;19(10):e0312300. doi: 10.1371/journal.pone.0312300. eCollection 2024.

A systematic review of clinical and biomechanical engineering perspectives on the prediction of restenosis in coronary and peripheral arteries.

JVS Vasc Sci. 2023 Sep 15;4:100128. doi: 10.1016/j.jvssci.2023.100128. eCollection 2023.

Data completeness and consistency in individual medical records of institutional births: retrospective crossectional study from Northwest Ethiopia, 2022.

BMC Health Serv Res. 2023 Oct 31;23(1):1189. doi: 10.1186/s12913-023-10127-0.

Estimating postoperative mortality in colorectal surgery- a systematic review of risk prediction models.

Int J Colorectal Dis. 2023 Jun 1;38(1):155. doi: 10.1007/s00384-023-04455-0.

Transparent reporting of multivariable prediction models developed or validated using clustered data (TRIPOD-Cluster): explanation and elaboration.

BMJ. 2023 Feb 7;380:e071058. doi: 10.1136/bmj-2022-071058.

Multilevel modelling for measuring interaction of effects between multiple categorical variables: An illustrative application using risk factors for preeclampsia.

Paediatr Perinat Epidemiol. 2023 Feb;37(2):154-164. doi: 10.1111/ppe.12932. Epub 2022 Nov 10.

Development, validation and clinical utility of a risk prediction model for adverse pregnancy outcomes in women with gestational diabetes: The PeRSonal GDM model.

EClinicalMedicine. 2022 Sep 5;52:101637. doi: 10.1016/j.eclinm.2022.101637. eCollection 2022 Oct.
