预测模型中连续变量分类的新方法：提议与验证。

A new approach to categorising continuous variables in prediction models: Proposal and validation.

作者信息

Barrio Irantzu, Arostegui Inmaculada, Rodríguez-Álvarez María-Xosé, Quintana José-María

机构信息

1 Departamento de Matemática Aplicada, Estadística e Investigación Operativa, Universidad del País Vasco UPV/EHU, Leioa, Spain.

2 Red de Investigación en Servicios de Salud en Enfermedades Crónicas (REDISSEC), Galdakao, Spain.

出版信息

Stat Methods Med Res. 2017 Dec;26(6):2586-2602. doi: 10.1177/0962280215601873. Epub 2015 Sep 18.

DOI:10.1177/0962280215601873

PMID:26384514

Abstract

When developing prediction models for application in clinical practice, health practitioners usually categorise clinical variables that are continuous in nature. Although categorisation is not regarded as advisable from a statistical point of view, due to loss of information and power, it is a common practice in medical research. Consequently, providing researchers with a useful and valid categorisation method could be a relevant issue when developing prediction models. Without recommending categorisation of continuous predictors, our aim is to propose a valid way to do it whenever it is considered necessary by clinical researchers. This paper focuses on categorising a continuous predictor within a logistic regression model, in such a way that the best discriminative ability is obtained in terms of the highest area under the receiver operating characteristic curve (AUC). The proposed methodology is validated when the optimal cut points' location is known in theory or in practice. In addition, the proposed method is applied to a real data-set of patients with an exacerbation of chronic obstructive pulmonary disease, in the context of the IRYSS-COPD study where a clinical prediction rule for severe evolution was being developed. The clinical variable PCO was categorised in a univariable and a multivariable setting.

摘要

在开发用于临床实践的预测模型时，医疗从业者通常会对本质上为连续型的临床变量进行分类。尽管从统计学角度来看，分类并不被认为是可取的，因为会损失信息和效能，但它在医学研究中却是常见的做法。因此，在开发预测模型时，为研究人员提供一种有用且有效的分类方法可能是一个相关问题。在不建议对连续预测变量进行分类的情况下，我们的目标是提出一种有效的方法，以便在临床研究人员认为必要时进行分类。本文重点关注在逻辑回归模型中对连续预测变量进行分类，从而根据受试者工作特征曲线（AUC）下的最高面积获得最佳判别能力。当理论上或实践中知道最佳切点的位置时，对所提出的方法进行验证。此外，在所开展的IRYSS - COPD研究（正在制定严重病情进展的临床预测规则）背景下，将所提出的方法应用于慢性阻塞性肺疾病急性加重患者的真实数据集。临床变量PCO在单变量和多变量设置中进行了分类。

相似文献

A new approach to categorising continuous variables in prediction models: Proposal and validation.

Stat Methods Med Res. 2017 Dec;26(6):2586-2602. doi: 10.1177/0962280215601873. Epub 2015 Sep 18.

Use of generalised additive models to categorise continuous variables in clinical prediction.

BMC Med Res Methodol. 2013 Jun 26;13:83. doi: 10.1186/1471-2288-13-83.

Development of the ProPal-COPD tool to identify patients with COPD for proactive palliative care.

Int J Chron Obstruct Pulmon Dis. 2017 Jul 20;12:2121-2128. doi: 10.2147/COPD.S140037. eCollection 2017.

A score to predict short-term risk of COPD exacerbations (SCOPEX).

Int J Chron Obstruct Pulmon Dis. 2015 Jan 27;10:201-9. doi: 10.2147/COPD.S69589. eCollection 2015.

Prediction of thoracic injury severity in frontal impacts by selected anatomical morphomic variables through model-averaged logistic regression approach.

Accid Anal Prev. 2013 Nov;60:172-80. doi: 10.1016/j.aap.2013.08.020. Epub 2013 Sep 5.

A decision tree to assess short-term mortality after an emergency department visit for an exacerbation of COPD: a cohort study.

Respir Res. 2015 Dec 22;16:151. doi: 10.1186/s12931-015-0313-4.

Development and validation of a claims-based prediction model for COPD severity.

Respir Med. 2013 Oct;107(10):1568-77. doi: 10.1016/j.rmed.2013.05.012. Epub 2013 Jun 25.

Machine learning algorithms and forced oscillation measurements to categorise the airway obstruction severity in chronic obstructive pulmonary disease.

Comput Methods Programs Biomed. 2015 Feb;118(2):186-97. doi: 10.1016/j.cmpb.2014.11.002. Epub 2014 Nov 22.

Prognostic severity scores for patients with COPD exacerbations attending emergency departments.

Int J Tuberc Lung Dis. 2014 Dec;18(12):1415-20. doi: 10.5588/ijtld.14.0312.

Optimal combination of biomarkers for time-dependent receiver operating characteristic estimation and related problems.

Stat Methods Med Res. 2017 Apr;26(2):898-913. doi: 10.1177/0962280214561506. Epub 2014 Nov 28.

引用本文的文献

Diagnostic systematic review and meta-analysis of machine learning in predicting biochemical recurrence of prostate cancer.

Sci Rep. 2025 Aug 4;15(1):28378. doi: 10.1038/s41598-025-11445-5.

The Association Between Elevated Estimated Glomerular Filtration Rate and Poor Clinical Outcomes of Pediatric Patients with Community-Acquired Bacterial Meningitis Receiving Vancomycin: A Ten-Year Retrospective Cohort Study.

Infect Dis Ther. 2025 Jul 18. doi: 10.1007/s40121-025-01196-1.

Development and validation of a visual prediction model for severe acute pancreatitis: a retrospective study.

Front Med (Lausanne). 2025 Jul 2;12:1564742. doi: 10.3389/fmed.2025.1564742. eCollection 2025.

Predictive Ability of Previous Pain and Disease Conditions on the Presentation of Post-COVID Pain in a Danish Cohort of Adult COVID-19 Survivors.

Eur J Pain. 2025 May;29(5):e70021. doi: 10.1002/ejp.70021.

Discretizing multiple continuous predictors with U-shaped relationships with lnOR: introducing the recursive gradient scanning method in clinical and epidemiological research.

BMC Med Res Methodol. 2025 Mar 12;25(1):70. doi: 10.1186/s12874-025-02522-4.

Discarded intravenous medication in the ICU: the GAME-OVER multicenter prospective observational study.

Crit Care. 2025 Feb 21;29(1):84. doi: 10.1186/s13054-025-05299-6.

Data-Sharing Statements Requested from Clinical Trials by Public, Environmental, and Occupational Health Journals: Cross-Sectional Study.

J Med Internet Res. 2025 Feb 7;27:e64069. doi: 10.2196/64069.

Clinicopathologic predictors of renal response and survival in newly diagnosed multiple myeloma with renal injury: a retrospective study.

Clin Exp Med. 2025 Feb 4;25(1):48. doi: 10.1007/s10238-025-01571-9.

Comprehensive predictive model for cerebral microbleeds: integrating clinical and biochemical markers.

Front Neurosci. 2024 Dec 13;18:1429088. doi: 10.3389/fnins.2024.1429088. eCollection 2024.

Integrating clinical and biochemical markers: a novel nomogram for predicting lacunes in cerebral small vessel disease.

Front Aging Neurosci. 2024 Aug 23;16:1404836. doi: 10.3389/fnagi.2024.1404836. eCollection 2024.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

预测模型中连续变量分类的新方法：提议与验证。

A new approach to categorising continuous variables in prediction models: Proposal and validation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献