Suppr
超能文献

在多变量诊断研究中，缺失值插补优于完全病例分析和缺失指标法：一个临床实例。

Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example.

作者信息

van der Heijden Geert J M G, Donders A Rogier T, Stijnen Theo, Moons Karel G M

机构信息

Julius Center for Health Sciences and Primary Care, University Medical Center, P.O. Box 80035, 3508 GA Utrecht, The Netherlands.

出版信息

J Clin Epidemiol. 2006 Oct;59(10):1102-9. doi: 10.1016/j.jclinepi.2006.01.015. Epub 2006 Jul 11.

DOI:10.1016/j.jclinepi.2006.01.015

PMID:16980151

Abstract

BACKGROUND AND OBJECTIVES

To illustrate the effects of different methods for handling missing data--complete case analysis, missing-indicator method, single imputation of unconditional and conditional mean, and multiple imputation (MI)--in the context of multivariable diagnostic research aiming to identify potential predictors (test results) that independently contribute to the prediction of disease presence or absence.

METHODS

We used data from 398 subjects from a prospective study on the diagnosis of pulmonary embolism. Various diagnostic predictors or tests had (varying percentages of) missing values. Per method of handling these missing values, we fitted a diagnostic prediction model using multivariable logistic regression analysis.

RESULTS

The receiver operating characteristic curve area for all diagnostic models was above 0.75. The predictors in the final models based on the complete case analysis, and after using the missing-indicator method, were very different compared to the other models. The models based on MI did not differ much from the models derived after using single conditional and unconditional mean imputation.

CONCLUSION

In multivariable diagnostic research complete case analysis and the use of the missing-indicator method should be avoided, even when data are missing completely at random. MI methods are known to be superior to single imputation methods. For our example study, the single imputation methods performed equally well, but this was most likely because of the low overall number of missing values.

摘要

背景与目的

在多变量诊断研究中，旨在识别独立有助于预测疾病存在与否的潜在预测因素（检测结果），阐述处理缺失数据的不同方法——完整病例分析、缺失指标法、无条件和有条件均值的单一插补以及多重插补（MI）的效果。

方法

我们使用了来自一项关于肺栓塞诊断的前瞻性研究中398名受试者的数据。各种诊断预测因素或检测存在（不同百分比的）缺失值。对于处理这些缺失值的每种方法，我们使用多变量逻辑回归分析拟合了一个诊断预测模型。

结果

所有诊断模型的受试者工作特征曲线面积均高于0.75。基于完整病例分析以及使用缺失指标法后最终模型中的预测因素，与其他模型相比差异很大。基于MI的模型与使用单一条件和无条件均值插补后得出的模型差异不大。

结论

在多变量诊断研究中，即使数据是完全随机缺失的，也应避免完整病例分析和使用缺失指标法。已知MI方法优于单一插补方法。对于我们的示例研究，单一插补方法表现同样良好，但这很可能是因为总体缺失值数量较少。

相似文献

Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example.

J Clin Epidemiol. 2006 Oct;59(10):1102-9. doi: 10.1016/j.jclinepi.2006.01.015. Epub 2006 Jul 11.

Using the outcome for imputation of missing predictor values was preferred.

J Clin Epidemiol. 2006 Oct;59(10):1092-101. doi: 10.1016/j.jclinepi.2006.01.009. Epub 2006 Jun 19.

Missing data on the Center for Epidemiologic Studies Depression Scale: a comparison of 4 imputation techniques.

Res Social Adm Pharm. 2007 Mar;3(1):1-27. doi: 10.1016/j.sapharm.2006.04.001.

Dealing with missing data in a multi-question depression scale: a comparison of imputation methods.

BMC Med Res Methodol. 2006 Dec 13;6:57. doi: 10.1186/1471-2288-6-57.

Unpredictable bias when using the missing indicator method or complete case analysis for missing confounder values: an empirical example.

J Clin Epidemiol. 2010 Jul;63(7):728-36. doi: 10.1016/j.jclinepi.2009.08.028. Epub 2010 Mar 25.

The validity of using multiple imputation for missing out-of-hospital data in a state trauma registry.

Acad Emerg Med. 2006 Mar;13(3):314-24. doi: 10.1197/j.aem.2005.09.011. Epub 2006 Feb 22.

Methods for handling missing data in palliative care research.

Palliat Med. 2006 Dec;20(8):791-8. doi: 10.1177/0269216306072555.

Comparison of methods of handling missing data in individual patient data meta-analyses: an empirical example on antibiotics in children with acute otitis media.

Am J Epidemiol. 2008 Mar 1;167(5):540-5. doi: 10.1093/aje/kwm341. Epub 2008 Jan 9.

Missing data in the American College of Surgeons National Surgical Quality Improvement Program are not missing at random: implications and potential impact on quality assessments.

J Am Coll Surg. 2010 Feb;210(2):125-139.e2. doi: 10.1016/j.jamcollsurg.2009.10.021.

Missing covariate data in medical research: to impute is better than to ignore.

J Clin Epidemiol. 2010 Jul;63(7):721-7. doi: 10.1016/j.jclinepi.2009.12.008. Epub 2010 Mar 24.

引用本文的文献

Quantitative lung ultrasound to guide surfactant retreatment in preterm neonates born at ≤30 weeks' gestation: a multicentre retrospective non-inferiority diagnostic accuracy study.

EBioMedicine. 2025 Jul 25;118:105865. doi: 10.1016/j.ebiom.2025.105865.

Interchangeability of patient pain, fatigue and global scores in patients with spondyloarthritis - a registry-based simulation study.

BMC Rheumatol. 2025 Jul 1;9(1):75. doi: 10.1186/s41927-025-00527-6.

Incorporation of missing indicator with multiple imputation in propensity score analysis with partially observed covariates: A simulation study.

Stat Methods Med Res. 2025 Jul;34(7):1293-1302. doi: 10.1177/09622802251338365. Epub 2025 Jun 19.

Enrichment in Colorectal Tumor Tissue: Associations With Tumor Characteristics and Survival Outcomes.

Gastro Hep Adv. 2025 Feb 20;4(6):100644. doi: 10.1016/j.gastha.2025.100644. eCollection 2025.

Cardiovascular Health in Cardiac Rehabilitation: Applying the American Heart Association Life's Simple 7 Framework in a Center-Based Cohort.

J Am Heart Assoc. 2025 Jun 17;14(12):e039010. doi: 10.1161/JAHA.124.039010. Epub 2025 Jun 5.

Model for Musculoskeletal Injury Risk Factors Among US Army Basic Combat Trainees.

JAMA Netw Open. 2025 Jun 2;8(6):e2513177. doi: 10.1001/jamanetworkopen.2025.13177.

Dietary inflammation and its impact on congestive heart failure in older adults with depression.

Sci Rep. 2025 May 10;15(1):16301. doi: 10.1038/s41598-025-98279-3.

Morbidity prediction in conservatively managed rib fracture patients.

Eur J Trauma Emerg Surg. 2025 Apr 29;51(1):184. doi: 10.1007/s00068-025-02860-4.

Rigorous validation of machine learning in laboratory medicine: guidance toward quality improvement.

Crit Rev Clin Lab Sci. 2025 Aug;62(5):327-346. doi: 10.1080/10408363.2025.2488842. Epub 2025 Apr 17.

Diagnostic accuracy of the Scandinavian guidelines for minor and moderate head trauma in children: a prospective, pragmatic, validation study.

Lancet Reg Health Eur. 2025 Feb 13;51:101233. doi: 10.1016/j.lanepe.2025.101233. eCollection 2025 Apr.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

在多变量诊断研究中，缺失值插补优于完全病例分析和缺失指标法：一个临床实例。

Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example.

作者信息

机构信息

出版信息

BACKGROUND AND OBJECTIVES

METHODS

RESULTS

CONCLUSION

背景与目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译