使用自助法改进通过向后变量消除法选择的回归系数的估计和置信区间。

Using the bootstrap to improve estimation and confidence intervals for regression coefficients selected using backwards variable elimination.

作者信息

Austin Peter C

机构信息

Institute for Clinical Evaluative Sciences, Toronto, Ont., Canada.

出版信息

Stat Med. 2008 Jul 30;27(17):3286-300. doi: 10.1002/sim.3104.

DOI:10.1002/sim.3104

PMID:17940997

Abstract

Applied researchers frequently use automated model selection methods, such as backwards variable elimination, to develop parsimonious regression models. Statisticians have criticized the use of these methods for several reasons, amongst them are the facts that the estimated regression coefficients are biased and that the derived confidence intervals do not have the advertised coverage rates. We developed a method to improve estimation of regression coefficients and confidence intervals which employs backwards variable elimination in multiple bootstrap samples. In a given bootstrap sample, predictor variables that are not selected for inclusion in the final regression model have their regression coefficient set to zero. Regression coefficients are averaged across the bootstrap samples, and non-parametric percentile bootstrap confidence intervals are then constructed for each regression coefficient. We conducted a series of Monte Carlo simulations to examine the performance of this method for estimating regression coefficients and constructing confidence intervals for variables selected using backwards variable elimination. We demonstrated that this method results in confidence intervals with superior coverage compared with those developed from conventional backwards variable elimination. We illustrate the utility of our method by applying it to a large sample of subjects hospitalized with a heart attack.

摘要

应用研究人员经常使用自动模型选择方法，如向后变量剔除，来开发简约的回归模型。统计学家因多种原因批评了这些方法的使用，其中包括估计的回归系数存在偏差，以及导出的置信区间没有所宣称的覆盖率。我们开发了一种方法来改进回归系数和置信区间的估计，该方法在多个自助抽样样本中采用向后变量剔除。在给定的自助抽样样本中，未被选入最终回归模型的预测变量的回归系数被设为零。对自助抽样样本的回归系数进行平均，然后为每个回归系数构建非参数百分位数自助置信区间。我们进行了一系列蒙特卡罗模拟，以检验该方法在估计回归系数以及为使用向后变量剔除所选变量构建置信区间方面的性能。我们证明，与传统向后变量剔除所构建的置信区间相比，该方法得到的置信区间具有更好的覆盖率。我们通过将该方法应用于大量因心脏病发作住院的受试者样本，来说明我们方法的实用性。

相似文献

Using the bootstrap to improve estimation and confidence intervals for regression coefficients selected using backwards variable elimination.

Stat Med. 2008 Jul 30;27(17):3286-300. doi: 10.1002/sim.3104.

Bootstrap model selection had similar performance for selecting authentic and noise variables compared to backward variable elimination: a simulation study.

J Clin Epidemiol. 2008 Oct;61(10):1009-17.e1. doi: 10.1016/j.jclinepi.2007.11.014. Epub 2008 Jun 9.

Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality.

J Clin Epidemiol. 2004 Nov;57(11):1138-46. doi: 10.1016/j.jclinepi.2004.04.003.

The use of bootstrap methods for estimating sample size and analysing health-related quality of life outcomes.

Stat Med. 2005 Apr 15;24(7):1075-102. doi: 10.1002/sim.1984.

Non-parametric bootstrap confidence intervals for the intraclass correlation coefficient.

Stat Med. 2003 Dec 30;22(24):3805-21. doi: 10.1002/sim.1643.

Use of the bootstrap in analysing cost data from cluster randomised trials: some simulation results.

BMC Health Serv Res. 2004 Nov 18;4(1):33. doi: 10.1186/1472-6963-4-33.

Bootstrap standard error and confidence intervals for the correlations corrected for indirect range restriction.

Br J Math Stat Psychol. 2011 Nov;64(3):367-87. doi: 10.1348/2044-8317.002007. Epub 2010 Dec 6.

Relative risks and confidence intervals were easily computed indirectly from multivariable logistic regression.

J Clin Epidemiol. 2007 Sep;60(9):874-82. doi: 10.1016/j.jclinepi.2006.12.001. Epub 2007 Jan 18.

Non-parametric estimators of a monotonic dose-response curve and bootstrap confidence intervals.

Stat Med. 2003 Mar 30;22(6):869-82. doi: 10.1002/sim.1460.

A bootstrap method to avoid the effect of concurvity in generalised additive models in time series studies of air pollution.

J Epidemiol Community Health. 2005 Oct;59(10):881-4. doi: 10.1136/jech.2004.026740.

引用本文的文献

Prediction of acute kidney injury risk after cardiac surgery: using a hybrid machine learning algorithm.

BMC Med Inform Decis Mak. 2022 May 18;22(1):137. doi: 10.1186/s12911-022-01859-w.

Relative strain is a novel predictor of aneurysmal degeneration of the thoracic aorta: An ex vivo mechanical study.

JVS Vasc Sci. 2021 Oct 8;2:235-246. doi: 10.1016/j.jvssci.2021.08.003. eCollection 2021.

Considering cross-cultural differences in sleep duration between Japanese and Canadian university students.

PLoS One. 2021 Apr 26;16(4):e0250671. doi: 10.1371/journal.pone.0250671. eCollection 2021.

Exploring the relationship between sexual function, sense of coherence, and well-being in a sample of Iranian breast cancer survivors.

Support Care Cancer. 2021 Jun;29(6):3191-3199. doi: 10.1007/s00520-020-05831-0. Epub 2020 Oct 22.

Derivation and validation of text search algorithms for renal and adrenal lesion identification in radiology text reports.

Can Urol Assoc J. 2020 Jun;14(6):E264-E270. doi: 10.5489/cuaj.6105.

Risk Prediction Tools to Improve Patient Selection for Carotid Endarterectomy Among Patients With Asymptomatic Carotid Stenosis.

JAMA Surg. 2019 Apr 1;154(4):336-344. doi: 10.1001/jamasurg.2018.5119.

Sense of coherence as a mediator of health-related quality of life dimensions in patients with breast cancer: a longitudinal study with prospective design.

Health Qual Life Outcomes. 2015 Dec 9;13:195. doi: 10.1186/s12955-015-0392-4.

Predictors of Clinical Success in the Treatment of Patients with Methicillin-Resistant Staphylococcus aureus (MRSA) Nosocomial Pneumonia (NP).

PLoS One. 2015 Jul 21;10(7):e0131932. doi: 10.1371/journal.pone.0131932. eCollection 2015.

Risk-adjusted clinical outcomes in patients enrolled in a bloodless program.

Transfusion. 2014 Oct;54(10 Pt 2):2668-77. doi: 10.1111/trf.12752. Epub 2014 Jun 18.

Measuring the degree of integration for an integrated service network.

Int J Integr Care. 2012 Sep 18;12:e137. doi: 10.5334/ijic.835. Print 2012 Jul-Sep.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用自助法改进通过向后变量消除法选择的回归系数的估计和置信区间。

Using the bootstrap to improve estimation and confidence intervals for regression coefficients selected using backwards variable elimination.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献