Consistent Group Identification and Variable Selection in Regression with Correlated Predictors.

Author information

Sharma Dhruv B, Bondell Howard D, Zhang Hao Helen

Publication information

J Comput Graph Stat. 2013 Apr 1;22(2):319-340. doi: 10.1080/15533174.2012.707849.

DOI:10.1080/15533174.2012.707849

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3678393/

Abstract

Statistical procedures for variable selection have become integral elements in any analysis. Successful procedures are characterized by high predictive accuracy, yielding interpretable models while retaining computational efficiency. Penalized methods that perform coefficient shrinkage have been shown to be successful in many cases. Models with correlated predictors are particularly challenging to tackle. We propose a penalization procedure that performs variable selection while clustering groups of predictors automatically. The oracle properties of this procedure, including consistency in group identification, are also studied. The proposed method compares favorably with existing selection approaches in both prediction accuracy and model discovery, while retaining its computational efficiency. Supplemental materials are available online.

Similar articles

Consistent Group Identification and Variable Selection in Regression with Correlated Predictors.

J Comput Graph Stat. 2013 Apr 1;22(2):319-340. doi: 10.1080/15533174.2012.707849.

Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Biometrics. 2008 Mar;64(1):115-23. doi: 10.1111/j.1541-0420.2007.00843.x. Epub 2007 Jun 30.

Variable Selection in Generalized Functional Linear Models.

Stat. 2013;2(1):86-103. doi: 10.1002/sta4.20.

On the robustness of the adaptive lasso to model misspecification.

Biometrika. 2012 Sep;99(3):717-731. doi: 10.1093/biomet/ass027. Epub 2012 Jul 11.

Robust Variable Selection with Exponential Squared Loss.

J Am Stat Assoc. 2013 Apr 1;108(502):632-643. doi: 10.1080/01621459.2013.766613.

NEW EFFICIENT ESTIMATION AND VARIABLE SELECTION METHODS FOR SEMIPARAMETRIC VARYING-COEFFICIENT PARTIALLY LINEAR MODELS.

Ann Stat. 2011 Feb 1;39(1):305-332. doi: 10.1214/10-AOS842.

A Confidence Region Approach to Tuning for Variable Selection.

J Comput Graph Stat. 2012;21(2):295-314. doi: 10.1080/10618600.2012.679890. Epub 2012 Jun 14.

VARIABLE SELECTION FOR HIGH DIMENSIONAL MULTIVARIATE OUTCOMES.

Stat Sin. 2014 Oct;24(4):1633-1654. doi: 10.5705/ss.2013.019.

Interquantile Shrinkage and Variable Selection in Quantile Regression.

Comput Stat Data Anal. 2014 Jan 1;69:208-219. doi: 10.1016/j.csda.2013.08.006.

Interquantile Shrinkage in Regression Models.

J Comput Graph Stat. 2013;22(4). doi: 10.1080/10618600.2012.707454.

Cited by

Graph-based regularization for regression problems with alignment and highly-correlated designs.

SIAM J Math Data Sci. 2020;2(2):480-504. doi: 10.1137/19M1287365. Epub 2020 Jun 16.

Deciduous forest responses to temperature, precipitation, and drought imply complex climate change impacts.

Proc Natl Acad Sci U S A. 2015 Nov 3;112(44):13585-90. doi: 10.1073/pnas.1509991112. Epub 2015 Oct 19.

The Cluster Elastic Net for High-Dimensional Regression With Unknown Variable Grouping.

Technometrics. 2014 Feb 20;56(1):112-122. doi: 10.1080/00401706.2013.810174.

References

ON THE ADAPTIVE ELASTIC-NET WITH A DIVERGING NUMBER OF PARAMETERS.

Ann Stat. 2009;37(4):1733-1751. doi: 10.1214/08-AOS625.

A multivariate regression approach to association analysis of a quantitative trait network.

Bioinformatics. 2009 Jun 15;25(12):i204-12. doi: 10.1093/bioinformatics/btp218.

Variable Selection using MM Algorithms.

Ann Stat. 2005;33(4):1617-1642. doi: 10.1214/009053605000000200.

Simultaneous factor selection and collapsing levels in ANOVA.

Biometrics. 2009 Mar;65(1):169-77. doi: 10.1111/j.1541-0420.2008.01061.x. Epub 2008 May 28.

Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Biometrics. 2008 Mar;64(1):115-23. doi: 10.1111/j.1541-0420.2007.00843.x. Epub 2007 Jun 30.

Averaged gene expressions for regression.

Biostatistics. 2007 Apr;8(2):212-27. doi: 10.1093/biostatistics/kxl002. Epub 2006 May 11.

Supervised harvesting of expression trees.

Genome Biol. 2001;2(1):RESEARCH0003. doi: 10.1186/gb-2001-2-1-research0003. Epub 2001 Jan 10.
