Xiao Wei, Wu Yichao, Zhou Hua
Department of Statistics, North Carolina State University, Raleigh, NC 27695.
J Comput Graph Stat. 2015 Jul 1;24(3):603-626. doi: 10.1080/10618600.2014.962700. Epub 2015 Sep 16.
Least angle regression (LAR) was proposed by Efron, Hastie, Johnstone and Tibshirani (2004) for continuous model selection in linear regression. It is motivated by a geometric argument and tracks a path along which the predictors enter successively and the active predictors always maintain the same absolute correlation (angle) with the residual vector. Although it quickly gained popularity, its extensions remain rare compared with penalty methods. In this expository article, we show that the powerful geometric idea of LAR can be generalized in a fruitful way. We propose a ConvexLAR algorithm that works for any convex loss function and extends naturally to group selection and data-adaptive variable selection. After a simple modification, it also yields new exact path algorithms for certain penalty methods, such as a convex loss function with a lasso or group lasso penalty. Variable selection in recurrent event and panel count data analysis, AdaBoost, and the Gaussian graphical model is reconsidered from the ConvexLAR angle.
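The defining geometric property described in the abstract, that all active predictors share the same absolute correlation (angle) with the current residual at each knot of the path, can be checked numerically. The sketch below is a minimal illustration on synthetic data using scikit-learn's `lars_path` (the classical LAR for squared-error loss, not the ConvexLAR algorithm of this paper); the data and step index are hypothetical choices for demonstration.

```python
import numpy as np
from sklearn.linear_model import lars_path

# Synthetic design with standardized (unit-norm, centered) columns,
# so X.T @ r gives correlations with the residual up to a common scale.
rng = np.random.default_rng(0)
n, p = 100, 5
X = rng.standard_normal((n, p))
X -= X.mean(axis=0)
X /= np.linalg.norm(X, axis=0)
beta_true = np.array([3.0, -2.0, 0.0, 0.0, 1.5])
y = X @ beta_true + 0.1 * rng.standard_normal(n)
y -= y.mean()

# Compute the full LAR path: `active` lists predictors in order of
# entry, `coefs[:, k]` is the coefficient vector at the k-th knot.
alphas, active, coefs = lars_path(X, y, method="lar")

# At an interior knot, the active predictors are equicorrelated with
# the residual (equal angles), up to numerical tolerance.
k = 2  # knot after two predictors have entered (illustrative choice)
resid = y - X @ coefs[:, k]
corr = np.abs(X.T @ resid)
active_k = active[:k]
print(corr[active_k])  # entries agree up to numerical tolerance
```

The equicorrelation holds exactly in theory at every knot of the LAR path; it is precisely this property that the ConvexLAR generalization preserves by replacing the residual correlation with the gradient of a general convex loss.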