Breheny Patrick, Huang Jian
Department of Biostatistics, Department of Statistics, University of Kentucky, 121 Washington Ave., Room 203C, Lexington, Kentucky 40536-0003, USA. Department of Statistics, and Actuarial Sciences, Department of Biostatistics, University of Iowa, 241 Schaeffer Hall, Iowa City, Iowa 52242, USA.
Ann Appl Stat. 2011 Jan 1;5(1):232-253. doi: 10.1214/10-AOAS388.
A number of variable selection methods have been proposed involving nonconvex penalty functions. These methods, which include the smoothly clipped absolute deviation (SCAD) penalty and the minimax concave penalty (MCP), have been demonstrated to have attractive theoretical properties, but model fitting is not a straightforward task, and the resulting solutions may be unstable. Here, we demonstrate the potential of coordinate descent algorithms for fitting these models, establishing theoretical convergence properties and demonstrating that they are significantly faster than competing approaches. In addition, we demonstrate the utility of convexity diagnostics to determine regions of the parameter space in which the objective function is locally convex, even though the penalty is not. Our simulation study and data examples indicate that nonconvex penalties like MCP and SCAD are worthwhile alternatives to the lasso in many applications. In particular, our numerical results suggest that MCP is the preferred approach among the three methods.
已经提出了许多涉及非凸惩罚函数的变量选择方法。这些方法包括平滑截断绝对偏差(SCAD)惩罚和极小极大凹惩罚(MCP),已被证明具有吸引人的理论性质,但模型拟合并非易事,且所得解可能不稳定。在此,我们展示了坐标下降算法用于拟合这些模型的潜力,建立了理论收敛性质,并证明它们比其他竞争方法快得多。此外,我们展示了凸性诊断的效用,以确定目标函数局部凸的参数空间区域,即使惩罚不是凸的。我们的模拟研究和数据示例表明,在许多应用中,像MCP和SCAD这样的非凸惩罚是套索回归的有价值替代方法。特别是,我们的数值结果表明,MCP是这三种方法中的首选方法。