Lee Wonyul, Liu Yufeng
Department of Statistics and Operations Research, Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
J Multivar Anal. 2012 Oct 1;111:241-255. doi: 10.1016/j.jmva.2012.03.013. Epub 2012 Apr 27.
Multivariate regression is a common statistical tool for practical problems. Many multivariate regression techniques are designed for univariate response cases. For problems with multiple response variables available, one common approach is to apply the univariate response regression technique separately on each response variable. Although it is simple and popular, the univariate response approach ignores the joint information among response variables. In this paper, we propose three new methods for utilizing joint information among response variables. All methods are in a penalized likelihood framework with weighted L(1) regularization. The proposed methods provide sparse estimators of conditional inverse co-variance matrix of response vector given explanatory variables as well as sparse estimators of regression parameters. Our first approach is to estimate the regression coefficients with plug-in estimated inverse covariance matrices, and our second approach is to estimate the inverse covariance matrix with plug-in estimated regression parameters. Our third approach is to estimate both simultaneously. Asymptotic properties of these methods are explored. Our numerical examples demonstrate that the proposed methods perform competitively in terms of prediction, variable selection, as well as inverse covariance matrix estimation.
多元回归是解决实际问题常用的统计工具。许多多元回归技术是针对单变量响应情形设计的。对于有多个响应变量的问题,一种常见方法是对每个响应变量分别应用单变量响应回归技术。尽管这种方法简单且常用,但单变量响应方法忽略了响应变量之间的联合信息。在本文中,我们提出了三种利用响应变量之间联合信息的新方法。所有方法都在惩罚似然框架下,采用加权(L(1))正则化。所提出的方法提供了给定解释变量时响应向量的条件逆协方差矩阵的稀疏估计以及回归参数的稀疏估计。我们的第一种方法是用代入估计的逆协方差矩阵来估计回归系数,第二种方法是用代入估计的回归参数来估计逆协方差矩阵。我们的第三种方法是同时进行估计。探索了这些方法的渐近性质。我们的数值例子表明,所提出的方法在预测、变量选择以及逆协方差矩阵估计方面具有竞争力。