Mishra Aditya, Dey Dipak K, Chen Kun
Department of Statistics, University of Connecticut.
J Comput Graph Stat. 2017;26(4):814-825. doi: 10.1080/10618600.2017.1340891. Epub 2017 Oct 16.
In multivariate regression models, a sparse singular value decomposition of the regression component matrix is appealing for reducing dimensionality and facilitating interpretation. However, the recovery of such a decomposition remains very challenging, largely due to the simultaneous presence of orthogonality constraints and co-sparsity regularization. By delving into the underlying statistical data generation mechanism, we reformulate the problem as a supervised co-sparse factor analysis, and develop an efficient computational procedure, named sequential factor extraction via co-sparse unit-rank estimation (SeCURE), that completely bypasses the orthogonality requirements. At each step, the problem reduces to a sparse multivariate regression with a unit-rank constraint. Nicely, each sequentially extracted sparse and unit-rank coefficient matrix automatically leads to co-sparsity in its pair of singular vectors. Each latent factor is thus a sparse linear combination of the predictors and may influence only a subset of responses. The proposed algorithm is guaranteed to converge, and it ensures efficient computation even with incomplete data and/or when enforcing exact orthogonality is desired. Our estimators enjoy the oracle properties asymptotically; a non-asymptotic error bound further reveals some interesting finite-sample behaviors of the estimators. The efficacy of SeCURE is demonstrated by simulation studies and two applications in genetics.
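The sequential idea described in the abstract, fitting one sparse unit-rank layer at a time and then removing its contribution from the responses, can be illustrated with a minimal NumPy sketch. This is not the authors' SeCURE implementation: the plain L1 penalty `lam * ||a||_1 * ||b||_1`, the proximal-gradient update, the least-squares initialization, and the function names `soft_threshold`, `sparse_unit_rank`, and `sequential_extraction` are all assumptions chosen for illustration, and the sketch does not handle incomplete data or exact orthogonality.

```python
import numpy as np


def soft_threshold(z, t):
    """Elementwise soft-thresholding, the proximal map of t * |.|_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def sparse_unit_rank(X, Y, lam, n_iter=200):
    """Fit one layer C = a b^T (b of unit length) by alternating updates.

    Illustrative objective (an assumption, not the paper's exact criterion):
        0.5 * ||Y - X a b^T||_F^2 + lam * ||a||_1 * ||b||_1
    """
    p, q = X.shape[1], Y.shape[1]
    # Start from the leading singular pair of the least-squares fit.
    C_ls = np.linalg.lstsq(X, Y, rcond=None)[0]
    U, s, Vt = np.linalg.svd(C_ls, full_matrices=False)
    a, b = s[0] * U[:, 0], Vt[0].copy()
    # Step size 1 / Lipschitz constant of the a-gradient when ||b|| = 1.
    step = 1.0 / np.linalg.norm(X, 2) ** 2
    for _ in range(n_iter):
        # a-step: one proximal-gradient step on the lasso problem of Y b on X.
        grad = X.T @ (X @ a - Y @ b)
        a = soft_threshold(a - step * grad, step * lam * np.abs(b).sum())
        z = X @ a
        zz = z @ z
        if zz == 0.0:
            return np.zeros(p), np.zeros(q)
        # b-step: exact minimizer, a soft-thresholded projection of Y onto X a.
        b = soft_threshold(Y.T @ z, lam * np.abs(a).sum()) / zz
        scale = np.linalg.norm(b)
        if scale == 0.0:
            return np.zeros(p), np.zeros(q)
        # Move the scale into a so that b stays unit length; a b^T is unchanged.
        a, b = a * scale, b / scale
    return a, b


def sequential_extraction(X, Y, rank, lam):
    """Extract up to `rank` sparse unit-rank layers, deflating Y after each."""
    residual = Y.astype(float).copy()
    layers = []
    for _ in range(rank):
        a, b = sparse_unit_rank(X, residual, lam)
        if not a.any():
            break  # nothing left above the penalty level
        layers.append((a, b))
        residual -= X @ np.outer(a, b)  # remove the fitted layer
    C_hat = sum((np.outer(a, b) for a, b in layers),
                np.zeros((X.shape[1], Y.shape[1])))
    return C_hat, layers
```

Calling `sequential_extraction(X, Y, rank=2, lam=1.0)` on an `n x p` predictor matrix and an `n x q` response matrix returns the summed coefficient estimate and the list of `(a, b)` pairs. Because each layer is an outer product, a zero entry in `a` zeroes a whole row and a zero entry in `b` zeroes a whole column, which is the co-sparsity property the abstract refers to. No orthogonality constraint is imposed between layers in this sketch.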