Fan Jianqing, Liao Yuan, Mincheva Martina
Department of Operations Research and Financial Engineering, Princeton University ; Bendheim Center for Finance, Princeton University.
Department of Mathematics, University of Maryland.
J R Stat Soc Series B Stat Methodol. 2013 Sep 1;75(4). doi: 10.1111/rssb.12016.
This paper deals with the estimation of a high-dimensional covariance with a conditional sparsity structure and fast-diverging eigenvalues. By assuming sparse error covariance matrix in an approximate factor model, we allow for the presence of some cross-sectional correlation even after taking out common but unobservable factors. We introduce the Principal Orthogonal complEment Thresholding (POET) method to explore such an approximate factor structure with sparsity. The POET estimator includes the sample covariance matrix, the factor-based covariance matrix (Fan, Fan, and Lv, 2008), the thresholding estimator (Bickel and Levina, 2008) and the adaptive thresholding estimator (Cai and Liu, 2011) as specific examples. We provide mathematical insights when the factor analysis is approximately the same as the principal component analysis for high-dimensional data. The rates of convergence of the sparse residual covariance matrix and the conditional sparse covariance matrix are studied under various norms. It is shown that the impact of estimating the unknown factors vanishes as the dimensionality increases. The uniform rates of convergence for the unobserved factors and their factor loadings are derived. The asymptotic results are also verified by extensive simulation studies. Finally, a real data application on portfolio allocation is presented.
本文研究具有条件稀疏结构和快速发散特征值的高维协方差估计问题。通过在近似因子模型中假设误差协方差矩阵稀疏,即使在去除共同但不可观测的因子后,我们仍允许存在一些横截面相关性。我们引入主正交补阈值法(POET)来探索这种具有稀疏性的近似因子结构。POET估计器包括样本协方差矩阵、基于因子的协方差矩阵(范剑青、范剑青和吕晓玲,2008)、阈值估计器(比克尔和列维纳,2008)以及自适应阈值估计器(蔡和刘,2011)作为具体例子。当因子分析与高维数据的主成分分析近似相同时,我们给出了数学见解。在各种范数下研究了稀疏残差协方差矩阵和条件稀疏协方差矩阵的收敛速度。结果表明,随着维度增加,估计未知因子的影响逐渐消失。推导了未观测因子及其因子载荷的一致收敛速度。大量模拟研究也验证了渐近结果。最后,给出了一个投资组合分配的实际数据应用。