Banerjee Samprit, Monni Stefano
Division of Biostatistics, Weill Medical College of Cornell University.
Department of Mathematics, American University of Beirut.
J Stat Plan Inference. 2021 Jul;213:16-32. doi: 10.1016/j.jspi.2020.10.006. Epub 2020 Nov 16.
We introduce an estimation method of covariance matrices in a high-dimensional setting, i.e., when the dimension of the matrix, , is larger than the sample size . Specifically, we propose an orthogonally equivariant estimator. The eigenvectors of such estimator are the same as those of the sample covariance matrix. The eigenvalue estimates are obtained from an adjusted profile likelihood function derived by approximating the integral of the density function of the sample covariance matrix over its eigenvectors, which is a challenging problem in its own right. Exact solutions to the approximate likelihood equations are obtained and employed to construct estimates that involve a tuning parameter. Bootstrap and cross-validation based algorithms are proposed to choose this tuning parameter under various loss functions. Finally, comparisons with two well-known orthogonally equivariant estimators are given, which are based on Monte-Carlo risk estimates for simulated data and misclassification errors in real data analyses.
我们介绍一种在高维情形下协方差矩阵的估计方法,即当矩阵的维度(p)大于样本量(n)时的情况。具体而言,我们提出一种正交不变估计器。这种估计器的特征向量与样本协方差矩阵的特征向量相同。特征值估计是通过对样本协方差矩阵密度函数在其特征向量上的积分进行近似而得到的调整后的轮廓似然函数得出的,这本身就是一个具有挑战性的问题。我们获得了近似似然方程的精确解,并用于构建涉及一个调谐参数的估计。提出了基于自助法和交叉验证的算法,以在各种损失函数下选择此调谐参数。最后,给出了与两个著名的正交不变估计器的比较,这是基于模拟数据的蒙特卡罗风险估计和实际数据分析中的错误分类误差进行的。