高维小样本协方差矩阵的正交等变估计量

An Orthogonally Equivariant Estimator of the Covariance Matrix in High Dimensions and for Small Sample Sizes.

作者信息

Banerjee Samprit, Monni Stefano

机构信息

Division of Biostatistics, Weill Medical College of Cornell University.

Department of Mathematics, American University of Beirut.

出版信息

J Stat Plan Inference. 2021 Jul;213:16-32. doi: 10.1016/j.jspi.2020.10.006. Epub 2020 Nov 16.

DOI:10.1016/j.jspi.2020.10.006

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7709931/

Abstract

We introduce an estimation method of covariance matrices in a high-dimensional setting, i.e., when the dimension of the matrix, , is larger than the sample size . Specifically, we propose an orthogonally equivariant estimator. The eigenvectors of such estimator are the same as those of the sample covariance matrix. The eigenvalue estimates are obtained from an adjusted profile likelihood function derived by approximating the integral of the density function of the sample covariance matrix over its eigenvectors, which is a challenging problem in its own right. Exact solutions to the approximate likelihood equations are obtained and employed to construct estimates that involve a tuning parameter. Bootstrap and cross-validation based algorithms are proposed to choose this tuning parameter under various loss functions. Finally, comparisons with two well-known orthogonally equivariant estimators are given, which are based on Monte-Carlo risk estimates for simulated data and misclassification errors in real data analyses.

摘要

我们介绍一种在高维情形下协方差矩阵的估计方法，即当矩阵的维度(p)大于样本量(n)时的情况。具体而言，我们提出一种正交不变估计器。这种估计器的特征向量与样本协方差矩阵的特征向量相同。特征值估计是通过对样本协方差矩阵密度函数在其特征向量上的积分进行近似而得到的调整后的轮廓似然函数得出的，这本身就是一个具有挑战性的问题。我们获得了近似似然方程的精确解，并用于构建涉及一个调谐参数的估计。提出了基于自助法和交叉验证的算法，以在各种损失函数下选择此调谐参数。最后，给出了与两个著名的正交不变估计器的比较，这是基于模拟数据的蒙特卡罗风险估计和实际数据分析中的错误分类误差进行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/961f/7709931/5b5f728b435d/nihms-1647217-f0001.jpg

相似文献

1

An Orthogonally Equivariant Estimator of the Covariance Matrix in High Dimensions and for Small Sample Sizes.高维小样本协方差矩阵的正交等变估计量

J Stat Plan Inference. 2021 Jul;213:16-32. doi: 10.1016/j.jspi.2020.10.006. Epub 2020 Nov 16.

2

Equivariant minimax dominators of the MLE in the array normal model.数组正态模型中极大似然估计的等变极小极大主导者

J Multivar Anal. 2015 May 1;137:32-49. doi: 10.1016/j.jmva.2015.01.020.

3

Cross-Validated Loss-Based Covariance Matrix Estimator Selection in High Dimensions.高维中基于交叉验证损失的协方差矩阵估计器选择

J Comput Graph Stat. 2023;32(2):601-612. doi: 10.1080/10618600.2022.2110883. Epub 2022 Oct 7.

4

Shrinkage estimators for covariance matrices.协方差矩阵的收缩估计量。

Biometrics. 2001 Dec;57(4):1173-84. doi: 10.1111/j.0006-341x.2001.01173.x.

5

Condition Number Regularized Covariance Estimation.条件数正则化协方差估计

J R Stat Soc Series B Stat Methodol. 2013 Jun 1;75(3):427-450. doi: 10.1111/j.1467-9868.2012.01049.x.

6

Estimation of Large-Dimensional Covariance Matrices via Second-Order Stein-Type Regularization.通过二阶斯坦因型正则化估计大维度协方差矩阵

Entropy (Basel). 2022 Dec 27;25(1):53. doi: 10.3390/e25010053.

7

A generalized-weights solution to sample overlap in meta-analysis.广义权重法解决荟萃分析中的样本重叠问题。

Res Synth Methods. 2020 Nov;11(6):812-832. doi: 10.1002/jrsm.1441. Epub 2020 Sep 18.

8

Estimation of parameters of inverse Weibull distribution and application to multi-component stress-strength model.逆威布尔分布参数估计及其在多组件应力-强度模型中的应用。

J Appl Stat. 2020 Aug 8;49(1):169-194. doi: 10.1080/02664763.2020.1803815. eCollection 2022.

9

The Bayesian Covariance Lasso.贝叶斯协方差套索

Stat Interface. 2013 Apr 1;6(2):243-259. doi: 10.4310/sii.2013.v6.n2.a8.

10

Optimal Shrinkage of Eigenvalues in the Spiked Covariance Model.尖峰协方差模型中特征值的最优收缩

Ann Stat. 2018 Aug;46(4):1742-1778. doi: 10.1214/17-AOS1601. Epub 2018 Jun 27.

本文引用的文献

1

Condition Number Regularized Covariance Estimation.条件数正则化协方差估计

J R Stat Soc Series B Stat Methodol. 2013 Jun 1;75(3):427-450. doi: 10.1111/j.1467-9868.2012.01049.x.

2

Sparse estimation of a covariance matrix.协方差矩阵的稀疏估计。

Biometrika. 2011 Dec;98(4):807-820. doi: 10.1093/biomet/asr054.

3

Pharmacogenomic predictor of sensitivity to preoperative chemotherapy with paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide in breast cancer.乳腺癌对紫杉醇、氟尿嘧啶、阿霉素和环磷酰胺术前化疗敏感性的药物基因组学预测指标

J Clin Oncol. 2006 Sep 10;24(26):4236-44. doi: 10.1200/JCO.2006.05.6861. Epub 2006 Aug 8.

4

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.癌症的分子分类：通过基因表达监测进行类别发现和类别预测。

Science. 1999 Oct 15;286(5439):531-7. doi: 10.1126/science.286.5439.531.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验