Hu Zongliang, Dong Kai, Dai Wenlin, Tong Tiejun
.
, Hong.
Int J Biostat. 2017 Sep 21;13(2):/j/ijb.2017.13.issue-2/ijb-2017-0013/ijb-2017-0013.xml. doi: 10.1515/ijb-2017-0013.
The determinant of the covariance matrix for high-dimensional data plays an important role in statistical inference and decision. It has many real applications including statistical tests and information theory. Due to the statistical and computational challenges with high dimensionality, little work has been proposed in the literature for estimating the determinant of high-dimensional covariance matrix. In this paper, we estimate the determinant of the covariance matrix using some recent proposals for estimating high-dimensional covariance matrix. Specifically, we consider a total of eight covariance matrix estimation methods for comparison. Through extensive simulation studies, we explore and summarize some interesting comparison results among all compared methods. We also provide practical guidelines based on the sample size, the dimension, and the correlation of the data set for estimating the determinant of high-dimensional covariance matrix. Finally, from a perspective of the loss function, the comparison study in this paper may also serve as a proxy to assess the performance of the covariance matrix estimation.
高维数据协方差矩阵的行列式在统计推断和决策中起着重要作用。它有许多实际应用,包括统计检验和信息论。由于高维度带来的统计和计算挑战,文献中很少有关于估计高维协方差矩阵行列式的工作。在本文中,我们使用一些最近提出的估计高维协方差矩阵的方法来估计协方差矩阵的行列式。具体来说,我们总共考虑了八种协方差矩阵估计方法进行比较。通过广泛的模拟研究,我们探索并总结了所有比较方法之间一些有趣的比较结果。我们还根据样本大小、维度和数据集的相关性提供了估计高维协方差矩阵行列式的实用指南。最后,从损失函数的角度来看,本文的比较研究也可以作为评估协方差矩阵估计性能的一个代理。