Suppr超能文献

具有发散尖峰的随机矩阵特征向量的渐近理论

Asymptotic Theory of Eigenvectors for Random Matrices with Diverging Spikes.

作者信息

Fan Jianqing, Fan Yingying, Han Xiao, Lv Jinchi

机构信息

Princeton University.

University of Southern California.

出版信息

J Am Stat Assoc. 2022;117(538):996-1009. doi: 10.1080/01621459.2020.1840990. Epub 2020 Dec 8.

Abstract

Characterizing the asymptotic distributions of eigenvectors for large random matrices poses important challenges yet can provide useful insights into a range of statistical applications. To this end, in this paper we introduce a general framework of asymptotic theory of eigenvectors (ATE) for large spiked random matrices with diverging spikes and heterogeneous variances, and establish the asymptotic properties of the spiked eigenvectors and eigenvalues for the scenario of the generalized Wigner matrix noise. Under some mild regularity conditions, we provide the asymptotic expansions for the spiked eigenvalues and show that they are asymptotically normal after some normalization. For the spiked eigenvectors, we establish asymptotic expansions for the general linear combination and further show that it is asymptotically normal after some normalization, where the weight vector can be arbitrary. We also provide a more general asymptotic theory for the spiked eigenvectors using the bilinear form. Simulation studies verify the validity of our new theoretical results. Our family of models encompasses many popularly used ones such as the stochastic block models with or without overlapping communities for network analysis and the topic models for text analysis, and our general theory can be exploited for statistical inference in these large-scale applications.

摘要

刻画大型随机矩阵特征向量的渐近分布面临着重大挑战,但能为一系列统计应用提供有用的见解。为此,在本文中,我们针对具有发散尖峰和异质方差的大型尖峰随机矩阵,引入了特征向量渐近理论(ATE)的一般框架,并建立了广义维格纳矩阵噪声情形下尖峰特征向量和特征值的渐近性质。在一些温和的正则条件下,我们给出了尖峰特征值的渐近展开式,并表明经过一些归一化后它们渐近服从正态分布。对于尖峰特征向量,我们建立了一般线性组合的渐近展开式,并进一步表明经过一些归一化后它渐近服从正态分布,其中权重向量可以是任意的。我们还使用双线性形式为尖峰特征向量提供了更一般的渐近理论。模拟研究验证了我们新理论结果的有效性。我们的模型族涵盖了许多常用的模型,例如用于网络分析的有或无重叠社区的随机块模型以及用于文本分析的主题模型,并且我们的一般理论可用于这些大规模应用中的统计推断。

相似文献

1
Asymptotic Theory of Eigenvectors for Random Matrices with Diverging Spikes.具有发散尖峰的随机矩阵特征向量的渐近理论
J Am Stat Assoc. 2022;117(538):996-1009. doi: 10.1080/01621459.2020.1840990. Epub 2020 Dec 8.

引用本文的文献

1
Subject clustering by IF-PCA and several recent methods.通过IF-PCA和几种近期方法进行主题聚类。
Front Genet. 2023 May 23;14:1166404. doi: 10.3389/fgene.2023.1166404. eCollection 2023.

本文引用的文献

2
IPAD: Stable Interpretable Forecasting with Knockoffs Inference.IPAD:基于仿冒品推断的稳定可解释预测
J Am Stat Assoc. 2020;115(532):1822-1834. doi: 10.1080/01621459.2019.1654878. Epub 2019 Sep 17.
3
RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs.RANK:基于图形非线性仿样的大规模推断
J Am Stat Assoc. 2020;115(529):362-379. doi: 10.1080/01621459.2018.1546589. Epub 2019 Apr 11.
6
Asymptotic analysis of the stochastic block model for modular networks and its algorithmic applications.模块化网络随机块模型的渐近分析及其算法应用。
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Dec;84(6 Pt 2):066106. doi: 10.1103/PhysRevE.84.066106. Epub 2011 Dec 12.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验