Wang Jia, Cai Xizhen, Niu Xiaoyue, Li Runze
Department of Statistics, Pennsylvania State University, University Park, PA 16802,USA.
Department of Mathematics and Statistics, Williams College, Williamstown, MA 01267,USA.
J Am Stat Assoc. 2024;119(546):1322-1335. doi: 10.1080/01621459.2023.2187815. Epub 2023 Apr 13.
We consider a class of network models, in which the connection probability depends on ultrahigh-dimensional nodal covariates () and node-specific popularity (). A Bayesian method is proposed to select nodal features in both dense and sparse networks under a mild assumption on popularity parameters. The proposed approach is implemented via Gibbs sampling. To alleviate the computational burden for large sparse networks, we further develop a working model in which parameters are updated based on a dense sub-graph at each step. Model selection consistency is established for both models, in the sense that the probability of the true model being selected converges to one asymptotically, even when the dimension grows with the network size at an exponential rate. The performance of the proposed models and estimation procedures are illustrated through Monte Carlo studies and three real world examples.
我们考虑一类网络模型,其中连接概率取决于超高维节点协变量()和节点特定的受欢迎程度()。在对受欢迎程度参数的一个温和假设下,提出了一种贝叶斯方法来在密集和稀疏网络中选择节点特征。所提出的方法通过吉布斯采样来实现。为了减轻大型稀疏网络的计算负担,我们进一步开发了一个工作模型,其中参数在每一步基于一个密集子图进行更新。两个模型都建立了模型选择一致性,即即使维度以指数速率随网络规模增长,选择真实模型的概率渐近收敛到1。通过蒙特卡罗研究和三个实际例子说明了所提出模型和估计程序的性能。