Guo Jian, Levina Elizaveta, Michailidis George, Zhu Ji
Department of Statistics, University of Michigan, 1085 South University, Ann Arbor, Michigan 48109-1107, U.S.A. ,
Biometrika. 2011 Mar;98(1):1-15. doi: 10.1093/biomet/asq060. Epub 2011 Feb 9.
Gaussian graphical models explore dependence relationships between random variables, through the estimation of the corresponding inverse covariance matrices. In this paper we develop an estimator for such models appropriate for data from several graphical models that share the same variables and some of the dependence structure. In this setting, estimating a single graphical model would mask the underlying heterogeneity, while estimating separate models for each category does not take advantage of the common structure. We propose a method that jointly estimates the graphical models corresponding to the different categories present in the data, aiming to preserve the common structure, while allowing for differences between the categories. This is achieved through a hierarchical penalty that targets the removal of common zeros in the inverse covariance matrices across categories. We establish the asymptotic consistency and sparsity of the proposed estimator in the high-dimensional case, and illustrate its performance on a number of simulated networks. An application to learning semantic connections between terms from webpages collected from computer science departments is included.
高斯图形模型通过估计相应的逆协方差矩阵来探索随机变量之间的依赖关系。在本文中,我们针对此类模型开发了一种估计器,适用于来自几个共享相同变量和部分依赖结构的图形模型的数据。在这种情况下,估计单个图形模型会掩盖潜在的异质性,而针对每个类别估计单独的模型则无法利用共同结构。我们提出了一种方法,该方法联合估计与数据中存在的不同类别相对应的图形模型,旨在保留共同结构,同时允许类别之间存在差异。这是通过一种分层惩罚来实现的,该惩罚旨在消除跨类别的逆协方差矩阵中的共同零元素。我们在高维情况下建立了所提出估计器的渐近一致性和稀疏性,并在多个模拟网络上说明了其性能。还包括一个应用,用于学习从计算机科学系收集的网页中术语之间的语义联系。