Arroyo Relión Jesús D, Kessler Daniel, Levina Elizaveta, Taylor Stephan F
Johns Hopkins University.
University of Michigan.
Ann Appl Stat. 2019 Sep;13(3):1648-1677. doi: 10.1214/19-AOAS1252. Epub 2019 Oct 17.
While statistical analysis of a single network has received a lot of attention in recent years, with a focus on social networks, analysis of a sample of networks presents its own challenges which require a different set of analytic tools. Here we study the problem of classification of networks with labeled nodes, motivated by applications in neuroimaging. Brain networks are constructed from imaging data to represent functional connectivity between regions of the brain, and previous work has shown the potential of such networks to distinguish between various brain disorders, giving rise to a network classification problem. Existing approaches tend to either treat all edge weights as a long vector, ignoring the network structure, or focus on graph topology as represented by summary measures while ignoring the edge weights. Our goal is to design a classification method that uses both the individual edge information and the network structure of the data in a computationally efficient way, and that can produce a parsimonious and interpretable representation of differences in brain connectivity patterns between classes. We propose a graph classification method that uses edge weights as predictors but incorporates the network nature of the data via penalties that promote sparsity in the number of nodes, in addition to the usual sparsity penalties that encourage selection of edges. We implement the method via efficient convex optimization and provide a detailed analysis of data from two fMRI studies of schizophrenia.
近年来,虽然对单个网络的统计分析受到了广泛关注,尤其是聚焦于社交网络,但对网络样本的分析也带来了自身的挑战,这需要一套不同的分析工具。在此,受神经成像应用的启发,我们研究带标签节点的网络分类问题。脑网络由成像数据构建而成,用于表示大脑区域之间的功能连接,先前的研究表明此类网络在区分各种脑部疾病方面具有潜力,从而引发了网络分类问题。现有方法往往要么将所有边权重视为一个长向量,而忽略网络结构,要么专注于由汇总度量表示的图拓扑结构,同时忽略边权重。我们的目标是设计一种分类方法,该方法能以计算高效的方式同时使用数据的单个边信息和网络结构,并且能够生成类间脑连接模式差异的简洁且可解释的表示。我们提出一种图分类方法,该方法将边权重用作预测变量,但除了鼓励选择边的常见稀疏惩罚外,还通过促进节点数量稀疏性的惩罚纳入数据的网络性质。我们通过高效的凸优化实现该方法,并对来自两项精神分裂症功能磁共振成像研究的数据进行了详细分析。