Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213.
Heinz College, Carnegie Mellon University, Pittsburgh, PA 15213.
Proc Natl Acad Sci U S A. 2018 Jan 30;115(5):927-932. doi: 10.1073/pnas.1718449115. Epub 2018 Jan 16.
Community detection is challenging when the network structure is estimated with uncertainty. Dynamic networks present additional challenges but also add information across time periods. We propose a global community detection method, persistent communities by eigenvector smoothing (PisCES), that combines information across a series of networks, longitudinally, to strengthen the inference for each period. Our method is derived from evolutionary spectral clustering and degree correction methods. Data-driven solutions to the problem of tuning parameter selection are provided. In simulations we find that PisCES performs better than competing methods designed for a low signal-to-noise ratio. Recently obtained gene expression data from rhesus monkey brains provide samples from finely partitioned brain regions over a broad time span including pre- and postnatal periods. Of interest is how gene communities develop over space and time; however, once the data are divided into homogeneous spatial and temporal periods, sample sizes are very small, making inference quite challenging. Applying PisCES to medial prefrontal cortex in monkey rhesus brains from near conception to adulthood reveals dense communities that persist, merge, and diverge over time and others that are loosely organized and short lived, illustrating how dynamic community detection can yield interesting insights into processes such as brain development.
当网络结构的估计存在不确定性时,进行社区检测具有挑战性。动态网络带来了额外的挑战,但也在各个时间段增加了信息。我们提出了一种全局社区检测方法,即特征向量平滑的持久社区(PisCES),它通过纵向将一系列网络中的信息进行组合,以加强每个时间段的推断。我们的方法源自演化谱聚类和度修正方法。针对调整参数选择问题,提供了数据驱动的解决方案。在模拟中,我们发现 PisCES 的性能优于为低信噪比设计的竞争方法。最近从恒河猴大脑中获得的基因表达数据提供了在广泛的时间跨度内(包括产前和产后期间)从精细分区的大脑区域获得的样本。感兴趣的是基因社区如何在空间和时间上发展;但是,一旦数据被划分为同质的时空期,样本量就非常小,使得推断极具挑战性。将 PisCES 应用于从恒河猴大脑近到成年的内侧前额叶皮层,揭示了密集的社区,这些社区随时间持续、合并和分化,而其他社区则组织松散且短暂,这说明了动态社区检测如何为大脑发育等过程提供有趣的见解。