Department of Mathematical Sciences, University of Texas at Dallas, Richardson, TX 75080.
Department of Electrical and Computer Engineering, Princeton University, Princeton, NJ 08544.
Proc Natl Acad Sci U S A. 2021 May 25;118(21). doi: 10.1073/pnas.2019994118.
Multilayer networks continue to gain significant attention in many areas of study, particularly due to their high utility in modeling interdependent systems such as critical infrastructures, human brain connectome, and socioenvironmental ecosystems. However, clustering of multilayer networks, especially using the information on higher-order interactions of the system entities, still remains in its infancy. In turn, higher-order connectivity is often the key in such multilayer network applications as developing optimal partitioning of critical infrastructures in order to isolate unhealthy system components under cyber-physical threats and simultaneous identification of multiple brain regions affected by trauma or mental illness. In this paper, we introduce the concepts of topological data analysis to studies of complex multilayer networks and propose a topological approach for network clustering. The key rationale is to group nodes based not on pairwise connectivity patterns or relationships between observations recorded at two individual nodes but based on how similar in shape their local neighborhoods are at various resolution scales. Since shapes of local node neighborhoods are quantified using a topological summary in terms of persistence diagrams, we refer to the approach as clustering using persistence diagrams (CPD). CPD systematically accounts for the important heterogeneous higher-order properties of node interactions within and in-between network layers and integrates information from the node neighbors. We illustrate the utility of CPD by applying it to an emerging problem of societal importance: vulnerability zoning of residential properties to weather- and climate-induced risks in the context of house insurance claim dynamics.
多层网络在许多研究领域继续受到广泛关注,特别是由于它们在建模相互依存的系统(如关键基础设施、人类大脑连接组和社会环境生态系统)方面具有很高的实用性。然而,多层网络的聚类,特别是利用系统实体的高阶相互作用的信息进行聚类,仍然处于起步阶段。反过来,高阶连接通常是多层网络应用的关键,例如开发关键基础设施的最佳分区,以便在网络物理威胁下隔离不健康的系统组件,同时识别受创伤或精神疾病影响的多个大脑区域。在本文中,我们将拓扑数据分析的概念引入到复杂的多层网络研究中,并提出了一种网络聚类的拓扑方法。关键原理是,不是根据节点之间的成对连接模式或关系,而是根据它们在不同分辨率尺度下局部节点邻居的形状相似程度来对节点进行分组。由于局部节点邻居的形状是使用持久性图的拓扑摘要来量化的,因此我们将该方法称为基于持久性图的聚类(CPD)。CPD 系统地考虑了网络层内和层间节点相互作用的重要异构高阶特性,并整合了节点邻居的信息。我们通过将其应用于一个具有社会重要性的新兴问题来展示 CPD 的实用性:在房屋保险索赔动态的背景下,对住宅物业的天气和气候风险的脆弱性进行分区。