Xu Yungang, Guo Maozu, Liu Xiaoyan, Wang Chunyu, Liu Yang, Liu Guojun
School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
School of Life Science and Technology, Harbin Institute of Technology, Harbin 150001, China.
Nucleic Acids Res. 2016 Nov 16;44(20):e152. doi: 10.1093/nar/gkw679. Epub 2016 Aug 2.
Module identification is a frequently used approach for mining local structures with more significance in global networks. Recently, a wide variety of bilayer networks are emerging to characterize the more complex biological processes. In the light of special topological properties of bilayer networks and the accompanying challenges, there is yet no effective method aiming at bilayer module identification to probe the modular organizations from the more inspiring bilayer networks. To this end, we proposed the pseudo-3D clustering algorithm, which starts from extracting initial non-hierarchically organized modules and then iteratively deciphers the hierarchical organization of modules according to a bottom-up strategy. Specifically, a modularity function for bilayer modules was proposed to facilitate the algorithm reporting the optimal partition that gives the most accurate characterization of the bilayer network. Simulation studies demonstrated its robustness and outperformance against alternative competing methods. Specific applications to both the soybean and human miRNA-gene bilayer networks demonstrated that the pseudo-3D clustering algorithm successfully identified the overlapping, hierarchically organized and highly cohesive bilayer modules. The analyses on topology, functional and human disease enrichment and the bilayer subnetwork involved in soybean fat biosynthesis provided both the theoretical and biological evidence supporting the effectiveness and robustness of pseudo-3D clustering algorithm.
模块识别是一种在全局网络中挖掘具有更重要意义的局部结构时常用的方法。最近,各种各样的双层网络不断涌现,用于描述更复杂的生物过程。鉴于双层网络的特殊拓扑特性以及随之而来的挑战,目前还没有一种有效的方法来针对双层模块识别,以便从更具启发性的双层网络中探究模块组织。为此,我们提出了伪3D聚类算法,该算法从提取初始的非层次组织模块开始,然后根据自下而上的策略迭代地解读模块的层次组织。具体而言,提出了一种针对双层模块的模块化函数,以促进该算法报告能够最准确表征双层网络的最优划分。模拟研究证明了它相对于其他竞争方法的稳健性和优越性。对大豆和人类miRNA-基因双层网络的具体应用表明,伪3D聚类算法成功地识别出了重叠的、层次组织的和高度凝聚的双层模块。对拓扑、功能和人类疾病富集以及大豆脂肪生物合成中涉及的双层子网的分析,为支持伪3D聚类算法的有效性和稳健性提供了理论和生物学证据。