Wang Tao, Peng Qidi, Liu Bo, Liu Yongzhuang, Wang Yadong
School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
Front Bioeng Biotechnol. 2020 May 6;8:418. doi: 10.3389/fbioe.2020.00418. eCollection 2020.
The study of disease-relevant gene modules is one of the main methods to discover disease pathway and potential drug targets. Recent studies have found that most disease proteins tend to form many separate connected components and scatter across the protein-protein interaction network. However, most of the research on discovering disease modules are biased toward well-studied seed genes, which tend to extend seed genes into a single connected subnetwork. In this paper, we propose N2V-HC, an algorithm framework aiming to unbiasedly discover the scattered disease modules based on deep representation learning of integrated multi-layer biological networks. Our method first predicts disease associated genes based on summary data of Genome-wide Association Studies (GWAS) and expression Quantitative Trait Loci (eQTL) studies, and generates an integrated network on the basis of human interactome. The features of nodes in the network are then extracted by deep representation learning. Hierarchical clustering with dynamic tree cut methods are applied to discover the modules that are enriched with disease associated genes. The evaluation on real networks and simulated networks show that N2V-HC performs better than existing methods in network module discovery. Case studies on Parkinson's disease and Alzheimer's disease, show that N2V-HC can be used to discover biological meaningful modules related to the pathways underlying complex diseases.
对疾病相关基因模块的研究是发现疾病通路和潜在药物靶点的主要方法之一。最近的研究发现,大多数疾病蛋白倾向于形成许多独立的连通组件,并分散在蛋白质-蛋白质相互作用网络中。然而,大多数发现疾病模块的研究都偏向于对已充分研究的种子基因,这些研究往往将种子基因扩展到单个连通子网中。在本文中,我们提出了N2V-HC,这是一种算法框架,旨在基于整合的多层生物网络的深度表示学习,无偏地发现分散的疾病模块。我们的方法首先基于全基因组关联研究(GWAS)和表达数量性状位点(eQTL)研究的汇总数据预测疾病相关基因,并在人类相互作用组的基础上生成一个整合网络。然后通过深度表示学习提取网络中节点的特征。应用动态树切割方法进行层次聚类,以发现富含疾病相关基因的模块。对真实网络和模拟网络的评估表明,N2V-HC在网络模块发现方面比现有方法表现更好。对帕金森病和阿尔茨海默病的案例研究表明,N2V-HC可用于发现与复杂疾病潜在通路相关的具有生物学意义的模块。