Ghiassian Susan Dina, Menche Jörg, Barabási Albert-László
Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, Massachusetts, United States of America; Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America.
Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, Massachusetts, United States of America; Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America; Center for Network Science, Central European University, Budapest, Hungary.
PLoS Comput Biol. 2015 Apr 8;11(4):e1004120. doi: 10.1371/journal.pcbi.1004120. eCollection 2015 Apr.
The observation that disease associated proteins often interact with each other has fueled the development of network-based approaches to elucidate the molecular mechanisms of human disease. Such approaches build on the assumption that protein interaction networks can be viewed as maps in which diseases can be identified with localized perturbation within a certain neighborhood. The identification of these neighborhoods, or disease modules, is therefore a prerequisite of a detailed investigation of a particular pathophenotype. While numerous heuristic methods exist that successfully pinpoint disease associated modules, the basic underlying connectivity patterns remain largely unexplored. In this work we aim to fill this gap by analyzing the network properties of a comprehensive corpus of 70 complex diseases. We find that disease associated proteins do not reside within locally dense communities and instead identify connectivity significance as the most predictive quantity. This quantity inspires the design of a novel Disease Module Detection (DIAMOnD) algorithm to identify the full disease module around a set of known disease proteins. We study the performance of the algorithm using well-controlled synthetic data and systematically validate the identified neighborhoods for a large corpus of diseases.
疾病相关蛋白常常相互作用这一观察结果推动了基于网络的方法的发展,以阐明人类疾病的分子机制。此类方法基于这样的假设:蛋白质相互作用网络可被视为图谱,其中疾病可通过特定邻域内的局部扰动来识别。因此,识别这些邻域或疾病模块是详细研究特定病理表型的先决条件。虽然存在许多启发式方法能够成功找出与疾病相关的模块,但基本的潜在连接模式在很大程度上仍未得到探索。在这项工作中,我们旨在通过分析70种复杂疾病的综合语料库的网络属性来填补这一空白。我们发现,疾病相关蛋白并不存在于局部密集的群落中,相反,我们将连接显著性确定为最具预测性的量。这个量激发了一种新颖的疾病模块检测(DIAMOnD)算法的设计,以识别围绕一组已知疾病蛋白的完整疾病模块。我们使用严格控制的合成数据研究了该算法的性能,并系统地验证了针对大量疾病语料库所识别出的邻域。