School of Computer Science and Technology, Xidian University, Xi'an, People's Republic of China.
School of Humanities and Foreign Languages, Xi'an University of Technology, Xi'an, People's Republic of China.
BMC Bioinformatics. 2020 Oct 2;21(1):433. doi: 10.1186/s12859-020-03769-y.
Precise disease module is conducive to understanding the molecular mechanism of disease causation and identifying drug targets. However, due to the fragmentization of disease module in incomplete human interactome, how to determine connectivity pattern and detect a complete neighbourhood of disease based on this is still an open question.
In this paper, we perform exploratory analysis leading to an important observation that through a few intermediate nodes, most separate connected components formed by disease-associated proteins can be effectively connected and eventually form a complete disease module. And based on the topological properties of these intermediate nodes, we propose a connect separate connected components (C3) method to detect a succinct disease module by introducing a relatively small number of intermediate nodes, which allows us to obtain more pure disease module than other methods. Then we apply C3 across a large corpus of diseases to validate this connectivity pattern of disease module. Furthermore, the connectivity of the perturbed genes in multi-omics data such as The Cancer Genome Atlas also fits this pattern.
C3 tool is not only useful in detecting a clearly-defined connected disease neighbourhood of 299 diseases and cancer with multi-omics data, but also helpful in better understanding the interconnection of phenotypically related genes in different omics data and studying complex pathological processes.
精确的疾病模块有助于理解疾病发病的分子机制,并确定药物靶点。然而,由于不完全的人类相互作用网络中疾病模块的碎片化,如何基于此确定连接模式并检测完整的疾病邻域仍然是一个悬而未决的问题。
在本文中,我们进行了探索性分析,得出了一个重要的观察结果,即通过少数中间节点,可以有效地连接由疾病相关蛋白形成的大多数分离的连通组件,并最终形成一个完整的疾病模块。并且基于这些中间节点的拓扑性质,我们提出了一种连接分离连通组件(C3)的方法,通过引入少量的中间节点来检测简洁的疾病模块,从而比其他方法获得更纯粹的疾病模块。然后,我们将 C3 应用于大量的疾病中,以验证这种疾病模块的连接模式。此外,多组学数据(如癌症基因组图谱)中扰动基因的连通性也符合这种模式。
C3 工具不仅有助于检测 299 种疾病和癌症的多组学数据中明确定义的连通疾病邻域,还有助于更好地理解不同组学数据中表型相关基因的相互联系,以及研究复杂的病理过程。