College of Computer Science and Electronic Engineering, Hunan University, Lushan Nan Road, Changsha, 410082, China.
Hunan Want Want Hospital, Renmin Zhong Road, Changsha, 410006, China.
BMC Genomics. 2021 Jun 10;22(Suppl 1):436. doi: 10.1186/s12864-021-07628-9.
Gene interaction patterns, including modules and motifs, can be used to identify cancer specific biomarkers and to reveal the mechanism of tumorigenesis. Most of the existing module network inferencing methods focus on gene independent functional patterns, while the studies of overlapping characteristics between modules are lacking. The objective of this study was to reveal the functional overlapping patterns in gene modules, helping elucidate the regulatory relationship between overlapping genes and communities, as well as to explore cancer formation and progression.
We analyzed six cancer datasets from The Cancer Genome Atlas and obtained three kinds of gene functional modules for each cancer, including Independent-Community, Dependent-Community and Merged-Community. In the six cancers, 59(3.5%) Independent-Communities were identified, while 1631(96.5%) Dependent-Communities were acquired. Compared with Lemon-Tree and K-Means, the gene communities identified by our method were enriched in more known GO categories with lower p-values. Meanwhile, those identified distinguishing communities can significantly distinguish the survival prognostic of patients by Kaplan-Meier analysis. Furthermore, identified driver genes in the gene communities can be considered as biomarkers which can accurately distinguish the tumour or normal samples for each cancer type.
In all identified communities, Dependent-Communities are the majority. Our method is more effective than the other two methods which do not consider the overlapping characteristics of modules. This indicates that overlapping genes are located in different specific functional groups, and a communication bridge is established between the communities to construct a comprehensive carcinogenesis.
基因相互作用模式,包括模块和基序,可以用于鉴定癌症特异性生物标志物,并揭示肿瘤发生的机制。大多数现有的模块网络推断方法侧重于基因独立的功能模式,而对模块之间的重叠特征的研究则较少。本研究的目的是揭示基因模块中的功能重叠模式,有助于阐明重叠基因和社区之间的调控关系,以及探索癌症的形成和发展。
我们分析了来自癌症基因组图谱的六个癌症数据集,并为每种癌症获得了三种基因功能模块,包括独立社区、依赖社区和合并社区。在这六种癌症中,鉴定了 59 个(3.5%)独立社区,而获得了 1631 个(96.5%)依赖社区。与 Lemon-Tree 和 K-Means 相比,我们的方法鉴定的基因社区在更多的已知 GO 类别中富集,p 值更低。同时,通过 Kaplan-Meier 分析,那些鉴定出的区分社区可以显著区分患者的生存预后。此外,在基因社区中鉴定出的驱动基因可以被认为是生物标志物,它们可以准确地区分每种癌症类型的肿瘤或正常样本。
在所有鉴定出的社区中,依赖社区是多数。我们的方法比不考虑模块重叠特征的另外两种方法更有效。这表明重叠基因位于不同的特定功能组中,并且在社区之间建立了一个通信桥梁,以构建一个全面的致癌发生。