NICTA, Victoria Laboratory and Department of Computing and Information Systems, University of Melbourne, Parkville, Vic 3010, Australia.
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2105-14-S2-S7. Epub 2013 Jan 21.
Gene expression profiles can show significant changes when genetically diseased cells are compared with non-diseased cells. Biological networks are often used to identify active subnetworks (ASNs) of the diseases from the expression profiles to understand the reason behind the observed changes. Current methodologies for discovering ASNs mostly use undirected PPI networks and node centric approaches. This can limit their ability to find the meaningful ASNs when using integrated networks having comprehensive information than the traditional protein-protein interaction networks. Using appropriate scoring functions to assess both genes and their interactions may allow the discovery of better ASNs. In this paper, we present CASNet, which aims to identify better ASNs using (i) integrated interaction networks (mixed graphs), (ii) directions of regulations of genes, and (iii) combined node and edge scores. We simplify and extend previous methodologies to incorporate edge evaluations and lessen their sensitivity to significance thresholds. We formulate our objective functions using mixed integer programming (MIP) and show that optimal solutions may be obtained. We compare the ASNs obtained by CASNet and similar other approaches to show that CASNet can often discover more meaningful and stable regulatory ASNs. Our analysis of a breast cancer dataset finds that the positive feedback loops across 7 genes, AR, ESR1, MYC, E2F2, PGR, BCL2 and CCND1 are conserved across the basal/triple negative subtypes in multiple datasets that could potentially explain the aggressive nature of this cancer subtype. Furthermore, comparison of the basal subtype of breast cancer and the mesenchymal subtype of glioblastoma ASNs shows that an ASN in the vicinity of IL6 is conserved across the two subtypes. This result suggests that subtypes of different cancers can show molecular similarities indicating that the therapeutic approaches in different types of cancers may be shared.
当比较遗传疾病细胞与非疾病细胞时,基因表达谱会显示出显著变化。生物网络通常用于从表达谱中识别疾病的活跃子网络(ASN),以了解观察到的变化背后的原因。目前用于发现 ASN 的方法大多使用无向 PPI 网络和基于节点的方法。当使用具有比传统蛋白质-蛋白质相互作用网络更全面信息的综合网络时,这可能会限制它们发现有意义的 ASN 的能力。使用适当的评分函数来评估基因及其相互作用可能会发现更好的 ASN。在本文中,我们提出了 CASNet,旨在使用(i)综合相互作用网络(混合图)、(ii)基因调控方向和(iii)组合节点和边评分来识别更好的 ASN。我们简化并扩展了以前的方法,以纳入边评估并降低它们对显著阈值的敏感性。我们使用混合整数规划(MIP)来制定我们的目标函数,并表明可以获得最优解。我们将 CASNet 获得的 ASN 与其他类似方法进行比较,结果表明 CASNet 通常可以发现更有意义和稳定的调控 ASN。我们对乳腺癌数据集的分析发现,在多个数据集的基底/三阴性亚型中,跨越 7 个基因(AR、ESR1、MYC、E2F2、PGR、BCL2 和 CCND1)的正反馈回路是保守的,这可能解释了这种癌症亚型的侵袭性本质。此外,对乳腺癌的基底亚型和胶质母细胞瘤的间充质亚型的 ASN 进行比较,结果表明在两个亚型中,IL6 附近的一个 ASN 是保守的。这一结果表明,不同癌症的亚型可以表现出分子相似性,这表明不同类型癌症的治疗方法可能是共享的。