Department of Computer Science, University of Maryland, College Park, Maryland, United States of America ; Liver Carcinogenesis Section, Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America.
PLoS One. 2013 Nov 4;8(11):e78624. doi: 10.1371/journal.pone.0078624. eCollection 2013.
The high tumor heterogeneity makes it very challenging to identify key tumorigenic pathways as therapeutic targets. The integration of multiple omics data is a promising approach to identify driving regulatory networks in patient subgroups. Here, we propose a novel conceptual framework to discover patterns of miRNA-gene networks, observed frequently up- or down-regulated in a group of patients and to use such networks for patient stratification in hepatocellular carcinoma (HCC). We developed an integrative subgraph mining approach, called iSubgraph, and identified altered regulatory networks frequently observed in HCC patients. The miRNA and gene expression profiles were jointly analyzed in a graph structure. We defined a method to transform microarray data into graph representation that encodes miRNA and gene expression levels and the interactions between them as well. The iSubgraph algorithm was capable to detect cooperative regulation of miRNAs and genes even if it occurred only in some patients. Next, the miRNA-mRNA modules were used in an unsupervised class prediction model to discover HCC subgroups via patient clustering by mixture models. The robustness analysis of the mixture model showed that the class predictions are highly stable. Moreover, the Kaplan-Meier survival analysis revealed that the HCC subgroups identified by the algorithm have different survival characteristics. The pathway analyses of the miRNA-mRNA co-modules identified by the algorithm demonstrate key roles of Myc, E2F1, let-7, TGFB1, TNF and EGFR in HCC subgroups. Thus, our method can integrate various omics data derived from different platforms and with different dynamic scales to better define molecular tumor subtypes. iSubgraph is available as MATLAB code at http://www.cs.umd.edu/~ozdemir/isubgraph/.
肿瘤异质性高,使得确定关键的肿瘤发生途径作为治疗靶点极具挑战性。整合多种组学数据是识别患者亚群中驱动调控网络的一种很有前途的方法。在这里,我们提出了一种新的概念框架,用于发现 miRNA-基因网络模式,这些模式在一组患者中经常观察到上调或下调,并将这些网络用于肝细胞癌 (HCC) 的患者分层。我们开发了一种名为 iSubgraph 的集成子图挖掘方法,并鉴定了 HCC 患者中经常观察到的改变的调控网络。miRNA 和基因表达谱在图结构中进行联合分析。我们定义了一种将微阵列数据转换为图表示的方法,该方法可以对 miRNA 和基因表达水平及其相互作用进行编码。iSubgraph 算法能够检测 miRNA 和基因的协同调控,即使这种调控仅在一些患者中发生。接下来,使用 miRNA-mRNA 模块通过混合模型对患者进行聚类,在无监督分类预测模型中发现 HCC 亚组。混合模型的稳健性分析表明,分类预测具有很高的稳定性。此外,Kaplan-Meier 生存分析表明,算法识别的 HCC 亚组具有不同的生存特征。算法鉴定的 miRNA-mRNA 共模块的通路分析表明,Myc、E2F1、let-7、TGFB1、TNF 和 EGFR 在 HCC 亚组中发挥关键作用。因此,我们的方法可以整合来自不同平台和不同动态尺度的各种组学数据,以更好地定义分子肿瘤亚型。iSubgraph 可作为 MATLAB 代码在 http://www.cs.umd.edu/~ozdemir/isubgraph/ 获得。