Kim Dokyoon, Li Ruowang, Lucas Anastasia, Verma Shefali S, Dudek Scott M, Ritchie Marylyn D
Biomedical and Translational Informatics, Geisinger Health System, Danville, Pennsylvania, USA.
Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA.
J Am Med Inform Assoc. 2017 May 1;24(3):577-587. doi: 10.1093/jamia/ocw165.
It is common that cancer patients have different molecular signatures even though they have similar clinical features, such as histology, due to the heterogeneity of tumors. To overcome this variability, we previously developed a new approach incorporating prior biological knowledge that identifies knowledge-driven genomic interactions associated with outcomes of interest. However, no systematic approach has been proposed to identify interaction models between pathways based on multi-omics data. Here we have proposed such a novel methodological framework, called metadimensional knowledge-driven genomic interactions (MKGIs). To test the utility of the proposed framework, we applied it to an ovarian cancer dataset including multi-omics profiles from The Cancer Genome Atlas to predict grade, stage, and survival outcome. We found that each knowledge-driven genomic interaction model, based on different genomic datasets, contains different sets of pathway features, which suggests that each genomic data type may contribute to outcomes in ovarian cancer via a different pathway. In addition, MKGI models significantly outperformed the single knowledge-driven genomic interaction model. From the MKGI models, many interactions between pathways associated with outcomes were found, including the mitogen-activated protein kinase (MAPK) signaling pathway and the gonadotropin-releasing hormone (GnRH) signaling pathway, which are known to play important roles in cancer pathogenesis. The beauty of incorporating biological knowledge into the model based on multi-omics data is the ability to improve diagnosis and prognosis and provide better interpretability. Thus, determining variability in molecular signatures based on these interactions between pathways may lead to better diagnostic/treatment strategies for better precision medicine.
由于肿瘤的异质性,癌症患者即使具有相似的临床特征(如组织学特征),也会有不同的分子特征。为了克服这种变异性,我们之前开发了一种新方法,该方法纳入了先验生物学知识,可识别与感兴趣的结果相关的知识驱动型基因组相互作用。然而,尚未有人提出基于多组学数据来识别通路间相互作用模型的系统方法。在此,我们提出了这样一种新颖的方法框架,称为元维度知识驱动型基因组相互作用(MKGIs)。为了测试所提出框架的效用,我们将其应用于一个卵巢癌数据集,该数据集包含来自癌症基因组图谱的多组学概况,以预测分级、分期和生存结果。我们发现,基于不同基因组数据集的每个知识驱动型基因组相互作用模型都包含不同的通路特征集,这表明每种基因组数据类型可能通过不同的通路对卵巢癌的结果产生影响。此外,MKGI模型显著优于单一的知识驱动型基因组相互作用模型。从MKGI模型中,发现了许多与结果相关的通路之间的相互作用,包括丝裂原活化蛋白激酶(MAPK)信号通路和促性腺激素释放激素(GnRH)信号通路,已知它们在癌症发病机制中起重要作用。将生物学知识纳入基于多组学数据的模型的美妙之处在于能够改善诊断和预后,并提供更好的可解释性。因此,基于这些通路间相互作用来确定分子特征的变异性可能会带来更好的诊断/治疗策略,以实现更好的精准医学。