Suppr超能文献

整合多组学数据进行建模以识别癌症驱动因素并推断患者特异性基因活性。

Integrative modeling of multi-omics data to identify cancer drivers and infer patient-specific gene activity.

作者信息

Pavel Ana B, Sonkin Dmitriy, Reddy Anupama

机构信息

Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, 02215, MA, USA.

Section of Computational Biomedicine, Boston University School of Medicine, 72 East Concord Street, Boston, 02118, MA, USA.

出版信息

BMC Syst Biol. 2016 Feb 11;10:16. doi: 10.1186/s12918-016-0260-9.

Abstract

BACKGROUND

High throughput technologies have been used to profile genes in multiple different dimensions, such as genetic variation, copy number, gene and protein expression, epigenetics, metabolomics. Computational analyses often treat these different data types as independent, leading to an explosion in the number of features making studies under-powered and more importantly do not provide a comprehensive view of the gene's state. We sought to infer gene activity by integrating different dimensions using biological knowledge of oncogenes and tumor suppressors.

RESULTS

This paper proposes an integrative model of oncogene and tumor suppressor activity in cells which is used to identify cancer drivers and compute patient-specific gene activity scores. We have developed a Fuzzy Logic Modeling (FLM) framework to incorporate biological knowledge with multi-omics data such as somatic mutation, gene expression and copy number measurements. The advantage of using a fuzzy logic approach is to abstract meaningful biological rules from low-level numerical data. Biological knowledge is often qualitative, thus combining it with quantitative numerical measurements may leverage new biological insights about a gene's state. We show that the oncogenic and altered tumor suppressing state of a gene can be better characterized by integrating different molecular measurements with biological knowledge than by each data type alone. We validate the gene activity score using data from the Cancer Cell Line Encyclopedia and drug sensitivity data for five compounds: BYL719 (PIK3CA inhibitor), PLX4720 (BRAF inhibitor), AZD6244 (MEK inhibitor), Erlotinib (EGFR inhibitor), and Nutlin-3 (MDM2 inhibitor). The integrative score improves prediction of drug sensitivity for the known drug targets of these compounds compared to each data type alone. The gene activity scores are also used to cluster colorectal cancer cell lines. Two subtypes of CRCs were found and potential cancer drivers and therapeutic targets for each of the subtypes were identified.

CONCLUSIONS

We propose a fuzzy logic based approach to infer gene activity in cancer by integrating numerical data with descriptive biological knowledge. We compute general patient-specific gene-level scores useful to determine the oncogenic or tumor suppressor status of cancer gene drivers and to cluster or classify patients.

摘要

背景

高通量技术已被用于从多个不同维度对基因进行分析,如遗传变异、拷贝数、基因和蛋白质表达、表观遗传学、代谢组学。计算分析通常将这些不同的数据类型视为相互独立的,导致特征数量激增,使研究的效能不足,更重要的是无法提供基因状态的全面视图。我们试图通过利用癌基因和肿瘤抑制基因的生物学知识整合不同维度来推断基因活性。

结果

本文提出了一种细胞中癌基因和肿瘤抑制基因活性的整合模型,用于识别癌症驱动基因并计算患者特异性基因活性评分。我们开发了一个模糊逻辑建模(FLM)框架,将生物学知识与多组学数据(如体细胞突变、基因表达和拷贝数测量)相结合。使用模糊逻辑方法的优势在于从低级数值数据中抽象出有意义的生物学规则。生物学知识通常是定性的,因此将其与定量数值测量相结合可能会产生关于基因状态的新生物学见解。我们表明,与单独使用每种数据类型相比,通过将不同的分子测量与生物学知识相结合,可以更好地表征基因的致癌和改变的肿瘤抑制状态。我们使用来自癌症细胞系百科全书的数据和五种化合物的药物敏感性数据验证了基因活性评分:BYL719(PIK3CA抑制剂)、PLX4720(BRAF抑制剂)、AZD6244(MEK抑制剂)、厄洛替尼(EGFR抑制剂)和Nutlin-3(MDM2抑制剂)。与单独使用每种数据类型相比,整合评分提高了对这些化合物已知药物靶点的药物敏感性预测。基因活性评分还用于对结肠癌细胞系进行聚类。发现了两种结直肠癌亚型,并确定了每种亚型的潜在癌症驱动基因和治疗靶点。

结论

我们提出了一种基于模糊逻辑的方法,通过将数值数据与描述性生物学知识相结合来推断癌症中的基因活性。我们计算了通用的患者特异性基因水平评分,有助于确定癌症基因驱动基因的致癌或肿瘤抑制状态,并对患者进行聚类或分类。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验