Widespread redundancy in -omics profiles of cancer mutation states.

Affiliations

Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

Department of Epidemiology, Geisel School of Medicine, Dartmouth College, Lebanon, NH, USA.

Publication information

Genome Biol. 2022 Jun 27;23(1):137. doi: 10.1186/s13059-022-02705-y.

DOI:10.1186/s13059-022-02705-y

PMID:35761387

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9238138/

Abstract

BACKGROUND

In studies of cellular function in cancer, researchers are increasingly able to choose from many -omics assays as functional readouts. Choosing the correct readout for a given study can be difficult, and which layer of cellular function is most suitable to capture the relevant signal remains unclear.

RESULTS

We consider prediction of cancer mutation status (presence or absence) from functional -omics data as a representative problem that presents an opportunity to quantify and compare the ability of different -omics readouts to capture signals of dysregulation in cancer. From the TCGA Pan-Cancer Atlas that contains genetic alteration data, we focus on RNA sequencing, DNA methylation arrays, reverse phase protein arrays (RPPA), microRNA, and somatic mutational signatures as -omics readouts. Across a collection of genes recurrently mutated in cancer, RNA sequencing tends to be the most effective predictor of mutation state. For many of the genes, however, one or more other data types are approximately equally effective predictors. Performance varies more between mutations than between data types for the same mutation, and there is little difference between the top data types. We also find that combining data types into a single multi-omics model provides little or no improvement in predictive ability over the best individual data type.

CONCLUSIONS

Based on our results, for the design of studies focused on the functional outcomes of cancer mutations, there are often multiple -omics types that can serve as effective readouts, although gene expression seems to be a reasonable default option.
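The comparison described in the Results (scoring several data types on how well each predicts one binary mutation label for the same samples) can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the data are synthetic rather than TCGA, the data type names and signal strengths are invented, and the nearest-centroid scorer and AUROC metric are simple stand-ins, since the abstract does not state which models or metrics the authors used.

```python
import random
import statistics


def simulate_features(labels, n_features, n_informative, shift, rng):
    """Synthetic 'data type': only the first n_informative features shift in mutated samples."""
    return [
        [rng.gauss(shift if (y and j < n_informative) else 0.0, 1.0)
         for j in range(n_features)]
        for y in labels
    ]


def centroid_scores(train_X, train_y, test_X):
    """Score test samples by projecting onto (mutated centroid - wild-type centroid)."""
    pos = [x for x, y in zip(train_X, train_y) if y]
    neg = [x for x, y in zip(train_X, train_y) if not y]
    direction = [
        statistics.fmean(r[j] for r in pos) - statistics.fmean(r[j] for r in neg)
        for j in range(len(train_X[0]))
    ]
    return [sum(d * v for d, v in zip(direction, x)) for x in test_X]


def auroc(scores, labels):
    """Chance that a random mutated sample outscores a random wild-type one (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def evaluate(X, y, n_train):
    """Fit on the first n_train samples, report AUROC on the held-out remainder."""
    scores = centroid_scores(X[:n_train], y[:n_train], X[n_train:])
    return auroc(scores, y[n_train:])


rng = random.Random(0)
# One shared mutation label per sample, so every data type is judged on the same task.
labels = [rng.random() < 0.3 for _ in range(400)]

# Hypothetical data types with invented signal strengths (not estimates from the paper).
data_types = {
    "expression": simulate_features(labels, 50, 10, 1.0, rng),
    "methylation": simulate_features(labels, 50, 10, 0.9, rng),
    "noise_only": simulate_features(labels, 50, 0, 0.0, rng),
}

results = {name: evaluate(X, labels, n_train=300) for name, X in data_types.items()}
for name, auc in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{name}: AUROC = {auc:.3f}")
```

In this toy setup the two informative data types should score close to each other and well above the noise-only control, which mirrors the kind of redundancy the abstract reports. A single train/test split is used only to keep the sketch short; a real comparison would use cross-validation and repeat the analysis per gene.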
