• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合多组学分析血液样本可改善模型预测:在癌症中的应用。

Integrative, multi-omics, analysis of blood samples improves model predictions: applications to cancer.

机构信息

Oslo Center for Biostatistics and Epidemiology, UiO, University of Oslo, Oslo, Norway.

Department of Community Medicine, UiT, The Arctic University of Norway, Tromsö, Norway.

出版信息

BMC Bioinformatics. 2021 Aug 5;22(1):395. doi: 10.1186/s12859-021-04296-0.

DOI:10.1186/s12859-021-04296-0
PMID:34353282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8340537/
Abstract

BACKGROUND

Cancer genomic studies often include data collected from several omics platforms. Each omics data source contributes to the understanding of the underlying biological process via source specific ("individual") patterns of variability. At the same time, statistical associations and potential interactions among the different data sources can reveal signals from common biological processes that might not be identified by single source analyses. These common patterns of variability are referred to as "shared" or "joint". In this work, we show how the use of joint and individual components can lead to better predictive models, and to a deeper understanding of the biological process at hand. We identify joint and individual contributions of DNA methylation, miRNA and mRNA expression collected from blood samples in a lung cancer case-control study nested within the Norwegian Women and Cancer (NOWAC) cohort study, and we use such components to build prediction models for case-control and metastatic status. To assess the quality of predictions, we compare models based on simultaneous, integrative analysis of multi-source omics data to a standard non-integrative analysis of each single omics dataset, and to penalized regression models. Additionally, we apply the proposed approach to a breast cancer dataset from The Cancer Genome Atlas.

RESULTS

Our results show how an integrative analysis that preserves both components of variation is more appropriate than standard multi-omics analyses that are not based on such a distinction. Both joint and individual components are shown to contribute to a better quality of model predictions, and facilitate the interpretation of the underlying biological processes in lung cancer development.

CONCLUSIONS

In the presence of multiple omics data sources, we recommend the use of data integration techniques that preserve the joint and individual components across the omics sources. We show how the inclusion of such components increases the quality of model predictions of clinical outcomes.

摘要

背景

癌症基因组研究通常包括从多个组学平台收集的数据。每个组学数据源通过特定于源的(“个体”)可变性模式为理解潜在的生物学过程做出贡献。同时,不同数据源之间的统计关联和潜在相互作用可以揭示可能无法通过单源分析识别的常见生物学过程的信号。这些共同的可变性模式被称为“共享”或“联合”。在这项工作中,我们展示了如何使用联合和个体成分来构建更好的预测模型,并深入了解手头的生物学过程。我们确定了在嵌套于挪威妇女与癌症(NOWAC)队列研究中的肺癌病例对照研究中从血液样本中收集的 DNA 甲基化、miRNA 和 mRNA 表达的联合和个体贡献,并使用这些成分构建病例对照和转移状态的预测模型。为了评估预测的质量,我们将基于多组学数据的同时、综合分析的模型与每个单组学数据集的非综合分析以及惩罚回归模型进行比较。此外,我们将提出的方法应用于来自癌症基因组图谱的乳腺癌数据集。

结果

我们的结果表明,与不基于这种区别的标准多组学分析相比,保留变异的两个成分的综合分析更合适。联合和个体成分都被证明有助于提高模型预测的质量,并促进对肺癌发展中潜在生物学过程的解释。

结论

在存在多个组学数据源的情况下,我们建议使用保留组学源之间联合和个体成分的数据集成技术。我们展示了包含这些成分如何提高临床结果模型预测的质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/763056dd8964/12859_2021_4296_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/d0047831ac53/12859_2021_4296_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/e882e84c0f41/12859_2021_4296_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/ce54815b5a4f/12859_2021_4296_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/4bec2bd3422f/12859_2021_4296_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/763056dd8964/12859_2021_4296_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/d0047831ac53/12859_2021_4296_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/e882e84c0f41/12859_2021_4296_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/ce54815b5a4f/12859_2021_4296_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/4bec2bd3422f/12859_2021_4296_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c56/8340537/763056dd8964/12859_2021_4296_Fig5_HTML.jpg

相似文献

1
Integrative, multi-omics, analysis of blood samples improves model predictions: applications to cancer.整合多组学分析血液样本可改善模型预测:在癌症中的应用。
BMC Bioinformatics. 2021 Aug 5;22(1):395. doi: 10.1186/s12859-021-04296-0.
2
Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.基于深度学习的多组学生物标志物数据特征层融合在乳腺癌患者生存分析中的应用。
BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.
3
HCNM: Heterogeneous Correlation Network Model for Multi-level Integrative Study of Multi-omics Data for Cancer Subtype Prediction.HCNM:用于癌症亚型预测的多组学数据多层次综合研究的异质相关网络模型。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:1880-1886. doi: 10.1109/EMBC46164.2021.9630781.
4
Improving prediction performance of colon cancer prognosis based on the integration of clinical and multi-omics data.基于临床和多组学数据整合提高结肠癌预后预测性能。
BMC Med Inform Decis Mak. 2020 Feb 7;20(1):22. doi: 10.1186/s12911-020-1043-1.
5
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.
6
Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference.通过途径活性推断,对 RPPA 蛋白质组学数据与多组学数据进行拓扑整合,以进行乳腺癌的生存预测。
BMC Med Genomics. 2019 Jul 11;12(Suppl 5):94. doi: 10.1186/s12920-019-0511-x.
7
Knowledge boosting: a graph-based integration approach with multi-omics data and genomic knowledge for cancer clinical outcome prediction.知识增强:一种基于图的整合方法,利用多组学数据和基因组知识进行癌症临床结果预测。
J Am Med Inform Assoc. 2015 Jan;22(1):109-20. doi: 10.1136/amiajnl-2013-002481. Epub 2014 Jul 7.
8
Integrative factorization of bidimensionally linked matrices.二维关联矩阵的综合分解。
Biometrics. 2020 Mar;76(1):61-74. doi: 10.1111/biom.13141. Epub 2019 Nov 10.
9
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
10
Sliced inverse regression for integrative multi-omics data analysis.用于整合多组学数据分析的切片逆回归
Stat Appl Genet Mol Biol. 2019 Jan 26;18(1):/j/sagmb.2019.18.issue-1/sagmb-2018-0028/sagmb-2018-0028.xml. doi: 10.1515/sagmb-2018-0028.

引用本文的文献

1
Gene expression prognostic of early relapse risk in low-risk B-cell acute lymphoblastic leukaemia in children.儿童低危B细胞急性淋巴细胞白血病早期复发风险的基因表达预后分析
EJHaem. 2024 Mar 15;5(2):333-345. doi: 10.1002/jha2.872. eCollection 2024 Apr.
2
A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction.一种基于机器学习和深度学习的用于白血病预测的集成多组学技术。
Heliyon. 2024 Feb 1;10(3):e25369. doi: 10.1016/j.heliyon.2024.e25369. eCollection 2024 Feb 15.
3
Early detection of hepatocellular carcinoma via no end-repair enzymatic methylation sequencing of cell-free DNA and pre-trained neural network.

本文引用的文献

1
Transcriptomic signals in blood prior to lung cancer focusing on time to diagnosis and metastasis.肺癌前血液中的转录组信号,重点关注诊断和转移时间。
Sci Rep. 2021 Apr 1;11(1):7406. doi: 10.1038/s41598-021-86879-8.
2
Distributed estimation of principal eigenspaces.主特征空间的分布式估计
Ann Stat. 2019 Dec;47(6):3009-3031. doi: 10.1214/18-AOS1713. Epub 2019 Oct 31.
3
Consistency and overfitting of multi-omics methods on experimental data.多组学方法在实验数据上的一致性和过拟合。
基于无末端修复酶促甲基化测序的循环游离 DNA 和预训练神经网络早期检测肝细胞癌
Genome Med. 2023 Nov 8;15(1):93. doi: 10.1186/s13073-023-01238-8.
4
Exposures to pesticides and risk of cancer: Evaluation of recent epidemiological evidence in humans and paths forward.暴露于农药和癌症风险:对人类近期流行病学证据的评估及未来方向。
Int J Cancer. 2023 Mar 1;152(5):879-912. doi: 10.1002/ijc.34300. Epub 2022 Oct 25.
5
Ultrasound radiomics in personalized breast management: Current status and future prospects.个性化乳腺管理中的超声影像组学:现状与未来展望
Front Oncol. 2022 Aug 17;12:963612. doi: 10.3389/fonc.2022.963612. eCollection 2022.
6
Multi-Omics Integration-Based Prioritisation of Competing Endogenous RNA Regulation Networks in Small Cell Lung Cancer: Molecular Characteristics and Drug Candidates.基于多组学整合的小细胞肺癌竞争性内源RNA调控网络优先级排序:分子特征与候选药物
Front Oncol. 2022 Jul 4;12:904865. doi: 10.3389/fonc.2022.904865. eCollection 2022.
7
Heterogeneous data integration methods for patient similarity networks.用于患者相似网络的异质数据集成方法。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac207.
Brief Bioinform. 2020 Jul 15;21(4):1277-1284. doi: 10.1093/bib/bbz070.
4
Structural learning and integrative decomposition of multi-view data.多视图数据的结构学习与整合分解
Biometrics. 2019 Dec;75(4):1121-1132. doi: 10.1111/biom.13108. Epub 2019 Sep 15.
5
DNA methylation and associated gene expression in blood prior to lung cancer diagnosis in the Norwegian Women and Cancer cohort.肺癌诊断前挪威妇女与癌症队列中血液中的 DNA 甲基化及其相关基因表达。
Sci Rep. 2018 Nov 13;8(1):16714. doi: 10.1038/s41598-018-34334-6.
6
Multi-omic and multi-view clustering algorithms: review and cancer benchmark.多组学和多视角聚类算法:综述和癌症基准测试。
Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889.
7
Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.多组学因子分析——一种用于无监督整合多组学数据集的框架。
Mol Syst Biol. 2018 Jun 20;14(6):e8124. doi: 10.15252/msb.20178124.
8
mixOmics: An R package for 'omics feature selection and multiple data integration.mixOmics:一个用于“组学”特征选择和多数据整合的R包。
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752. doi: 10.1371/journal.pcbi.1005752. eCollection 2017 Nov.
9
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival.利用多分子数据源降维预测患者生存率
Cancer Inform. 2017 Jul 11;16:1176935117718517. doi: 10.1177/1176935117718517. eCollection 2017.
10
More Is Better: Recent Progress in Multi-Omics Data Integration Methods.越多越好:多组学数据整合方法的最新进展
Front Genet. 2017 Jun 16;8:84. doi: 10.3389/fgene.2017.00084. eCollection 2017.