• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模型的组织学和基因组学联合嵌入,使用典型相关分析进行乳腺癌生存预测。

Modelling-based joint embedding of histology and genomics using canonical correlation analysis for breast cancer survival prediction.

机构信息

Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, 61801, IL, USA.

IBM Research Almaden, San Jose, 95120, CA, USA.

出版信息

Artif Intell Med. 2024 Mar;149:102787. doi: 10.1016/j.artmed.2024.102787. Epub 2024 Jan 26.

DOI:10.1016/j.artmed.2024.102787
PMID:38462287
Abstract

Traditional approaches to predicting breast cancer patients' survival outcomes were based on clinical subgroups, the PAM50 genes, or the histological tissue's evaluation. With the growth of multi-modality datasets capturing diverse information (such as genomics, histology, radiology and clinical data) about the same cancer, information can be integrated using advanced tools and have improved survival prediction. These methods implicitly exploit the key observation that different modalities originate from the same cancer source and jointly provide a complete picture of the cancer. In this work, we investigate the benefits of explicitly modelling multi-modality data as originating from the same cancer under a probabilistic framework. Specifically, we consider histology and genomics as two modalities originating from the same breast cancer under a probabilistic graphical model (PGM). We construct maximum likelihood estimates of the PGM parameters based on canonical correlation analysis (CCA) and then infer the underlying properties of the cancer patient, such as survival. Equivalently, we construct CCA-based joint embeddings of the two modalities and input them to a learnable predictor. Real-world properties of sparsity and graph-structures are captured in the penalized variants of CCA (pCCA) and are better suited for cancer applications. For generating richer multi-dimensional embeddings with pCCA, we introduce two novel embedding schemes that encourage orthogonality to generate more informative embeddings. The efficacy of our proposed prediction pipeline is first demonstrated via low prediction errors of the hidden variable and the generation of informative embeddings on simulated data. When applied to breast cancer histology and RNA-sequencing expression data from The Cancer Genome Atlas (TCGA), our model can provide survival predictions with average concordance-indices of up to 68.32% along with interpretability. We also illustrate how the pCCA embeddings can be used for survival analysis through Kaplan-Meier curves.

摘要

传统的预测乳腺癌患者生存结果的方法基于临床亚组、PAM50 基因或组织学评估。随着多模态数据集的增长,这些数据集捕获了关于同一癌症的各种信息(如基因组学、组织学、放射学和临床数据),可以使用先进的工具整合这些信息,并提高生存预测的能力。这些方法隐含地利用了一个关键观察结果,即不同的模态源于同一癌症来源,并共同提供了癌症的完整图景。在这项工作中,我们在概率框架下研究了明确地将多模态数据建模为源于同一癌症的好处。具体来说,我们将组织学和基因组学视为在概率图模型(PGM)下源于同一乳腺癌的两种模态。我们基于典型相关分析(CCA)构建 PGM 参数的最大似然估计,然后推断癌症患者的潜在特征,如生存情况。或者,我们构建基于 CCA 的两种模态的联合嵌入,并将其输入可学习的预测器中。稀疏性和图结构的真实世界特性在 CCA 的惩罚变体(pCCA)中得到了捕捉,并且更适合癌症应用。为了生成具有 pCCA 的更丰富的多维嵌入,我们引入了两种新的嵌入方案,鼓励正交性以生成更具信息量的嵌入。我们提出的预测管道的有效性首先通过隐藏变量的低预测误差和在模拟数据上生成的信息丰富的嵌入来证明。当应用于来自癌症基因组图谱(TCGA)的乳腺癌组织学和 RNA 测序表达数据时,我们的模型可以提供生存预测,平均一致性指数高达 68.32%,并具有可解释性。我们还通过 Kaplan-Meier 曲线说明了 pCCA 嵌入如何用于生存分析。

相似文献

1
Modelling-based joint embedding of histology and genomics using canonical correlation analysis for breast cancer survival prediction.基于模型的组织学和基因组学联合嵌入,使用典型相关分析进行乳腺癌生存预测。
Artif Intell Med. 2024 Mar;149:102787. doi: 10.1016/j.artmed.2024.102787. Epub 2024 Jan 26.
2
Quantifying common and distinct information in single-cell multimodal data with Tilted Canonical Correlation Analysis.使用倾斜典范相关分析量化单细胞多模态数据中的常见和独特信息。
Proc Natl Acad Sci U S A. 2023 Aug 8;120(32):e2303647120. doi: 10.1073/pnas.2303647120. Epub 2023 Jul 31.
3
Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.基于深度学习的多组学生物标志物数据特征层融合在乳腺癌患者生存分析中的应用。
BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.
4
Integrating multi-omics data by learning modality invariant representations for improved prediction of overall survival of cancer.通过学习模态不变表示来整合多组学数据,以提高癌症总体生存预测的准确性。
Methods. 2021 May;189:74-85. doi: 10.1016/j.ymeth.2020.07.008. Epub 2020 Aug 5.
5
Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。
J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Tightly integrated multiomics-based deep tensor survival model for time-to-event prediction.基于深度张量生存模型的紧密集成多组学方法用于事件时间预测。
Bioinformatics. 2022 Jun 13;38(12):3259-3266. doi: 10.1093/bioinformatics/btac286.
8
Integrating genomic data and pathological images to effectively predict breast cancer clinical outcome.整合基因组数据和病理图像,有效预测乳腺癌临床预后。
Comput Methods Programs Biomed. 2018 Jul;161:45-53. doi: 10.1016/j.cmpb.2018.04.008. Epub 2018 Apr 19.
9
MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.MAMF-GCN:用于预测精神障碍的多尺度自适应多通道融合深度图卷积网络。
Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.
10
Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference.通过途径活性推断,对 RPPA 蛋白质组学数据与多组学数据进行拓扑整合,以进行乳腺癌的生存预测。
BMC Med Genomics. 2019 Jul 11;12(Suppl 5):94. doi: 10.1186/s12920-019-0511-x.

引用本文的文献

1
Multimodal data integration in early-stage breast cancer.早期乳腺癌的多模态数据整合
Breast. 2025 Apr;80:103892. doi: 10.1016/j.breast.2025.103892. Epub 2025 Jan 28.