• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从泛癌全转录组数据中学习个体生存模型。

Learning Individual Survival Models from PanCancer Whole Transcriptome Data.

机构信息

Alberta Machine Intelligence Institute, Edmonton, Alberta, Canada.

Department of Surgery, University of Alberta, Edmonton, Alberta, Canada.

出版信息

Clin Cancer Res. 2023 Oct 2;29(19):3924-3936. doi: 10.1158/1078-0432.CCR-22-3493.

DOI:10.1158/1078-0432.CCR-22-3493
PMID:37463063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10543961/
Abstract

PURPOSE

Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with cancer.

EXPERIMENTAL DESIGN

This article provides a method that learns a model for predicting the survival time for individual patients with cancer from the PanCancer Atlas: given the (16,335 dimensional) gene expression profiles from 10,173 patients, each having one of 33 cancers, this method uses unsupervised nonnegative matrix factorization (NMF) to reexpress the gene expression data for each patient in terms of 100 learned NMF factors. It then feeds these 100 factors into the Multi-Task Logistic Regression (MTLR) learner to produce cancer-specific models for each of 20 cancers (with >50 uncensored instances); this produces "individual survival distributions" (ISD), which provide survival probabilities at each future time for each individual patient, which provides a patient's risk score and estimated survival time.

RESULTS

Our NMF-MTLR concordance indices outperformed the VAECox benchmark by 14.9% overall. We achieved optimal survival prediction using pan-cancer NMF in combination with cancer-specific MTLR models. We provide biological interpretation of the NMF model and clinical implications of ISDs for prognosis and therapeutic response prediction.

CONCLUSIONS

NMF-MTLR provides many benefits over other models: superior model discrimination, superior calibration, meaningful survival time estimates, and accurate probabilistic estimates of survival over time for each individual patient. We advocate for the adoption of these cancer survival models in clinical and research settings.

摘要

目的

个性化医学试图根据患者的个体肿瘤分子特征预测每个患者的生存时间。我们研究了我们的生存学习器与降维方法相结合,是否可以为各种癌症患者产生有用的生存估计。

实验设计

本文提供了一种方法,用于从 PanCancer Atlas 中学习预测个体癌症患者生存时间的模型:给定来自 10173 名患者的(16335 维)基因表达谱,每个患者有 33 种癌症中的一种,该方法使用无监督非负矩阵分解(NMF)将每个患者的基因表达数据重新表示为 100 个学习到的 NMF 因子。然后,它将这 100 个因子输入到多任务逻辑回归(MTLR)学习器中,为 20 种癌症(每个癌症都有>50 个未删失的实例)中的每一种癌症生成癌症特异性模型;这会产生“个体生存分布”(ISD),为每个个体患者提供每个未来时间的生存概率,这为患者提供了风险评分和估计的生存时间。

结果

我们的 NMF-MTLR 一致性指数总体上优于 VAECox 基准 14.9%。我们通过使用泛癌 NMF 与癌症特异性 MTLR 模型相结合,实现了最佳的生存预测。我们提供了 NMF 模型的生物学解释和 ISD 对预后和治疗反应预测的临床意义。

结论

与其他模型相比,NMF-MTLR 具有许多优势:优越的模型区分度、优越的校准、有意义的生存时间估计以及每个个体患者随时间变化的准确概率生存估计。我们主张在临床和研究环境中采用这些癌症生存模型。

相似文献

1
Learning Individual Survival Models from PanCancer Whole Transcriptome Data.从泛癌全转录组数据中学习个体生存模型。
Clin Cancer Res. 2023 Oct 2;29(19):3924-3936. doi: 10.1158/1078-0432.CCR-22-3493.
2
Individual Survival Distributions Generated by Multi-Task Logistic Regression Yield a New Perspective on Molecular and Clinical Prognostic Factors in Gastric Adenocarcinoma.多任务逻辑回归生成的个体生存分布为胃腺癌分子和临床预后因素提供了新视角。
Cancers (Basel). 2024 Feb 15;16(4):786. doi: 10.3390/cancers16040786.
3
Attention-Based Multi-NMF Deep Neural Network with Multimodality Data for Breast Cancer Prognosis Model.基于注意力的多模态数据的多 NMF 深度神经网络用于乳腺癌预后模型。
Biomed Res Int. 2019 May 13;2019:9523719. doi: 10.1155/2019/9523719. eCollection 2019.
4
A flexible R package for nonnegative matrix factorization.一个用于非负矩阵分解的灵活 R 包。
BMC Bioinformatics. 2010 Jul 2;11:367. doi: 10.1186/1471-2105-11-367.
5
Gene expression based survival prediction for cancer patients-A topic modeling approach.基于基因表达的癌症患者生存预测-一种主题建模方法。
PLoS One. 2019 Nov 15;14(11):e0224446. doi: 10.1371/journal.pone.0224446. eCollection 2019.
6
Learning Microbial Community Structures with Supervised and Unsupervised Non-negative Matrix Factorization.基于监督和无监督非负矩阵分解的微生物群落结构学习。
Microbiome. 2017 Aug 31;5(1):110. doi: 10.1186/s40168-017-0323-1.
7
Metastasis-related gene identification by compound constrained NMF and a semisupervised cluster approach using pancancer multiomics features.基于泛癌多组学特征的复合约束 NMF 和半监督聚类方法识别转移相关基因。
Comput Biol Med. 2022 Dec;151(Pt A):106263. doi: 10.1016/j.compbiomed.2022.106263. Epub 2022 Nov 9.
8
Exploring matrix factorization techniques for significant genes identification of Alzheimer's disease microarray gene expression data.探索矩阵分解技术在阿尔茨海默病基因表达数据中显著基因识别中的应用。
BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2105-12-S5-S7. Epub 2011 Jul 27.
9
Assessment of nonnegative matrix factorization algorithms for electroencephalography spectral analysis.评估用于脑电图频谱分析的非负矩阵分解算法。
Biomed Eng Online. 2020 Jul 31;19(1):61. doi: 10.1186/s12938-020-00796-x.
10
Impact of the Choice of Normalization Method on Molecular Cancer Class Discovery Using Nonnegative Matrix Factorization.归一化方法的选择对使用非负矩阵分解进行分子癌症类别发现的影响
PLoS One. 2016 Oct 14;11(10):e0164880. doi: 10.1371/journal.pone.0164880. eCollection 2016.

引用本文的文献

1
Individual Survival Distributions Generated by Multi-Task Logistic Regression Yield a New Perspective on Molecular and Clinical Prognostic Factors in Gastric Adenocarcinoma.多任务逻辑回归生成的个体生存分布为胃腺癌分子和临床预后因素提供了新视角。
Cancers (Basel). 2024 Feb 15;16(4):786. doi: 10.3390/cancers16040786.

本文引用的文献

1
Pan-cancer analysis of advanced patient tumors reveals interactions between therapy and genomic landscapes.晚期患者肿瘤的泛癌分析揭示了治疗与基因组格局之间的相互作用。
Nat Cancer. 2020 Apr;1(4):452-468. doi: 10.1038/s43018-020-0050-6. Epub 2020 Apr 13.
2
clusterProfiler 4.0: A universal enrichment tool for interpreting omics data.clusterProfiler 4.0:用于解释组学数据的通用富集工具。
Innovation (Camb). 2021 Jul 1;2(3):100141. doi: 10.1016/j.xinn.2021.100141. eCollection 2021 Aug 28.
3
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.
《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
4
Epigenetic regulation in the pathogenesis of non-melanoma skin cancer.表观遗传调控在非黑色素瘤皮肤癌发病机制中的作用。
Semin Cancer Biol. 2022 Aug;83:36-56. doi: 10.1016/j.semcancer.2020.11.009. Epub 2020 Nov 23.
5
Improved survival analysis by learning shared genomic information from pan-cancer data.从泛癌数据中学习共享基因组信息以改善生存分析。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i389-i398. doi: 10.1093/bioinformatics/btaa462.
6
High-Throughput Transcriptome Profiling in Drug and Biomarker Discovery.药物与生物标志物发现中的高通量转录组分析
Front Genet. 2020 Feb 5;11:19. doi: 10.3389/fgene.2020.00019. eCollection 2020.
7
Deciphering drug resistance in Mycobacterium tuberculosis using whole-genome sequencing: progress, promise, and challenges.利用全基因组测序破译结核分枝杆菌的耐药性:进展、前景和挑战。
Genome Med. 2019 Jul 25;11(1):45. doi: 10.1186/s13073-019-0660-8.
8
Predicting in-hospital mortality of patients with acute kidney injury in the ICU using random forest model.应用随机森林模型预测 ICU 中急性肾损伤患者的院内死亡率。
Int J Med Inform. 2019 May;125:55-61. doi: 10.1016/j.ijmedinf.2019.02.002. Epub 2019 Feb 12.
9
Molecular subtypes of pancreatic cancer.胰腺癌的分子亚型。
Nat Rev Gastroenterol Hepatol. 2019 Apr;16(4):207-220. doi: 10.1038/s41575-019-0109-y.
10
Maftools: efficient and comprehensive analysis of somatic variants in cancer.Maftools:癌症体细胞变异的高效全面分析。
Genome Res. 2018 Nov;28(11):1747-1756. doi: 10.1101/gr.239244.118. Epub 2018 Oct 19.