
Re-evaluation of publicly available gene-expression databases using machine-learning yields a maximum prognostic power in breast cancer.

Author information

Tschodu Dimitrij, Lippoldt Jürgen, Gottheil Pablo, Wegscheider Anne-Sophie, Käs Josef A, Niendorf Axel

Affiliations

Peter Debye Institute for Soft Matter Physics, Leipzig University, 04103, Leipzig, Germany.

Institute for Histology, Cytology and Molecular Diagnostics, MVZ Prof. Dr. med. A. Niendorf Pathologie Hamburg-West GmbH, 22767, Hamburg, Germany.

Publication information

Sci Rep. 2023 Oct 5;13(1):16402. doi: 10.1038/s41598-023-41090-9.

DOI:10.1038/s41598-023-41090-9
PMID:37798300
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10556090/
Abstract

Gene expression signatures refer to patterns of gene activities and are used to classify different types of cancer, determine prognosis, and guide treatment decisions. Advancements in high-throughput technology and machine learning have led to improvements to predict a patient's prognosis for different cancer phenotypes. However, computational methods for analyzing signatures have not been used to evaluate their prognostic power. Contention remains on the utility of gene expression signatures for prognosis. The prevalent approaches include random signatures, expert knowledge, and machine learning to construct an improved signature. We unify these approaches to evaluate their prognostic power. Re-evaluation of publicly available gene-expression data from 8 databases with 9 machine-learning models revealed previously unreported results. Gene-expression signatures are confirmed to be useful in predicting a patient's prognosis. Convergent evidence from [Formula: see text] 10,000 signatures implicates a maximum prognostic power. By calculating the concordance index, which measures how well patients with different prognoses can be discriminated, we show that a signature can correctly discriminate patients' prognoses no more than 80% of the time. Additionally, we show that more than 50% of the potentially available information is still missing at this value. We surmise that an accurate prognosis must incorporate molecular, clinical, histological, and other complementary factors.
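
The abstract describes the concordance index only in words: it measures how well patients with different prognoses can be discriminated. A minimal Python sketch of one common form, Harrell's C for right-censored survival data, might look like the following. The function name `concordance_index`, the toy data, and the simplified handling of tied follow-up times are assumptions made for illustration; this is not the authors' evaluation pipeline.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Fraction of comparable patient pairs whose risk ordering matches
    their observed outcome ordering (Harrell's C, simplified).

    times       -- follow-up time per patient
    events      -- 1 if the event was observed, 0 if censored
    risk_scores -- model output; higher means worse predicted prognosis
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let `a` be the patient with the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Assumption: pairs with tied times are skipped for simplicity.
        # A pair is also not comparable if the earlier patient was censored.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # earlier event had the higher risk score
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort: four patients, the third one censored.
times = [5, 10, 12, 20]
events = [1, 1, 0, 1]
risk = [0.9, 0.4, 0.5, 0.1]
print(concordance_index(times, events, risk))  # 4 of 5 comparable pairs -> 0.8
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, so the ceiling reported in the abstract means that roughly four out of five comparable patient pairs are ranked correctly.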
