• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

乳腺癌生存分析:评估用于预测的集成学习技术

Survival analysis in breast cancer: evaluating ensemble learning techniques for prediction.

作者信息

Buyrukoğlu Gonca

机构信息

Department of Statistics/ Faculty of Science, Çankırı Karatekin University, Çankırı, Turkey.

出版信息

PeerJ Comput Sci. 2024 Jul 10;10:e2147. doi: 10.7717/peerj-cs.2147. eCollection 2024.

DOI:10.7717/peerj-cs.2147
PMID:39145224
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11323082/
Abstract

Breast cancer is most commonly faced with form of cancer amongst women worldwide. In spite of the fact that the breast cancer research and awareness have gained considerable momentum, there is still no one treatment due to disease heterogeneity. Survival data may be of specific interest in breast cancer studies to understand its dynamic and complex trajectories. This study copes with the most important covariates affecting the disease progression. The study utilizes the German Breast Cancer Study Group 2 (GBSG2) and the Molecular Taxonomy of Breast Cancer International Consortium dataset (METABRIC) datasets. In both datasets, interests lie in relapse of the disease and the time when the relapse happens. The three models, namely the Cox proportional hazards (PH) model, random survival forest (RSF) and conditional inference forest (Cforest) were employed to analyse the breast cancer datasets. The goal of this study is to apply these methods in prediction of breast cancer progression and compare their performances based on two different estimation methods: the bootstrap estimation and the bootstrap .632 estimation. The model performance was evaluated in concordance index (C-index) and prediction error curves (pec) for discrimination. The Cox PH model has a lower C-index and bigger prediction error compared to the RSF and the Cforest approach for both datasets. The analysis results of GBSG2 and METABRIC datasets reveal that the RSF and the Cforest algorithms provide non-parametric alternatives to Cox PH model for estimation of the survival probability of breast cancer patients.

摘要

乳腺癌是全球女性中最常见的癌症形式。尽管乳腺癌研究和认知已取得显著进展,但由于疾病的异质性,仍然没有一种统一的治疗方法。在乳腺癌研究中,生存数据对于理解其动态和复杂的发展轨迹可能具有特殊意义。本研究探讨了影响疾病进展的最重要协变量。该研究使用了德国乳腺癌研究组2(GBSG2)和国际乳腺癌分子分类联盟数据集(METABRIC)。在这两个数据集中,关注点在于疾病的复发以及复发发生的时间。采用了三种模型,即Cox比例风险(PH)模型、随机生存森林(RSF)和条件推断森林(Cforest)来分析乳腺癌数据集。本研究的目的是将这些方法应用于预测乳腺癌进展,并基于两种不同的估计方法:自助估计和自助.632估计,比较它们的性能。通过一致性指数(C-index)和预测误差曲线(pec)来评估模型的判别性能。对于两个数据集,与RSF和Cforest方法相比,Cox PH模型的C-index较低且预测误差较大。GBSG2和METABRIC数据集的分析结果表明,RSF和Cforest算法为估计乳腺癌患者的生存概率提供了Cox PH模型的非参数替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/dded1ae054fb/peerj-cs-10-2147-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/8086d575414f/peerj-cs-10-2147-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/124c4aad5dd6/peerj-cs-10-2147-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/6a507e4f6b12/peerj-cs-10-2147-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/17c6846809ca/peerj-cs-10-2147-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/a7689a9f8814/peerj-cs-10-2147-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/dded1ae054fb/peerj-cs-10-2147-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/8086d575414f/peerj-cs-10-2147-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/124c4aad5dd6/peerj-cs-10-2147-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/6a507e4f6b12/peerj-cs-10-2147-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/17c6846809ca/peerj-cs-10-2147-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/a7689a9f8814/peerj-cs-10-2147-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef71/11323082/dded1ae054fb/peerj-cs-10-2147-g006.jpg

相似文献

1
Survival analysis in breast cancer: evaluating ensemble learning techniques for prediction.乳腺癌生存分析:评估用于预测的集成学习技术
PeerJ Comput Sci. 2024 Jul 10;10:e2147. doi: 10.7717/peerj-cs.2147. eCollection 2024.
2
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
3
The Application and Comparison of Machine Learning Models for the Prediction of Breast Cancer Prognosis: Retrospective Cohort Study.机器学习模型在乳腺癌预后预测中的应用与比较:回顾性队列研究
JMIR Med Inform. 2022 Feb 18;10(2):e33440. doi: 10.2196/33440.
4
Which model is better in predicting the survival of laryngeal squamous cell carcinoma?: Comparison of the random survival forest based on machine learning algorithms to Cox regression: analyses based on SEER database.哪种模型更能预测喉鳞状细胞癌的生存情况?:基于机器学习算法的随机生存森林与 Cox 回归的比较:基于 SEER 数据库的分析。
Medicine (Baltimore). 2023 Mar 10;102(10):e33144. doi: 10.1097/MD.0000000000033144.
5
A comparison of the conditional inference survival forest model to random survival forests based on a simulation study as well as on two applications with time-to-event data.基于模拟研究以及对两个事件发生时间数据应用的情况,对条件推断生存森林模型与随机生存森林进行比较。
BMC Med Res Methodol. 2017 Jul 28;17(1):115. doi: 10.1186/s12874-017-0383-8.
6
Individual risk prediction: Comparing random forests with Cox proportional-hazards model by a simulation study.个体风险预测:通过模拟研究比较随机森林与 Cox 比例风险模型。
Biom J. 2023 Aug;65(6):e2100380. doi: 10.1002/bimj.202100380. Epub 2022 Sep 28.
7
[Application value of machine learning algorithms for predicting recurrence after resection of early-stage hepatocellular carcinoma].[机器学习算法在预测早期肝细胞癌切除术后复发中的应用价值]
Zhonghua Wai Ke Za Zhi. 2021 Aug 1;59(8):679-685. doi: 10.3760/cma.j.cn112139-20201026-00768.
8
Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study.基于生存事件的机器学习预测结直肠癌患者生存情况:回顾性队列研究。
J Med Internet Res. 2023 Oct 26;25:e44417. doi: 10.2196/44417.
9
Deep learning models for predicting the survival of patients with chondrosarcoma based on a surveillance, epidemiology, and end results analysis.基于监测、流行病学和最终结果分析的预测软骨肉瘤患者生存率的深度学习模型。
Front Oncol. 2022 Aug 22;12:967758. doi: 10.3389/fonc.2022.967758. eCollection 2022.
10
Comparison of the Tree-Based Machine Learning Algorithms to Cox Regression in Predicting the Survival of Oral and Pharyngeal Cancers: Analyses Based on SEER Database.基于监测、流行病学和最终结果(SEER)数据库分析:基于树的机器学习算法与Cox回归在预测口腔和咽癌生存率方面的比较
Cancers (Basel). 2020 Sep 29;12(10):2802. doi: 10.3390/cancers12102802.

引用本文的文献

1
Comparison of Random Survival Forest Based-Overall Survival With Deep Learning and Cox Proportional Hazard Models in HER-2-Positive HR-Negative Breast Cancer.基于随机生存森林的总生存期与深度学习及Cox比例风险模型在HER-2阳性HR阴性乳腺癌中的比较
Cancer Rep (Hoboken). 2025 Jul;8(7):e70262. doi: 10.1002/cnr2.70262.
2
Machine learning-based prediction model for patients with recurrent Staphylococcus aureus bacteremia.基于机器学习的复发性金黄色葡萄球菌菌血症患者预测模型
BMC Med Inform Decis Mak. 2025 Feb 24;25(1):99. doi: 10.1186/s12911-025-02878-z.
3
Investigating the role of KARS in lung adenocarcinoma via single-cell RNA sequencing.

本文引用的文献

1
A hybrid super ensemble learning model for the early-stage prediction of diabetes risk.一种用于糖尿病风险早期预测的混合超级集成学习模型。
Med Biol Eng Comput. 2023 Mar;61(3):785-797. doi: 10.1007/s11517-022-02749-z. Epub 2023 Jan 5.
2
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
3
Machine learning in sports science: challenges and opportunities.
通过单细胞RNA测序研究KARS在肺腺癌中的作用。
Discov Oncol. 2024 Dec 3;15(1):740. doi: 10.1007/s12672-024-01633-6.
体育科学中的机器学习:挑战与机遇。
Sports Biomech. 2024 Aug;23(8):961-967. doi: 10.1080/14763141.2021.1910334. Epub 2021 Apr 20.
4
Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival.可解释的机器学习可以优于 Cox 回归预测,并提供乳腺癌生存的见解。
Sci Rep. 2021 Mar 26;11(1):6968. doi: 10.1038/s41598-021-86327-7.
5
Machine Learning and Mechanistic Modeling for Prediction of Metastatic Relapse in Early-Stage Breast Cancer.用于预测早期乳腺癌转移复发的机器学习与机制建模
JCO Clin Cancer Inform. 2020 Mar;4:259-274. doi: 10.1200/CCI.19.00133.
6
Predicting factors for survival of breast cancer patients using machine learning techniques.运用机器学习技术预测乳腺癌患者的生存因素。
BMC Med Inform Decis Mak. 2019 Mar 22;19(1):48. doi: 10.1186/s12911-019-0801-4.
7
Dynamics of breast-cancer relapse reveal late-recurring ER-positive genomic subgroups.乳腺癌复发的动力学揭示了晚期复发的 ER 阳性基因组亚群。
Nature. 2019 Mar;567(7748):399-404. doi: 10.1038/s41586-019-1007-8. Epub 2019 Mar 13.
8
Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.全球癌症统计数据 2018:GLOBOCAN 对全球 185 个国家/地区 36 种癌症的发病率和死亡率的估计。
CA Cancer J Clin. 2018 Nov;68(6):394-424. doi: 10.3322/caac.21492. Epub 2018 Sep 12.
9
DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network.DeepSurv:使用 Cox 比例风险深度神经网络的个性化治疗推荐系统。
BMC Med Res Methodol. 2018 Feb 26;18(1):24. doi: 10.1186/s12874-018-0482-1.
10
A comparison of the conditional inference survival forest model to random survival forests based on a simulation study as well as on two applications with time-to-event data.基于模拟研究以及对两个事件发生时间数据应用的情况,对条件推断生存森林模型与随机生存森林进行比较。
BMC Med Res Methodol. 2017 Jul 28;17(1):115. doi: 10.1186/s12874-017-0383-8.