• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

乳腺癌患者发生多原发性癌症的危险因素:一项回顾性研究及机器学习模型的建立/测试

Risk factors of breast cancer patients developing multiple primary cancers: a retrospective study and establishing/testing of machine learning models.

作者信息

Jin Yudi, Su Tong, Fan Yanjia, Zheng Yineng, Tian Cheng, Ouyang Zubin, Lv Fajin

机构信息

Department of Radiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China.

Department of Breast and Thyroid Surgery, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.

出版信息

BMC Med Inform Decis Mak. 2025 Jul 25;25(1):277. doi: 10.1186/s12911-025-03086-5.

DOI:10.1186/s12911-025-03086-5
PMID:40713705
Abstract

BACKGROUND

Breast cancer is a prevalent malignancy globally, with approximately 1 in 10 breast cancer patients at risk of developing additional primary malignant tumors. This study seeks to explore the risk factors linked to the development of multiple primary cancers (MPCs) in breast cancer patients and to develop predictive models to aid in clinical decision-making.

METHODS

A cohort of patients from the Surveillance, Epidemiology, and End Results (SEER) database was analyzed to identify key factors contributing to the occurrence of MPCs. Machine learning models, including logistic regression and random forest, were established and tested to predict the risk of developing multiple primary cancers.

RESULTS

A total of 120,434 breast cancer patients were included in the study. After random undersampling of the majority calss and random selected a quarter of populations, there were 3432 patients in each of the one primary breast cancer (OPBC) group and the MPCs group. A logistic regression and a random forest model were constructed based on age, marital status, laterality, histological type, tumor grade, American Joint Committee on Cance (AJCC) stage, T and N stage, molecular subtype, surgery, chemotherapy, and radiotherapy. The logistic regression model achieved an area under the curve (AUC) of 0.902, a specificity of 0.905, and a sensitivity of 0.767 in the training set, and an AUC of 0.886, a specificity of 0.882, and a sensitivity of 0.782 In the testing set. The random forest model achieved an AUC of 0.955, a specificity of 0.916, and a sensitivity of 0.859 in the training set, and an AUC of 0.874, a specificity of 0.858, and a sensitivity of 0.769 in the testing set. A nomogram was plotted based on the logistic regression model. The Kaplan-Meier (K-M) curves demonstrated statistically significant differences in prognosis among the various risk groups that were stratified based on the nomogram.

CONCLUSIONS

This study assessed several risk factors influencing the development of MPCs in breast cancer patients. The machine learning model could offer a practical tool for personalized risk assessment in this patient population.

摘要

背景

乳腺癌是全球一种常见的恶性肿瘤,约十分之一的乳腺癌患者有发生其他原发性恶性肿瘤的风险。本研究旨在探讨与乳腺癌患者发生多原发性癌症(MPCs)相关的危险因素,并开发预测模型以辅助临床决策。

方法

对来自监测、流行病学和最终结果(SEER)数据库的一组患者进行分析,以确定导致MPCs发生的关键因素。建立并测试了包括逻辑回归和随机森林在内的机器学习模型,以预测发生多原发性癌症的风险。

结果

本研究共纳入120434例乳腺癌患者。在对多数类进行随机欠采样并随机选择四分之一的人群后,单原发性乳腺癌(OPBC)组和MPCs组各有3432例患者。基于年龄、婚姻状况、患侧、组织学类型、肿瘤分级、美国癌症联合委员会(AJCC)分期、T和N分期、分子亚型、手术、化疗和放疗构建了逻辑回归模型和随机森林模型。逻辑回归模型在训练集中的曲线下面积(AUC)为0.902,特异性为0.905,敏感性为0.767;在测试集中的AUC为0.886,特异性为0.882,敏感性为0.782。随机森林模型在训练集中的AUC为0.955,特异性为0.916,敏感性为0.859;在测试集中的AUC为0.874,特异性为0.858,敏感性为0.769。基于逻辑回归模型绘制了列线图。Kaplan-Meier(K-M)曲线显示,根据列线图分层的不同风险组之间的预后存在统计学显著差异。

结论

本研究评估了影响乳腺癌患者发生MPCs的几个危险因素。机器学习模型可为该患者群体进行个性化风险评估提供实用工具。

相似文献

1
Risk factors of breast cancer patients developing multiple primary cancers: a retrospective study and establishing/testing of machine learning models.乳腺癌患者发生多原发性癌症的危险因素:一项回顾性研究及机器学习模型的建立/测试
BMC Med Inform Decis Mak. 2025 Jul 25;25(1):277. doi: 10.1186/s12911-025-03086-5.
2
Competing risk and random survival forest models for predicting survival in post-resection elderly stage I-III colorectal cancer patients.用于预测I-III期老年结直肠癌患者术后生存情况的竞争风险和随机生存森林模型
Sci Rep. 2025 Jul 7;15(1):24269. doi: 10.1038/s41598-025-05824-1.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Predicting Pathological Complete Response Following Neoadjuvant Therapy in Patients With Breast Cancer: Development of Machine Learning-Based Prediction Models in a Retrospective Study.预测乳腺癌患者新辅助治疗后的病理完全缓解:一项回顾性研究中基于机器学习的预测模型的开发
JMIR Cancer. 2025 Jul 18;11:e64685. doi: 10.2196/64685.
6
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
7
A systematic review of evidence on malignant spinal metastases: natural history and technologies for identifying patients at high risk of vertebral fracture and spinal cord compression.一项关于恶性脊柱转移瘤的证据的系统回顾:自然病史和识别高风险椎体骨折和脊髓压迫患者的技术。
Health Technol Assess. 2013 Sep;17(42):1-274. doi: 10.3310/hta17420.
8
Individualized Prediction of Overall Survival Time for Patients with Primary Intramedullary Spinal Cord Astrocytoma: A Population-Based Study.原发性脊髓髓内星形细胞瘤患者总生存时间的个体化预测:一项基于人群的研究
World Neurosurg. 2025 Jan;193:1106-1116. doi: 10.1016/j.wneu.2024.10.092. Epub 2024 Nov 21.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
Clinical diagnostic and prognostic value of homocysteine combined with hemoglobin [f (Hcy-Hb)] in cardio-renal syndrome caused by primary acute myocardial infarction.同型半胱氨酸联合血红蛋白[f(Hcy-Hb)]在原发性急性心肌梗死所致心肾综合征中的临床诊断及预后价值
J Transl Med. 2025 Jul 23;23(1):813. doi: 10.1186/s12967-025-06512-4.

本文引用的文献

1
Evaluating nomogram models for predicting survival outcomes in gastric gastrointestinal stromal tumors with SEER database analysis.基于 SEER 数据库分析评估胃胃肠间质瘤生存结局预测的列线图模型。
Sci Rep. 2024 May 20;14(1):11494. doi: 10.1038/s41598-024-62353-z.
2
Risks of second primary cancers among 584,965 female and male breast cancer survivors in England: a 25-year retrospective cohort study.英格兰584,965名女性和男性乳腺癌幸存者中发生第二原发性癌症的风险:一项25年的回顾性队列研究。
Lancet Reg Health Eur. 2024 Apr 24;40:100903. doi: 10.1016/j.lanepe.2024.100903. eCollection 2024 May.
3
Development and testing of a random forest-based machine learning model for predicting events among breast cancer patients with a poor response to neoadjuvant chemotherapy.
基于随机森林的机器学习模型的开发和测试,用于预测新辅助化疗反应不良的乳腺癌患者的事件。
Eur J Med Res. 2023 Sep 30;28(1):394. doi: 10.1186/s40001-023-01361-7.
4
Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study.利用机器学习模型预测非酒精性脂肪性肝病的 5 年风险:前瞻性队列研究。
J Med Internet Res. 2023 Sep 12;25:e46891. doi: 10.2196/46891.
5
Correction: Socioeconomic position and prognosis in premenopausal breast cancer: a population-based cohort study in Denmark.更正:绝经前乳腺癌的社会经济地位与预后:丹麦一项基于人群的队列研究。
BMC Med. 2023 Aug 17;21(1):311. doi: 10.1186/s12916-023-02987-4.
6
Risk of second primary cancer among women in the Kaiser Permanente Breast Cancer Survivors Cohort.凯泽永久乳腺癌生存者队列中女性的第二原发癌风险。
Breast Cancer Res. 2023 May 3;25(1):50. doi: 10.1186/s13058-023-01647-y.
7
Risks of second non-breast primaries following breast cancer in women: a systematic review and meta-analysis.女性乳腺癌后发生第二非乳腺原发癌的风险:系统评价和荟萃分析。
Breast Cancer Res. 2023 Feb 10;25(1):18. doi: 10.1186/s13058-023-01610-x.
8
Cancer statistics, 2023.癌症统计数据,2023 年。
CA Cancer J Clin. 2023 Jan;73(1):17-48. doi: 10.3322/caac.21763.
9
Clinical and sociodemographic risk factors associated with the development of second primary cancers among postmenopausal breast cancer survivors.与绝经后乳腺癌幸存者第二原发癌发展相关的临床和社会人口学危险因素。
Breast Cancer. 2023 Mar;30(2):215-225. doi: 10.1007/s12282-022-01411-8. Epub 2022 Nov 1.
10
Heterogeneity in the Association Between the Presence of Coronary Artery Calcium and Cardiovascular Events: A Machine-Learning Approach in the MESA Study.冠状动脉钙存在与心血管事件之间的关联存在异质性:MESA 研究中的机器学习方法。
Circulation. 2023 Jan 10;147(2):132-141. doi: 10.1161/CIRCULATIONAHA.122.062626. Epub 2022 Oct 31.