• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于监测、流行病学和最终结果(SEER)数据库的乳腺癌骨转移机器学习预测:模型开发与验证

SEER-based machine learning prediction of bone metastasis in breast cancer: model development and validation.

作者信息

Gao Ying, Liu Lei, Wang Shoujun, Tao Weijie, Wang Jinmiao, Duan Ran, Xie Hai, Takahashi Hideaki, Hao Jie, Gao Ming

机构信息

Department of Breast and Thyroid Surgery, Tianjin Key Laboratory of General Surgery in Construction, Tianjin Union Medical Center, The First Affiliated Hospital of Nankai University, Tianjin, China.

Department of Thyroid and Neck Cancer, Tianjin Medical University Cancer Institute and Hospital, National Cancer Clinical Research Center, Tianjin Cancer Clinical Research Center, Tianjin Key Laboratory of Cancer Prevention and Treatment, Tianjin, China.

出版信息

Gland Surg. 2025 Jul 31;14(7):1366-1378. doi: 10.21037/gs-2025-168. Epub 2025 Jul 28.

DOI:10.21037/gs-2025-168
PMID:40771376
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12322768/
Abstract

BACKGROUND

Breast cancer (BC) is the leading cancer in women. It often metastasizes to bone, worsening the prognosis. Diagnostic methods often fail to predict bone metastasis (BM). This study developed a machine learning (ML) model using the Surveillance, Epidemiology, and End Results (SEER) database for BM prediction, to refine treatments and improve outcomes.

METHODS

Using SEER data, we studied 24,584 BC patients diagnosed 2010-2015 with radiologically confirmed BM. Tumor size, grade, tumor (T)/node (N) stages, and estrogen receptor (ER)/progesterone receptor (PR)/human epidermal growth factor receptor 2 (HER2) status were assessed. Stratified randomization divided the data into 70% training (n=18,438) and 30% validation (n=6,146). Six ML algorithms were developed, emphasizing random forest (RF). Receiver operating characteristic (ROC) curve analysis [area under the curve (AUC), sensitivity, specificity, negative predictive value (NPV)] assessed performance. The SHapley Additive exPlanations (SHAP) framework identified key BM predictors.

RESULTS

Our analysis of 24,584 patients identified 1,298 (5.26%) patients with BM. Logistic regression (LR) provided the highest specificity [0.897, 95% confidence interval (CI): 0.889-0.905], contrasting with gradient boosting machine (GBM)'s maximal sensitivity (0.658, 95% CI: 0.609-0.707). With sensitivity at 0.658, better algorithms or multimodal methods are needed for case identification. The multilayer perceptron neural network (MLPNN) model demonstrated superior performance, with the highest AUC of 0.808 (95% CI: 0.798-0.818), surpassing the LR and adaptive boosting (AdaBoost) models, both with AUCs of 0.803 (95% CI: 0.793-0.813). The RF model was particularly adept at ruling out BM, with an NPV above 97%. The SHAP analysis identified tumor size, grade, T/N stages, ER/PR/HER2 status, and brain/liver/lung metastases as key predictors for risk stratification. Decision curve analysis showed RF's superior utility over the American Joint Committee on Cancer (AJCC) Staging System.

CONCLUSIONS

Our ML model demonstrates potential for predicting BM in patients with BC. It may serve as a clinical aid to identify at-risk individuals early. However, moderate sensitivity requires refinement for better case detection. This study supports integrating ML into clinical practice, advancing personalized oncology medicine.

摘要

背景

乳腺癌(BC)是女性中最常见的癌症。它常转移至骨骼,使预后恶化。诊断方法往往无法预测骨转移(BM)。本研究利用监测、流行病学和最终结果(SEER)数据库开发了一种机器学习(ML)模型用于BM预测,以优化治疗并改善预后。

方法

利用SEER数据,我们研究了2010 - 2015年诊断为经放射学证实的BM的24584例BC患者。评估了肿瘤大小、分级、肿瘤(T)/淋巴结(N)分期以及雌激素受体(ER)/孕激素受体(PR)/人表皮生长因子受体2(HER2)状态。分层随机化将数据分为70%训练集(n = 18438)和30%验证集(n = 6146)。开发了六种ML算法,重点是随机森林(RF)。通过受试者工作特征(ROC)曲线分析[曲线下面积(AUC)、敏感性、特异性、阴性预测值(NPV)]评估性能。SHapley加性解释(SHAP)框架确定了关键的BM预测因子。

结果

我们对24584例患者的分析确定了1298例(5.26%)有BM的患者。逻辑回归(LR)具有最高的特异性[0.897,95%置信区间(CI):0.889 - 0.905],与梯度提升机(GBM)的最大敏感性(0.658,95%CI:0.609 - 0.707)形成对比。在敏感性为0.658的情况下,需要更好的算法或多模态方法来进行病例识别。多层感知器神经网络(MLPNN)模型表现出卓越的性能,最高AUC为0.808(95%CI:0.798 - 0.818),超过了LR和自适应增强(AdaBoost)模型,这两种模型的AUC均为0.803(95%CI:|0.793 - 0.813)。RF模型特别擅长排除BM,NPV高于97%。SHAP分析确定肿瘤大小、分级、T/N分期、ER/PR/HER2状态以及脑/肝/肺转移是风险分层的关键预测因子。决策曲线分析显示RF比美国癌症联合委员会(AJCC)分期系统具有更高的效用。

结论

我们的ML模型显示了预测BC患者BM的潜力。它可作为一种临床辅助手段,早期识别高危个体。然而,中等的敏感性需要改进以实现更好的病例检测。本研究支持将ML整合到临床实践中,推动个性化肿瘤医学发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8d55d17260b7/gs-14-07-1366-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8682590aeaf1/gs-14-07-1366-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/4cda03ef5530/gs-14-07-1366-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/dcd8bea5fe14/gs-14-07-1366-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/c0721642c732/gs-14-07-1366-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/ec2efbd4ed47/gs-14-07-1366-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8e8fba07b17e/gs-14-07-1366-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/a8d97809c421/gs-14-07-1366-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8d55d17260b7/gs-14-07-1366-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8682590aeaf1/gs-14-07-1366-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/4cda03ef5530/gs-14-07-1366-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/dcd8bea5fe14/gs-14-07-1366-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/c0721642c732/gs-14-07-1366-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/ec2efbd4ed47/gs-14-07-1366-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8e8fba07b17e/gs-14-07-1366-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/a8d97809c421/gs-14-07-1366-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8085/12322768/8d55d17260b7/gs-14-07-1366-f8.jpg

相似文献

1
SEER-based machine learning prediction of bone metastasis in breast cancer: model development and validation.基于监测、流行病学和最终结果(SEER)数据库的乳腺癌骨转移机器学习预测:模型开发与验证
Gland Surg. 2025 Jul 31;14(7):1366-1378. doi: 10.21037/gs-2025-168. Epub 2025 Jul 28.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
A web-based prediction model for brain metastasis in non-small cell lung cancer patients.一种用于非小细胞肺癌患者脑转移的基于网络的预测模型。
Discov Oncol. 2025 Jul 29;16(1):1438. doi: 10.1007/s12672-025-03298-1.
5
A novel nomogram for survival prediction in renal cell carcinoma patients with brain metastases: an analysis of the SEER database.一种用于预测肾细胞癌脑转移患者生存情况的新型列线图:基于监测、流行病学和最终结果(SEER)数据库的分析
Front Immunol. 2025 Jun 30;16:1572580. doi: 10.3389/fimmu.2025.1572580. eCollection 2025.
6
Development and validation of a machine learning-based clinical prediction model for monitoring liver injury in patients with pan-cancer receiving immunotherapy.基于机器学习的临床预测模型的开发与验证,用于监测接受免疫治疗的泛癌患者的肝损伤
Int J Med Inform. 2025 Jul 5;203:106036. doi: 10.1016/j.ijmedinf.2025.106036.
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.预测冠心病风险的机器学习模型:基于日本人群研究的见解进行开发与验证
JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.
9
Individualized survival prediction and risk stratification using machine learning for patients with malignant struma ovarii: a population-based study with external validation.使用机器学习对恶性卵巢甲状腺肿患者进行个体化生存预测和风险分层:一项基于人群并经外部验证的研究
Gland Surg. 2025 Jun 30;14(6):1052-1065. doi: 10.21037/gs-2025-35. Epub 2025 Jun 26.
10
Mortality Risk Prediction in Patients With Antimelanoma Differentiation-Associated, Gene 5 Antibody-Positive, Dermatomyositis-Associated Interstitial Lung Disease: Algorithm Development and Validation.抗黑色素瘤分化相关基因5抗体阳性、皮肌炎相关间质性肺疾病患者的死亡风险预测:算法开发与验证
J Med Internet Res. 2025 Feb 5;27:e62836. doi: 10.2196/62836.

本文引用的文献

1
Harnessing machine learning potential for personalised drug design and overcoming drug resistance.利用机器学习潜力进行个性化药物设计并克服耐药性。
J Drug Target. 2024 Sep;32(8):918-930. doi: 10.1080/1061186X.2024.2365934. Epub 2024 Jun 13.
2
Incidence and risk factors for bone metastases at presentation in solid tumors.实体瘤初诊时骨转移的发生率及危险因素。
Front Oncol. 2024 May 10;14:1392667. doi: 10.3389/fonc.2024.1392667. eCollection 2024.
3
Machine learning algorithms to uncover risk factors of breast cancer: insights from a large case-control study.
用于揭示乳腺癌风险因素的机器学习算法:来自一项大型病例对照研究的见解
Front Oncol. 2024 Feb 15;13:1276232. doi: 10.3389/fonc.2023.1276232. eCollection 2023.
4
Identifying cancer risks using spectral subset feature selection based on multi-layer perception neural network for premature treatment.基于多层感知神经网络的光谱子集特征选择识别癌症风险,以便进行早期治疗。
Comput Methods Biomech Biomed Engin. 2024 Oct;27(13):1804-1816. doi: 10.1080/10255842.2023.2262662. Epub 2023 Oct 4.
5
Machine learning-based approaches for cancer prediction using microbiome data.基于机器学习的微生物组数据癌症预测方法。
Sci Rep. 2023 Jul 21;13(1):11821. doi: 10.1038/s41598-023-38670-0.
6
Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework.基于机器学习和SHapley加性解释框架预测乳腺浸润性微乳头状癌患者的淋巴结转移
Front Oncol. 2022 Sep 15;12:981059. doi: 10.3389/fonc.2022.981059. eCollection 2022.
7
Prediction for Distant Metastasis of Breast Cancer Using Dynamic Contrast-Enhanced Magnetic Resonance Imaging Images under Deep Learning.基于深度学习的动态对比增强磁共振成像预测乳腺癌远处转移。
Comput Intell Neurosci. 2022 Jun 8;2022:6126061. doi: 10.1155/2022/6126061. eCollection 2022.
8
Global challenges in breast cancer detection and treatment.乳腺癌检测和治疗的全球挑战。
Breast. 2022 Mar;62 Suppl 1(Suppl 1):S3-S6. doi: 10.1016/j.breast.2022.02.003. Epub 2022 Feb 8.
9
Understanding and overcoming tumor heterogeneity in metastatic breast cancer treatment.理解和克服转移性乳腺癌治疗中的肿瘤异质性。
Nat Cancer. 2021 Jul;2(7):680-692. doi: 10.1038/s43018-021-00229-1. Epub 2021 Jul 19.
10
Exosomal miR-19a and IBSP cooperate to induce osteolytic bone metastasis of estrogen receptor-positive breast cancer.外泌体 miR-19a 和 IBSP 协同诱导雌激素受体阳性乳腺癌溶骨性骨转移。
Nat Commun. 2021 Aug 31;12(1):5196. doi: 10.1038/s41467-021-25473-y.