The prediction of distant metastasis risk for male breast cancer patients based on an interpretable machine learning model.

Affiliation

Department of Breast Surgery, Harbin Medical University Cancer Hospital, Harbin, China.

Publication information

BMC Med Inform Decis Mak. 2023 Apr 21;23(1):74. doi: 10.1186/s12911-023-02166-8.

DOI: 10.1186/s12911-023-02166-8

PMID: 37085843

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10120176/

Abstract

OBJECTIVES

This study compared the ability of different machine learning (ML) models and a nomogram to predict distant metastasis in male breast cancer (MBC) patients, and interpreted the optimal ML model using the SHapley Additive exPlanations (SHAP) framework.

METHODS

Four ML models were developed using data from MBC patients in the SEER database between 2010 and 2015 and MBC patients from our hospital between 2010 and 2020. The area under the curve (AUC) and the Brier score were used to assess the performance of the models. The DeLong test was applied to compare the performance of the models. Univariable and multivariable analyses were conducted using logistic regression.
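To make the two evaluation metrics concrete, here is a minimal Python sketch of how an AUC and a Brier score can be computed from predicted probabilities. The labels and probabilities are made-up toy values, not study data, and the function names `auc` and `brier_score` are illustrative; the abstract does not state which software the authors used.

```python
from typing import Sequence


def brier_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared difference between predicted probability and outcome (0 or 1)."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Probability that a random positive case scores higher than a random
    negative case; ties count as one half (Mann-Whitney formulation)."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up toy data: 1 = distant metastasis (M1), 0 = no distant metastasis.
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
y_prob = [0.05, 0.10, 0.20, 0.40, 0.35, 0.15, 0.80, 0.70]

toy_auc = auc(y_true, y_prob)
toy_brier = brier_score(y_true, y_prob)
print(f"AUC = {toy_auc:.3f}, Brier score = {toy_brier:.3f}")
```

A higher AUC indicates better discrimination (0.5 is chance level), while a lower Brier score indicates predicted probabilities that lie closer to the observed outcomes.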

RESULTS

A total of 2351 patients were analyzed; 168 (7.1%) had distant metastasis (M1), 117 (5.0%) had bone metastasis, and 71 (3.0%) had lung metastasis. The median age at diagnosis was 68.0 years. Most patients did not receive radiotherapy (1723, 73.3%) or chemotherapy (1447, 61.5%). The XGB model was the best ML model for predicting M1 in MBC patients. It achieved the largest AUC in tenfold cross-validation (AUC: 0.884; SD: 0.02) and in the training (AUC: 0.907; 95% CI: 0.899-0.917), test (AUC: 0.827; 95% CI: 0.802-0.857), and external validation (AUC: 0.754; 95% CI: 0.739-0.771) sets. It also performed well in predicting bone metastasis (AUC: 0.880, 95% CI: 0.856-0.903 in the training set; AUC: 0.823, 95% CI: 0.790-0.848 in the test set; AUC: 0.747, 95% CI: 0.727-0.764 in the external validation set) and lung metastasis (AUC: 0.906, 95% CI: 0.877-0.928 in the training set; AUC: 0.859, 95% CI: 0.816-0.891 in the test set; AUC: 0.756, 95% CI: 0.732-0.777 in the external validation set). The AUC of the XGB model was larger than that of the nomogram in the training (0.907 vs 0.802) and external validation (0.754 vs 0.706) sets.
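The reported proportions can be checked against the counts with a few lines of arithmetic. The short Python sketch below recomputes each percentage from the counts given in the abstract, using 2351 as the denominator; the dictionary keys are descriptive labels chosen here, not variable names from the study.

```python
total = 2351  # patients analyzed, as stated in the abstract

# name -> (count, percentage as printed in the abstract)
reported = {
    "distant metastasis (M1)": (168, 7.1),
    "bone metastasis": (117, 5.0),
    "lung metastasis": (71, 3.0),
    "no radiotherapy": (1723, 73.3),
    "no chemotherapy": (1447, 61.5),
}

# Recompute each percentage to one decimal place.
recomputed = {
    name: round(100 * count / total, 1)
    for name, (count, _) in reported.items()
}

for name, (count, printed) in reported.items():
    print(f"{name}: {count}/{total} = {recomputed[name]}% (abstract: {printed}%)")
```

All five recomputed values agree with the abstract to one decimal place. Because the bone and lung counts add up to 188, which exceeds the 168 patients with M1 disease, some patients presumably had metastases at both sites.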

CONCLUSIONS

The XGB model is a better predictor of distant metastasis among MBC patients than the other ML models and the nomogram; it also performs well in predicting bone and lung metastasis. Combined with SHAP values, it could help doctors intuitively understand the impact of each variable on the outcome.
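The idea behind SHAP values can be illustrated on a small scale. The sketch below computes exact Shapley values by enumerating every feature coalition for a hypothetical three-feature risk score; it does not use the SHAP library or the XGB model from the study, and the feature names (`t_stage`, `n_stage`, `age_over_70`), the coefficients, and the all-zero reference patient are assumptions made purely for illustration.

```python
from itertools import combinations
from math import factorial
from typing import Callable, Dict, List


def shapley_values(
    model: Callable[[Dict[str, float]], float],
    x: Dict[str, float],
    baseline: Dict[str, float],
) -> Dict[str, float]:
    """Exact Shapley values; features outside a coalition take baseline values."""
    names: List[str] = list(x)
    n = len(names)

    def value(coalition: frozenset) -> float:
        mixed = {f: (x[f] if f in coalition else baseline[f]) for f in names}
        return model(mixed)

    phi: Dict[str, float] = {}
    for f in names:
        others = [g for g in names if g != f]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = frozenset(subset)
                total += weight * (value(s | {f}) - value(s))
        phi[f] = total
    return phi


def toy_risk(p: Dict[str, float]) -> float:
    """Hypothetical risk score with an interaction term (NOT the study's model)."""
    return (0.05 + 0.20 * p["t_stage"] + 0.10 * p["n_stage"]
            + 0.05 * p["t_stage"] * p["n_stage"])


patient = {"t_stage": 1.0, "n_stage": 1.0, "age_over_70": 1.0}
reference = {"t_stage": 0.0, "n_stage": 0.0, "age_over_70": 0.0}

phi = shapley_values(toy_risk, patient, reference)
for name, contribution in phi.items():
    print(f"{name}: {contribution:+.3f}")
```

The contributions add up to the difference between the patient's score and the reference score, which is the additive property that lets each variable's effect on a single prediction be read off directly. A feature the model ignores, such as `age_over_70` in this toy score, receives a contribution of zero.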

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc12/10120176/05988531bf3c/12911_2023_2166_Fig1_HTML.jpg
