• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

统计和机器学习风险预测模型在乳腺癌幸存者监测获益和失败中的性能。

Performance of Statistical and Machine Learning Risk Prediction Models for Surveillance Benefits and Failures in Breast Cancer Survivors.

机构信息

Kaiser Permanente Washington Health Research Institute, Kaiser Permanente WA, Seattle, Washington.

Department of Radiology, University of Washington and Seattle Cancer Care Alliance, Seattle, Washington.

出版信息

Cancer Epidemiol Biomarkers Prev. 2023 Apr 3;32(4):561-571. doi: 10.1158/1055-9965.EPI-22-0677.

DOI:10.1158/1055-9965.EPI-22-0677
PMID:36697364
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10073265/
Abstract

BACKGROUND

Machine learning (ML) approaches facilitate risk prediction model development using high-dimensional predictors and higher-order interactions at the cost of model interpretability and transparency. We compared the relative predictive performance of statistical and ML models to guide modeling strategy selection for surveillance mammography outcomes in women with a personal history of breast cancer (PHBC).

METHODS

We cross-validated seven risk prediction models for two surveillance outcomes, failure (breast cancer within 12 months of a negative surveillance mammogram) and benefit (surveillance-detected breast cancer). We included 9,447 mammograms (495 failures, 1,414 benefits, and 7,538 nonevents) from years 1996 to 2017 using a 1:4 matched case-control samples of women with PHBC in the Breast Cancer Surveillance Consortium. We assessed model performance of conventional regression, regularized regressions (LASSO and elastic-net), and ML methods (random forests and gradient boosting machines) by evaluating their calibration and, among well-calibrated models, comparing the area under the receiver operating characteristic curve (AUC) and 95% confidence intervals (CI).

RESULTS

LASSO and elastic-net consistently provided well-calibrated predicted risks for surveillance failure and benefit. The AUCs of LASSO and elastic-net were both 0.63 (95% CI, 0.60-0.66) for surveillance failure and 0.66 (95% CI, 0.64-0.68) for surveillance benefit, the highest among well-calibrated models.

CONCLUSIONS

For predicting breast cancer surveillance mammography outcomes, regularized regression outperformed other modeling approaches and balanced the trade-off between model flexibility and interpretability.

IMPACT

Regularized regression may be preferred for developing risk prediction models in other contexts with rare outcomes, similar training sample sizes, and low-dimensional features.

摘要

背景

机器学习(ML)方法通过使用高维预测因子和高阶交互作用来促进风险预测模型的开发,但代价是模型的可解释性和透明度降低。我们比较了统计和 ML 模型的相对预测性能,以指导具有乳腺癌个人史(PHBC)的女性进行监测乳房 X 线照片结果的建模策略选择。

方法

我们使用 1996 年至 2017 年期间乳腺癌监测联盟中 PHBC 女性的 1:4 匹配病例对照样本,对两种监测结果(失败[阴性监测乳房 X 线照片后 12 个月内发生乳腺癌]和获益[监测发现的乳腺癌])的七个风险预测模型进行了交叉验证。我们纳入了 9447 例乳房 X 线照片(495 例失败,1414 例获益和 7538 例无事件)。我们评估了常规回归、正则化回归(LASSO 和弹性网络)和 ML 方法(随机森林和梯度提升机)的模型性能,方法是评估其校准情况,并在具有良好校准的模型中比较接受者操作特征曲线(ROC)下的面积(AUC)和 95%置信区间(CI)。

结果

LASSO 和弹性网络一致地为监测失败和获益提供了校准良好的预测风险。LASSO 和弹性网络的 AUC 对于监测失败均为 0.63(95%CI,0.60-0.66),对于监测获益为 0.66(95%CI,0.64-0.68),在具有良好校准的模型中均为最高。

结论

对于预测乳腺癌监测乳房 X 线照片结果,正则化回归优于其他建模方法,并在模型灵活性和可解释性之间取得了平衡。

影响

在具有罕见结局、相似训练样本量和低维特征的其他情况下,正则化回归可能更适合开发风险预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/dd4d8321603a/nihms-1870560-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/b01f588e76f7/nihms-1870560-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/92f16200b743/nihms-1870560-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/8bafdd55fc3d/nihms-1870560-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/b6315198bc4b/nihms-1870560-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/dd4d8321603a/nihms-1870560-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/b01f588e76f7/nihms-1870560-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/92f16200b743/nihms-1870560-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/8bafdd55fc3d/nihms-1870560-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/b6315198bc4b/nihms-1870560-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b77f/10073265/dd4d8321603a/nihms-1870560-f0005.jpg

相似文献

1
Performance of Statistical and Machine Learning Risk Prediction Models for Surveillance Benefits and Failures in Breast Cancer Survivors.统计和机器学习风险预测模型在乳腺癌幸存者监测获益和失败中的性能。
Cancer Epidemiol Biomarkers Prev. 2023 Apr 3;32(4):561-571. doi: 10.1158/1055-9965.EPI-22-0677.
2
Predicting five-year interval second breast cancer risk in women with prior breast cancer.预测有既往乳腺癌史的女性五年间间隔性二次乳腺癌风险。
J Natl Cancer Inst. 2024 Jun 7;116(6):929-937. doi: 10.1093/jnci/djae063.
3
Breast Biopsy Intensity and Findings Following Breast Cancer Screening in Women With and Without a Personal History of Breast Cancer.乳腺癌筛查中有无乳腺癌个人史女性的乳腺活检强度和结果。
JAMA Intern Med. 2018 Apr 1;178(4):458-468. doi: 10.1001/jamainternmed.2017.8549.
4
Deep learning modeling using normal mammograms for predicting breast cancer risk.使用正常乳房 X 光片进行深度学习建模以预测乳腺癌风险。
Med Phys. 2020 Jan;47(1):110-118. doi: 10.1002/mp.13886. Epub 2019 Nov 19.
5
Identifying predictive factors for neuropathic pain after breast cancer surgery using machine learning.利用机器学习识别乳腺癌手术后发生神经性疼痛的预测因素。
Int J Med Inform. 2020 Sep;141:104170. doi: 10.1016/j.ijmedinf.2020.104170. Epub 2020 Jun 13.
6
Performance of Breast Cancer Risk-Assessment Models in a Large Mammography Cohort.乳腺癌风险评估模型在大型乳腺 X 线摄影队列中的表现。
J Natl Cancer Inst. 2020 May 1;112(5):489-497. doi: 10.1093/jnci/djz177.
7
Patterns of Breast Imaging Use Among Women with a Personal History of Breast Cancer.有乳腺癌个人病史的女性的乳房成像使用模式。
J Gen Intern Med. 2019 Oct;34(10):2098-2106. doi: 10.1007/s11606-019-05181-6. Epub 2019 Aug 13.
8
Surveillance Mammography Behaviors in Black and White Breast Cancer Survivors: Behavioral Risk Factors and Surveillance System, 2016.黑人及白人乳腺癌幸存者的乳房 X 线筛查行为:行为风险因素与监测系统,2016 年。
BMC Womens Health. 2023 Mar 30;23(1):148. doi: 10.1186/s12905-023-02246-x.
9
A two-stage modeling approach for breast cancer survivability prediction.两阶段建模方法用于乳腺癌生存预测。
Int J Med Inform. 2021 May;149:104438. doi: 10.1016/j.ijmedinf.2021.104438. Epub 2021 Mar 11.
10
A Mixed Method Approach to Examine Surveillance Mammography Experiences in Black and White Breast Cancer Survivors.一种混合方法来考察黑人和白人乳腺癌幸存者的乳房 X 光筛查体验。
Clin Breast Cancer. 2022 Dec;22(8):801-811. doi: 10.1016/j.clbc.2022.08.009. Epub 2022 Aug 27.

引用本文的文献

1
Development and validation of prediction models for sentinel lymph node status indicating postmastectomy radiotherapy in breast cancer: population-based study.乳腺癌乳房切除术后放疗前哨淋巴结状态预测模型的开发与验证:基于人群的研究
BJS Open. 2025 Mar 4;9(2). doi: 10.1093/bjsopen/zraf047.
2
Predicting five-year interval second breast cancer risk in women with prior breast cancer.预测有既往乳腺癌史的女性五年间间隔性二次乳腺癌风险。
J Natl Cancer Inst. 2024 Jun 7;116(6):929-937. doi: 10.1093/jnci/djae063.
3
Sources of Disparities in Surveillance Mammography Performance and Risk-Guided Recommendations for Supplemental Breast Imaging: A Simulation Study.

本文引用的文献

1
Digital Mammography and Breast Tomosynthesis Performance in Women with a Personal History of Breast Cancer, 2007-2016.2007-2016 年有乳腺癌个人病史女性的数字乳腺 X 线摄影与乳腺断层合成性能
Radiology. 2021 Aug;300(2):290-300. doi: 10.1148/radiol.2021204581. Epub 2021 May 18.
2
Essentialism and Exclusion: Racism in Cancer Risk Prediction Models.本质主义与排斥:癌症风险预测模型中的种族主义。
J Natl Cancer Inst. 2021 Nov 29;113(12):1620-1624. doi: 10.1093/jnci/djab074.
3
Embracing Genetic Diversity to Improve Black Health.拥抱基因多样性以改善黑人健康。
监测乳房 X 光摄影术表现差异的来源,以及补充乳房成像的风险导向建议:一项模拟研究。
Cancer Epidemiol Biomarkers Prev. 2023 Nov 1;32(11):1531-1541. doi: 10.1158/1055-9965.EPI-23-0330.
N Engl J Med. 2021 Mar 25;384(12):1163-1167. doi: 10.1056/NEJMms2031080. Epub 2021 Feb 10.
4
Factors to Consider in Developing Breast Cancer Risk Models to Implement into Clinical Care.在开发用于临床护理的乳腺癌风险模型时需考虑的因素。
Curr Epidemiol Rep. 2020 Jun;7(2):113-116. doi: 10.1007/s40471-020-00230-9. Epub 2020 Apr 29.
5
Multiple Myeloma, Version 3.2021, NCCN Clinical Practice Guidelines in Oncology.多发性骨髓瘤,第 3.2021 版,NCCN 肿瘤学临床实践指南。
J Natl Compr Canc Netw. 2020 Dec 2;18(12):1685-1717. doi: 10.6004/jnccn.2020.0057.
6
Racial/Ethnic Disparities in All-Cause Mortality among Patients Diagnosed with Triple-Negative Breast Cancer.三阴性乳腺癌患者全因死亡率的种族/民族差异。
Cancer Res. 2021 Feb 15;81(4):1163-1170. doi: 10.1158/0008-5472.CAN-20-3094. Epub 2020 Dec 3.
7
Hidden in Plain Sight - Reconsidering the Use of Race Correction in Clinical Algorithms.隐匿于众目睽睽之下——重新审视临床算法中种族校正的应用
N Engl J Med. 2020 Aug 27;383(9):874-882. doi: 10.1056/NEJMms2004740. Epub 2020 Jun 17.
8
Predictably unequal: understanding and addressing concerns that algorithmic clinical prediction may increase health disparities.可预见的不平等:理解并解决有关算法临床预测可能加剧健康差异的担忧。
NPJ Digit Med. 2020 Jul 30;3:99. doi: 10.1038/s41746-020-0304-9. eCollection 2020.
9
Breast Cancer Population Attributable Risk Proportions Associated with Body Mass Index and Breast Density by Race/Ethnicity and Menopausal Status.按种族/族裔和绝经状态划分的与体质指数和乳腺密度相关的乳腺癌人群归因风险比例。
Cancer Epidemiol Biomarkers Prev. 2020 Oct;29(10):2048-2056. doi: 10.1158/1055-9965.EPI-20-0358. Epub 2020 Jul 29.
10
Machine learning-based lifetime breast cancer risk reclassification compared with the BOADICEA model: impact on screening recommendations.基于机器学习的终生乳腺癌风险重新分类与 BOADICEA 模型比较:对筛查建议的影响。
Br J Cancer. 2020 Sep;123(5):860-867. doi: 10.1038/s41416-020-0937-0. Epub 2020 Jun 22.