• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测英格兰普通女性人群 10 年乳腺癌死亡率:模型开发和验证研究。

Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study.

机构信息

Cancer Research UK Oxford Centre, University of Oxford, UK; Nuffield Department of Primary Care Health Sciences, University of Oxford, UK.

Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, UK.

出版信息

Lancet Digit Health. 2023 Sep;5(9):e571-e581. doi: 10.1016/S2589-7500(23)00113-9.

DOI:10.1016/S2589-7500(23)00113-9
PMID:37625895
Abstract

BACKGROUND

Identifying female individuals at highest risk of developing life-threatening breast cancers could inform novel stratified early detection and prevention strategies to reduce breast cancer mortality, rather than only considering cancer incidence. We aimed to develop a prognostic model that accurately predicts the 10-year risk of breast cancer mortality in female individuals without breast cancer at baseline.

METHODS

In this model development and validation study, we used an open cohort study from the QResearch primary care database, which was linked to secondary care and national cancer and mortality registers in England, UK. The data extracted were from female individuals aged 20-90 years without previous breast cancer or ductal carcinoma in situ who entered the cohort between Jan 1, 2000, and Dec 31, 2020. The primary outcome was breast cancer-related death, which was assessed in the full dataset. Cox proportional hazards, competing risks regression, XGBoost, and neural network modelling approaches were used to predict the risk of breast cancer death within 10 years using routinely collected health-care data. Death due to causes other than breast cancer was the competing risk. Internal-external validation was used to evaluate prognostic model performance (using Harrell's C, calibration slope, and calibration in the large), performance heterogeneity, and transportability. Internal-external validation involved dataset partitioning by time period and geographical region. Decision curve analysis was used to assess clinical utility.

FINDINGS

We identified data for 11 626 969 female individuals, with 70 095 574 person-years of follow-up. There were 142 712 (1·2%) diagnoses of breast cancer, 24 043 (0·2%) breast cancer-related deaths, and 696 106 (6·0%) deaths from other causes. Meta-analysis pooled estimates of Harrell's C were highest for the competing risks model (0·932, 95% CI 0·917-0·946). The competing risks model was well calibrated overall (slope 1·011, 95% CI 0·978-1·044), and across different ethnic groups. Decision curve analysis suggested favourable clinical utility across all age groups. The XGBoost and neural network models had variable performance across age and ethnic groups.

INTERPRETATION

A model that predicts the combined risk of developing and then dying from breast cancer at the population level could inform stratified screening or chemoprevention strategies. Further evaluation of the competing risks model should comprise effect and health economic assessment of model-informed strategies.

FUNDING

Cancer Research UK.

摘要

背景

确定罹患危及生命的乳腺癌风险最高的女性个体,有助于制定新的分层早期检测和预防策略,以降低乳腺癌死亡率,而不仅仅考虑癌症发病率。我们旨在开发一种预后模型,该模型能够准确预测基线时无乳腺癌的女性个体在 10 年内的乳腺癌死亡风险。

方法

在这项模型开发和验证研究中,我们使用了来自英国 QResearch 初级保健数据库的开放队列研究,该数据库与二级保健以及国家癌症和死亡率登记处相关联。提取的数据来自年龄在 20-90 岁之间、无既往乳腺癌或导管原位癌且在 2000 年 1 月 1 日至 2020 年 12 月 31 日期间入组队列的女性个体。主要结局是乳腺癌相关死亡,在全数据集进行评估。使用常规收集的医疗保健数据,采用 Cox 比例风险、竞争风险回归、XGBoost 和神经网络建模方法预测 10 年内乳腺癌死亡的风险。乳腺癌以外的其他原因导致的死亡为竞争风险。内部-外部验证用于评估预后模型的性能(使用 Harrell 的 C、校准斜率和大样本校准)、性能异质性和可转移性。内部-外部验证包括按时间和地理区域划分数据集。决策曲线分析用于评估临床实用性。

结果

我们确定了 11626969 名女性个体的数据,随访时间为 70095574 人年。有 142712(1.2%)例乳腺癌诊断,24043(0.2%)例乳腺癌相关死亡,696106(6.0%)例其他原因死亡。荟萃分析中,竞争风险模型的 Harrell 的 C 估计值最高(0.932,95%CI 0.917-0.946)。总体而言,竞争风险模型校准良好(斜率 1.011,95%CI 0.978-1.044),并且在不同种族群体中也是如此。决策曲线分析表明,在所有年龄组中均具有良好的临床实用性。XGBoost 和神经网络模型在年龄和种族群体之间的性能存在差异。

解释

能够预测人群中罹患乳腺癌和死于乳腺癌的综合风险的模型,可以为分层筛查或化学预防策略提供信息。应进一步评估竞争风险模型,包括对模型指导策略的效果和健康经济学评估。

资助

英国癌症研究基金会。

相似文献

1
Predicting 10-year breast cancer mortality risk in the general female population in England: a model development and validation study.预测英格兰普通女性人群 10 年乳腺癌死亡率:模型开发和验证研究。
Lancet Digit Health. 2023 Sep;5(9):e571-e581. doi: 10.1016/S2589-7500(23)00113-9.
2
Development and internal-external validation of statistical and machine learning models for breast cancer prognostication: cohort study.统计和机器学习模型在乳腺癌预后预测中的开发和内外验证:队列研究。
BMJ. 2023 May 10;381:e073800. doi: 10.1136/bmj-2022-073800.
3
Predicting the future risk of lung cancer: development, and internal and external validation of the CanPredict (lung) model in 19·67 million people and evaluation of model performance against seven other risk prediction models.预测肺癌未来风险:CanPredict(肺部)模型在 1967 万人中的开发、内部和外部验证以及该模型与其他七个风险预测模型的性能评估。
Lancet Respir Med. 2023 Aug;11(8):685-697. doi: 10.1016/S2213-2600(23)00050-4. Epub 2023 Apr 5.
4
Development and validation of clinical prediction models for breast cancer incidence and mortality: a protocol for a dual cohort study.中文译文:用于乳腺癌发病率和死亡率的临床预测模型的开发和验证:一项双队列研究方案。
BMJ Open. 2022 Mar 28;12(3):e050828. doi: 10.1136/bmjopen-2021-050828.
5
Development and validation of prediction models for fetal growth restriction and birthweight: an individual participant data meta-analysis.胎儿生长受限和出生体重预测模型的建立与验证:个体参与者数据的荟萃分析。
Health Technol Assess. 2024 Aug;28(47):1-119. doi: 10.3310/DABW4814.
6
Multi-cancer risk stratification based on national health data: a retrospective modelling and validation study.基于国家健康数据的多癌种风险分层:一项回顾性建模和验证研究。
Lancet Digit Health. 2024 Jun;6(6):e396-e406. doi: 10.1016/S2589-7500(24)00062-1.
7
Development and validation of risk prediction equations to estimate survival in patients with colorectal cancer: cohort study.用于估计结直肠癌患者生存率的风险预测方程的开发与验证:队列研究
BMJ. 2017 Jun 15;357:j2497. doi: 10.1136/bmj.j2497.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Cancer incidence, treatment, and survival in the prison population compared with the general population in England: a population-based, matched cohort study.与英格兰普通人群相比,监狱人群中的癌症发病率、治疗及生存率:一项基于人群的匹配队列研究。
Lancet Oncol. 2024 May;25(5):553-562. doi: 10.1016/S1470-2045(24)00035-4.
10
Mortality impact, risks, and benefits of general population screening for ovarian cancer: the UKCTOCS randomised controlled trial.普通人群卵巢癌筛查的死亡率影响、风险及益处:英国卵巢癌筛查协作试验(UKCTOCS)随机对照试验
Health Technol Assess. 2025 May;29(10):1-93. doi: 10.3310/BHBR5832.

引用本文的文献

1
Machine learning model for predicting recurrence following intensity-modulated radiation therapy in nasopharyngeal carcinoma.用于预测鼻咽癌调强放射治疗后复发的机器学习模型。
World J Surg Oncol. 2025 Jun 18;23(1):237. doi: 10.1186/s12957-025-03860-9.
2
Development and validation of nomograms including individual- and area-level variables to predict risk of fatal and non-fatal cardiovascular diseases among Russian population.包含个体和地区层面变量的列线图的开发与验证,用于预测俄罗斯人群中致命和非致命心血管疾病的风险。
PLoS One. 2025 Jun 2;20(5):e0324736. doi: 10.1371/journal.pone.0324736. eCollection 2025.
3
Predicting the risk of ibrutinib in combination with R-ICE in patients with relapsed or refractory DLBCL using explainable machine learning algorithms.
使用可解释的机器学习算法预测依鲁替尼联合R-ICE方案治疗复发或难治性弥漫性大B细胞淋巴瘤(DLBCL)患者的风险。
Clin Exp Med. 2025 May 26;25(1):177. doi: 10.1007/s10238-025-01709-9.
4
Personalized screening based on risk and density: prevalence data from the RIBBS study.基于风险和密度的个性化筛查:RIBBS研究的患病率数据。
Radiol Med. 2025 Mar 21. doi: 10.1007/s11547-025-01981-5.
5
Predicting In-Hospital Fall Risk Using Machine Learning With Real-Time Location System and Electronic Medical Records.利用机器学习结合实时定位系统和电子病历预测住院期间跌倒风险
J Cachexia Sarcopenia Muscle. 2025 Feb;16(1):e13713. doi: 10.1002/jcsm.13713.
6
Machine learning-based prognostic modeling and surgical value analysis of de novo metastatic invasive ductal carcinoma of the breast.基于机器学习的乳腺原发性转移性浸润性导管癌预后建模及手术价值分析
Updates Surg. 2025 Jan 15. doi: 10.1007/s13304-025-02066-8.
7
A hybrid machine learning approach for the personalized prognostication of aggressive skin cancers.一种用于侵袭性皮肤癌个性化预后预测的混合机器学习方法。
NPJ Digit Med. 2025 Jan 8;8(1):15. doi: 10.1038/s41746-024-01329-9.
8
An Advanced Machine Learning Model for a Web-Based Artificial Intelligence-Based Clinical Decision Support System Application: Model Development and Validation Study.基于人工智能的临床决策支持系统的基于网络的人工智能临床决策支持系统应用的高级机器学习模型:模型开发和验证研究。
J Med Internet Res. 2024 Sep 4;26:e56022. doi: 10.2196/56022.
9
Development and validation of a cuproptosis-related prognostic model for acute myeloid leukemia patients using machine learning with stacking.基于机器学习堆叠技术的急性髓系白血病患者铜死亡相关预后模型的建立与验证。
Sci Rep. 2024 Feb 2;14(1):2802. doi: 10.1038/s41598-024-53306-7.