Unlocking the complete blood count as a risk stratification tool for breast cancer using machine learning: a large scale retrospective study.

Affiliations

Huna, São Paulo, Brazil.

Departamento de Ciências da Computação, Instituto de Ciências Exatas, Universidade Federal de Minas Gerais/UFMG, Campus Belo Horizonte, Minas Gerais, Brazil.

Publication information

Sci Rep. 2024 May 12;14(1):10841. doi: 10.1038/s41598-024-61215-y.

DOI:10.1038/s41598-024-61215-y
PMID:38736010
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11089041/
Abstract

Optimizing early breast cancer (BC) detection requires effective risk assessment tools. This retrospective study from Brazil showcases the efficacy of machine learning in discerning complex patterns within routine blood tests, presenting a globally accessible and cost-effective approach for risk evaluation. We analyzed complete blood count (CBC) tests from 396,848 women aged 40-70, who underwent breast imaging or biopsies within six months after their CBC test. Of these, 2861 (0.72%) were identified as cases: 1882 with BC confirmed by anatomopathological tests, and 979 with highly suspicious imaging (BI-RADS 5). The remaining 393,987 participants (99.28%), with BI-RADS 1 or 2 results, were classified as controls. The database was divided into modeling (including training and validation) and testing sets based on diagnostic certainty. The testing set comprised cases confirmed by anatomopathology and controls cancer-free for 4.5-6.5 years post-CBC. Our ridge regression model, incorporating neutrophil-lymphocyte ratio, red blood cells, and age, achieved an AUC of 0.64 (95% CI 0.64-0.65). We also demonstrate that these results are slightly better than those from a boosting machine learning model, LightGBM, plus having the benefit of being fully interpretable. Using the probabilistic output from this model, we divided the study population into four risk groups: high, moderate, average, and low risk, which obtained relative ratios of BC of 1.99, 1.32, 1.02, and 0.42, respectively. The aim of this stratification was to streamline prioritization, potentially improving the early detection of breast cancer, particularly in resource-limited environments. As a risk stratification tool, this model offers the potential for personalized breast cancer screening by prioritizing women based on their individual risk, thereby indicating a shift from a broad population strategy.
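
The cohort figures and evaluation metrics in the abstract can be checked with a few lines of arithmetic. The following is a minimal Python sketch, not the authors' code. The counts come from the abstract. The definition of "relative ratio" used here (case rate within a risk group divided by the case rate in the whole population) is an assumption, since the abstract does not define it. The helper names `relative_ratio` and `auc`, and the toy inputs, are hypothetical.

```python
"""Sketch: cohort arithmetic, a per-group relative ratio, and a rank-based AUC."""

# Counts reported in the abstract.
TOTAL_WOMEN = 396_848
CONFIRMED_BC = 1_882  # BC confirmed by anatomopathological tests
BIRADS_5 = 979        # highly suspicious imaging (BI-RADS 5)

cases = CONFIRMED_BC + BIRADS_5
controls = TOTAL_WOMEN - cases
prevalence = cases / TOTAL_WOMEN


def relative_ratio(group_cases, group_size, overall_prevalence):
    """ASSUMED definition: case rate in the group / case rate overall."""
    return (group_cases / group_size) / overall_prevalence


def auc(scores, labels):
    """AUC as the probability that a random case outscores a random control.

    Ties count as half a win. This pairwise form is O(n_cases * n_controls),
    so it is only meant for small illustrative inputs.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


print(f"cases={cases}, controls={controls}, prevalence={prevalence:.2%}")

# Hypothetical risk group (NOT from the paper): 50 cases among 5,000 women.
print(f"toy relative ratio: {relative_ratio(50, 5_000, prevalence):.2f}")

# Toy scores and labels (NOT from the paper).
print(f"toy AUC: {auc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1]):.2f}")
```

Under this assumed definition, a relative ratio of 1.99 would mean the high-risk group contains roughly twice the share of cases found in the study population as a whole, and 0.42 would mean fewer than half.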

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e28a/11089041/a41482126a41/41598_2024_61215_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e28a/11089041/16bfb71e34a4/41598_2024_61215_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e28a/11089041/2a71245b7344/41598_2024_61215_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e28a/11089041/d95aaa3908a2/41598_2024_61215_Fig4_HTML.jpg

Similar articles

1. Unlocking the complete blood count as a risk stratification tool for breast cancer using machine learning: a large scale retrospective study.
Sci Rep. 2024 May 12;14(1):10841. doi: 10.1038/s41598-024-61215-y.
2. Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3. Complete blood count as a biomarker for preeclampsia with severe features diagnosis: a machine learning approach.
BMC Pregnancy Childbirth. 2024 Oct 1;24(1):628. doi: 10.1186/s12884-024-06821-4.
4. Prospective validation of the NCI Breast Cancer Risk Assessment Tool (Gail Model) on 40,000 Australian women.
Breast Cancer Res. 2018 Dec 20;20(1):155. doi: 10.1186/s13058-018-1084-x.
5. Pioneering predictions of AKI and AKIN severity in burn patients: a comprehensive CBC approach.
Sci Rep. 2024 Jan 5;14(1):675. doi: 10.1038/s41598-024-51253-x.
6. Association of the Interaction Between Mammographic Breast Density, Body Mass Index, and Menopausal Status With Breast Cancer Risk Among Korean Women.
JAMA Netw Open. 2021 Dec 1;4(12):e2139161. doi: 10.1001/jamanetworkopen.2021.39161.
7. Classification of mammographic breast density and its correlation with BI-RADS in elder women using machine learning approach.
J Med Imaging Radiat Sci. 2022 Mar;53(1):28-34. doi: 10.1016/j.jmir.2021.10.004. Epub 2021 Nov 17.
8. Predicting breast cancer risk using personal health data and machine learning models.
PLoS One. 2019 Dec 27;14(12):e0226765. doi: 10.1371/journal.pone.0226765. eCollection 2019.
9. Micronucleus Test for Diagnosing Uncertain Cases (BI-RADS 3) in Breast Cancer Screening: A Review and Preliminary Results.
Gulf J Oncolog. 2020 Jan;1(32):45-50.
10. Role of Clinical and Imaging Risk Factors in Predicting Breast Cancer Diagnosis Among BI-RADS 4 Cases.
Clin Breast Cancer. 2019 Feb;19(1):e142-e151. doi: 10.1016/j.clbc.2018.08.008. Epub 2018 Sep 5.

Cited by

1. Preoperative systemic inflammatory biomarkers can improve recurrence prediction of non-muscle invasive bladder cancer after endoscopic resection - a prospective observational study.
Contemp Oncol (Pozn). 2025;29(2):188-194. doi: 10.5114/wo.2025.151064. Epub 2025 May 13.
2. Cancer screening: recent developments and future directions.
Sci Rep. 2025 Mar 19;15(1):9436. doi: 10.1038/s41598-025-89820-5.
