

SMOTE-augmented machine learning model predicts recurrent and metastatic breast cancer from microbiome analysis.

Authors

Hong Ji Eun, Kim Yeon Eun, Kang Yun Soo, Choi Dong Hyeok, Ahn So Hyun, An Jeongshin

Affiliations

Department of Medical Science, Ewha Womans University College of Medicine, Seoul, Republic of Korea.

Ewha Womans University College of Medicine, Seoul, Republic of Korea.

Publication information

Sci Rep. 2025 Sep 26;15(1):33096. doi: 10.1038/s41598-025-16790-z.

DOI: 10.1038/s41598-025-16790-z
PMID: 41006422
Abstract

Recurrence and metastasis of breast cancer (RMBC) have a decisive impact on patient survival, necessitating reliable biomarkers for its early prediction. This study used machine learning to evaluate blood microbiome profiles as predictive biomarkers of RMBC. A retrospective predictive analysis was conducted on 288 participants, including 96 patients with breast cancer and 192 healthy controls. After 7 years of follow-up, patients were classified into disease-free survival (DFS, n = 88) and RMBC (n = 8) groups. Blood microbiome composition was analysed using 16S rRNA sequencing, followed by quality control. The Synthetic Minority Oversampling Technique (SMOTE) was employed to address class imbalance. Eleven machine learning models were trained using leave-one-out cross-validation (LOOCV) and k-fold cross-validation, and evaluated based on the area under the receiver operating characteristic curve (AUROC), recall, precision, F1-score, and Matthews correlation coefficient (MCC). Alpha diversity was significantly lower in DFS and RMBC groups than in the healthy control group (p < 0.05), while beta diversity analysis revealed distinct clustering. The random forest achieved an AUROC of 0.94, a recall of 0.81, an F1-score of 0.83, and an MCC of 0.88. Enterobacter, Bacteroides, Klebsiella, and Bifidobacterium were among the key microbial genera predicting RMBC in the top five models. Blood microbiome profiling shows potential as a non-invasive RMBC biomarker. Machine learning effectively distinguished RMBC, warranting further validation.
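
The abstract names SMOTE and several evaluation metrics but gives no implementation details. The following is a minimal pure-Python sketch of the core SMOTE interpolation step and of the metrics that can be computed from confusion-matrix counts. It is not the authors' pipeline: the function names `smote` and `binary_metrics`, the default `k=5`, and the fixed random seed are assumptions made for illustration. AUROC is left out because it needs ranked prediction scores rather than counts.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority-class neighbours.

    `minority` is a list of equal-length numeric feature vectors and must
    contain at least two samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by Euclidean distance to `base`.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def binary_metrics(tp, fp, fn, tn):
    """Recall, precision, F1-score and MCC from confusion-matrix counts."""
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"recall": recall, "precision": precision, "f1": f1, "mcc": mcc}
```

With the group sizes reported above (8 RMBC versus 88 DFS), a call such as `smote(rmbc_rows, 80)` would bring the minority class up to the majority size, where `rmbc_rows` is a hypothetical list of per-patient feature vectors. When oversampling is combined with LOOCV, generating the synthetic samples from the training fold only keeps the held-out sample from influencing them; the abstract does not say how the authors ordered these two steps.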


