• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用随机森林分类器对非洲女性的乳腺癌风险进行预测。

Breast cancer risk prediction in African women using Random Forest Classifier.

机构信息

Department of Computer Science, Lagos State University, Nigeria.

Department of Computer Science, Lagos State University, Nigeria.

出版信息

Cancer Treat Res Commun. 2021;28:100396. doi: 10.1016/j.ctarc.2021.100396. Epub 2021 May 15.

DOI:10.1016/j.ctarc.2021.100396
PMID:34049004
Abstract

INTRODUCTION

One of the most important steps in combating breast cancer is early and accurate diagnosis. Unfortunately, breast cancer is asymptomatic at the early stage, although some symptoms are presented at a later time, but at symptomatic stage treatment could be complicated or even become impossible thereby leading to death. Proper risk assessment is hence very important in reducing mortality. Some computational techniques have been developed for breast cancer risk assessment in the developed world, but such techniques do not work well in Africa because of the difference in risk profiles of African women e.g. later menarche, low drug abuse and low smoking rate.

AIM

In this work, we propose a bespoke risk prediction model for African women using Random Forest Classifier (RFC) machine learning technique.

METHODS

A total of 180 subjects were studied out of which 90 were confirmed cases of breast cancer and 90 were benign. Twenty-five risk factors were included, for example, smoking, alcohol intake, occupational hazards and age at menopause. Four approaches were empirically used in the feature selection, these are the use of Chi-Square, mutual information gain, Spearman correlation and the entire features. RFC algorithm was used to develop the prediction model.

RESULTS

We found that family history of breast cancer, dense breast, deliberate abortion, age at first child, fruit intake and regular exercise are predictors of breast cancer. The RFC model gave an accuracy of 91.67%, sensitivity of 87.10%, specificity of 96.55% and Area under curve (AUC) of 92% when all the risk factors were included in the model while an accuracy of 96.67%, sensitivity of 93.75%, specificity of 100% and AUC of 97% were obtained when correlation-selected features were included in the model. The Chi-Square selected features gave the best performance with 98.33% accuracy, 100% sensitivity, 96.55 specificity and 98% AUC. Mutual information gain selected feature gave the same results as Chi-Square selected features.

CONCLUSION

Random Forest Classifier has a good potential at predicting the risk of breast cancer in African women. The study helped to identify the risk factors of breast cancer in African women. This is a valuable information which can help African women to pay attention to those risk factors with the intention of reducing the incidence of breast cancer in Africa.

摘要

引言

乳腺癌防治最重要的措施之一是早期、准确诊断。遗憾的是,乳腺癌在早期阶段无症状,尽管一些症状在后期出现,但在出现症状时,治疗可能会变得复杂,甚至无法进行,从而导致死亡。因此,适当的风险评估对于降低死亡率非常重要。一些计算技术已经被开发出来用于发达国家的乳腺癌风险评估,但由于非洲女性的风险特征不同,例如月经初潮较晚、药物滥用和吸烟率较低等,这些技术在非洲效果不佳。

目的

本研究采用随机森林分类器(Random Forest Classifier,RFC)机器学习技术,为非洲女性建立一种定制的风险预测模型。

方法

共纳入 180 名受试者,其中 90 名为乳腺癌确诊病例,90 名为良性病例。纳入了 25 个风险因素,例如吸烟、饮酒、职业危害和绝经年龄等。我们采用了卡方检验、互信息增益、斯皮尔曼相关系数和全部特征四种方法进行特征选择,使用 RFC 算法建立预测模型。

结果

我们发现乳腺癌家族史、致密乳腺、故意流产、初产年龄、水果摄入和规律运动是乳腺癌的预测因素。当模型中包含所有风险因素时,RFC 模型的准确性为 91.67%,灵敏度为 87.10%,特异性为 96.55%,曲线下面积(area under the curve,AUC)为 92%;当模型中包含相关选择特征时,准确性为 96.67%,灵敏度为 93.75%,特异性为 100%,AUC 为 97%。卡方检验选择特征的表现最佳,准确性为 98.33%,灵敏度为 100%,特异性为 96.55%,AUC 为 98%。互信息增益选择特征的结果与卡方检验选择特征相同。

结论

随机森林分类器在预测非洲女性乳腺癌风险方面具有良好的潜力。本研究有助于确定非洲女性乳腺癌的风险因素。这是一个有价值的信息,可以帮助非洲女性关注这些风险因素,以减少非洲乳腺癌的发病率。

相似文献

1
Breast cancer risk prediction in African women using Random Forest Classifier.使用随机森林分类器对非洲女性的乳腺癌风险进行预测。
Cancer Treat Res Commun. 2021;28:100396. doi: 10.1016/j.ctarc.2021.100396. Epub 2021 May 15.
2
Evaluation of Feature Selection Techniques for Breast Cancer Risk Prediction.乳腺癌风险预测特征选择技术评估。
Int J Environ Res Public Health. 2021 Oct 12;18(20):10670. doi: 10.3390/ijerph182010670.
3
Machine learning models in breast cancer survival prediction.用于乳腺癌生存预测的机器学习模型。
Technol Health Care. 2016;24(1):31-42. doi: 10.3233/THC-151071.
4
A comparative study on feature selection for a risk prediction model for colorectal cancer.用于结直肠癌风险预测模型的特征选择的比较研究。
Comput Methods Programs Biomed. 2019 Aug;177:219-229. doi: 10.1016/j.cmpb.2019.06.001. Epub 2019 Jun 4.
5
Global parenchymal texture features based on histograms of oriented gradients improve cancer development risk estimation from healthy breasts.基于方向梯度直方图的全局实质纹理特征可提高对健康乳房癌症发展风险的评估。
Comput Methods Programs Biomed. 2019 Aug;177:123-132. doi: 10.1016/j.cmpb.2019.05.022. Epub 2019 May 22.
6
Predicting Breast Cancer in Chinese Women Using Machine Learning Techniques: Algorithm Development.运用机器学习技术预测中国女性乳腺癌:算法开发
JMIR Med Inform. 2020 Jun 8;8(6):e17364. doi: 10.2196/17364.
7
Impact of Machine Learning With Multiparametric Magnetic Resonance Imaging of the Breast for Early Prediction of Response to Neoadjuvant Chemotherapy and Survival Outcomes in Breast Cancer Patients.机器学习联合乳腺多参数磁共振成像对乳腺癌新辅助化疗早期疗效及生存预后评估的影响。
Invest Radiol. 2019 Feb;54(2):110-117. doi: 10.1097/RLI.0000000000000518.
8
Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only.仅基于流行病学数据对韩国女性乳腺癌的计算判别
J Korean Med Sci. 2015 Aug;30(8):1025-34. doi: 10.3346/jkms.2015.30.8.1025. Epub 2015 Jul 15.
9
Breast cancer prediction using different machine learning methods applying multi factors.应用多因素的不同机器学习方法进行乳腺癌预测。
J Cancer Res Clin Oncol. 2023 Dec;149(19):17133-17146. doi: 10.1007/s00432-023-05388-5. Epub 2023 Sep 29.
10
Prediction of breast cancer risk using a machine learning approach embedded with a locality preserving projection algorithm.基于局部保持投影算法的机器学习方法预测乳腺癌风险。
Phys Med Biol. 2018 Jan 30;63(3):035020. doi: 10.1088/1361-6560/aaa1ca.

引用本文的文献

1
Development and performance of female breast cancer incidence risk prediction models: a systematic review and meta-analysis.女性乳腺癌发病风险预测模型的开发与性能:一项系统评价和荟萃分析。
Ann Med. 2025 Dec;57(1):2534522. doi: 10.1080/07853890.2025.2534522. Epub 2025 Jul 20.
2
Brain tumor classification utilizing pixel distribution and spatial dependencies higher-order statistical measurements through explainable ML models.利用可解释 ML 模型进行像素分布和空间相关性高阶统计测量的脑肿瘤分类。
Sci Rep. 2024 Oct 28;14(1):25800. doi: 10.1038/s41598-024-74731-8.
3
Utilizing radiomics and dosiomics with AI for precision prediction of radiation dermatitis in breast cancer patients.
利用放射组学和剂量组学与人工智能对乳腺癌患者放射性皮炎进行精准预测。
BMC Cancer. 2024 Aug 6;24(1):965. doi: 10.1186/s12885-024-12753-1.
4
Prediction of Neoadjuvant Chemoradiotherapy Response in Rectal Cancer Patients Using Harmonized Radiomics of Multcenter F-FDG-PET Image.利用多中心F-FDG-PET图像的统一放射组学预测直肠癌患者新辅助放化疗反应
Cancers (Basel). 2023 Nov 30;15(23):5662. doi: 10.3390/cancers15235662.
5
Disulfidptosis-associated lncRNAs predict breast cancer subtypes.二硫键相关长非编码 RNA 预测乳腺癌亚型。
Sci Rep. 2023 Sep 27;13(1):16268. doi: 10.1038/s41598-023-43414-1.
6
Comparative study of convolutional neural network architectures for gastrointestinal lesions classification.卷积神经网络架构在胃肠道病变分类中的比较研究。
PeerJ. 2023 Mar 16;11:e14806. doi: 10.7717/peerj.14806. eCollection 2023.
7
Construction of machine learning-based models for cancer outcomes in low and lower-middle income countries: A scoping review.低收入和中低收入国家癌症预后的机器学习模型构建:一项范围综述。
Front Oncol. 2022 Dec 1;12:976168. doi: 10.3389/fonc.2022.976168. eCollection 2022.
8
A hybrid breast cancer classification algorithm based on meta-learning and artificial neural networks.一种基于元学习和人工神经网络的混合乳腺癌分类算法。
Front Oncol. 2022 Nov 11;12:1042964. doi: 10.3389/fonc.2022.1042964. eCollection 2022.
9
Artificial intelligence-based prediction for cancer-related outcomes in Africa: Status and potential refinements.基于人工智能对非洲癌症相关结局的预测:现状与潜在改进
J Glob Health. 2022 Apr 23;12:03017. doi: 10.7189/jogh.12.03017. eCollection 2022.