Application of decision tree-based ensemble learning in the classification of breast cancer.

Authors

Ghiasi Mohammad M, Zendehboudi Sohrab

Affiliation

Faculty of Engineering and Applied Science, Memorial University, St. John's, NL A1B 3X5, Canada.

Publication

Comput Biol Med. 2021 Jan;128:104089. doi: 10.1016/j.compbiomed.2020.104089. Epub 2020 Oct 31.

DOI: 10.1016/j.compbiomed.2020.104089
PMID: 33338982
Abstract

As a common screening and diagnostic tool, Fine Needle Aspiration Biopsy (FNAB) of suspicious breast lumps can be used to distinguish between malignant and benign breast cytology. In this study, we first review published works on breast cancer classification in which machine learning and data mining algorithms have been applied to the Wisconsin Breast Cancer Database (WBCD). This work then introduces new tools, based on the Random Forest (RF) and Extremely Randomized Trees, or Extra Trees (ET), algorithms, to classify breast cancer. The RF and ET strategies use decision trees as base classifiers to reach the final classification. Both approaches include four main stages: input identification, determination of the optimal number of trees, voting analysis, and final decision. The models implemented in this research take important factors such as uniformity of cell size, bland chromatin, mitoses, and clump thickness as input parameters. According to the statistical analysis, the proposed methods classify the type of breast cancer accurately. The error analysis shows that the designed RF and ET models offer easy-to-use outcomes and the highest diagnostic performance compared with previous tools/models in the literature for WBCD classification. Among the input factors, uniformity of cell size has the highest relative importance and mitoses the lowest. The RF and ET algorithms are expected to play an important role in medicine and health systems for screening and diagnosis in the near future.
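
The four stages named in the abstract (input identification, choosing the number of trees, voting, final decision) can be illustrated with a toy sketch. This is not the authors' implementation: it assumes synthetic data in place of the WBCD, uses one-split trees (stumps) instead of full decision trees, picks one random feature per tree, and scores splits by misclassification count. The feature names and every function name below are hypothetical. The only difference between the two modes is the one that separates the algorithms in general: the RF-style mode fits each tree on a bootstrap sample and searches for the best cut point, while the ET-style mode uses the full sample and draws the cut point at random.

```python
import random
from collections import Counter

# Stage 1, input identification: hypothetical names for four cytology scores.
FEATURES = ["clump_thickness", "cell_size_uniformity", "bland_chromatin", "mitoses"]

def make_toy_rows(n, rng):
    """Synthetic rows (NOT the WBCD): four 1-10 scores, label 0 = benign, 1 = malignant."""
    rows = []
    for _ in range(n):
        label = rng.randint(0, 1)
        low, high = (1, 5) if label == 0 else (5, 10)  # classes overlap at score 5
        rows.append(([rng.randint(low, high) for _ in FEATURES], label))
    return rows

def majority(labels, default):
    """Most common label, or `default` for an empty leaf."""
    return Counter(labels).most_common(1)[0][0] if labels else default

def fit_stump(rows, rng, random_threshold):
    """Fit a one-split tree on one randomly chosen feature."""
    f = rng.randrange(len(FEATURES))
    values = sorted({x[f] for x, _ in rows})
    if random_threshold or len(values) < 2:
        candidates = [rng.uniform(values[0], values[-1])]  # ET-style: random cut point
    else:
        candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]  # RF-style: search
    best = None
    for t in candidates:
        left = [y for x, y in rows if x[f] <= t]
        right = [y for x, y in rows if x[f] > t]
        left_label, right_label = majority(left, 0), majority(right, 1)
        errors = sum(y != left_label for y in left) + sum(y != right_label for y in right)
        if best is None or errors < best[0]:
            best = (errors, f, t, left_label, right_label)
    return best[1:]  # (feature index, threshold, left label, right label)

def fit_ensemble(rows, n_trees, rng, extra_trees):
    """RF-style trees see a bootstrap sample; ET-style trees see every row."""
    ensemble = []
    for _ in range(n_trees):
        sample = rows if extra_trees else [rng.choice(rows) for _ in rows]
        ensemble.append(fit_stump(sample, rng, random_threshold=extra_trees))
    return ensemble

def predict(ensemble, x):
    """Stages 3 and 4: each tree votes, and the majority label is the final decision."""
    votes = Counter(left if x[f] <= t else right for f, t, left, right in ensemble)
    return votes.most_common(1)[0][0]

def accuracy(ensemble, rows):
    return sum(predict(ensemble, x) == y for x, y in rows) / len(rows)

def pick_n_trees(train, valid, sizes, seed, extra_trees):
    """Stage 2: score each candidate ensemble size on held-out rows and keep the best."""
    scores = {}
    for n in sizes:
        ensemble = fit_ensemble(train, n, random.Random(seed), extra_trees)
        scores[n] = accuracy(ensemble, valid)
    return max(scores, key=scores.get), scores

data_rng = random.Random(0)
train, valid = make_toy_rows(300, data_rng), make_toy_rows(150, data_rng)
sizes = (1, 5, 25, 51)  # odd sizes avoid tied votes in a two-class problem
rf_n, rf_scores = pick_n_trees(train, valid, sizes, seed=1, extra_trees=False)
et_n, et_scores = pick_n_trees(train, valid, sizes, seed=1, extra_trees=True)
print("RF accuracy by ensemble size:", rf_scores, "-> chosen", rf_n)
print("ET accuracy by ensemble size:", et_scores, "-> chosen", et_n)
```

Production implementations grow full trees, score splits with an impurity measure such as Gini, and consider several candidate features per split, so the accuracies printed here say nothing about the results reported in the paper.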

Similar articles

1
Application of decision tree-based ensemble learning in the classification of breast cancer.
Comput Biol Med. 2021 Jan;128:104089. doi: 10.1016/j.compbiomed.2020.104089. Epub 2020 Oct 31.
2
Tree-Based and Machine Learning Algorithm Analysis for Breast Cancer Classification.
Comput Intell Neurosci. 2022 Jul 7;2022:6715406. doi: 10.1155/2022/6715406. eCollection 2022.
3
Reviewing ensemble classification methods in breast cancer.
Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.
4
Predicting factors for survival of breast cancer patients using machine learning techniques.
BMC Med Inform Decis Mak. 2019 Mar 22;19(1):48. doi: 10.1186/s12911-019-0801-4.
5
Analysis of Decision Tree and K-Nearest Neighbor Algorithm in the Classification of Breast Cancer.
Asian Pac J Cancer Prev. 2019 Dec 1;20(12):3777-3781. doi: 10.31557/APJCP.2019.20.12.3777.
6
Breast Tumor Classification Using an Ensemble Machine Learning Method.
J Imaging. 2020 May 29;6(6):39. doi: 10.3390/jimaging6060039.
7
Application of rotation forest with decision trees as base classifier and a novel ensemble model in spatial modeling of groundwater potential.
Environ Monit Assess. 2019 Mar 27;191(4):248. doi: 10.1007/s10661-019-7362-y.
8
Classification of genomic islands using decision trees and their ensemble algorithms.
BMC Genomics. 2010 Nov 2;11 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2164-11-S2-S1.
9
A hybrid cost-sensitive ensemble for imbalanced breast thermogram classification.
Artif Intell Med. 2015 Nov;65(3):219-27. doi: 10.1016/j.artmed.2015.07.005. Epub 2015 Jul 31.
10
Using machine learning models to improve stroke risk level classification methods of China national stroke screening.
BMC Med Inform Decis Mak. 2019 Dec 10;19(1):261. doi: 10.1186/s12911-019-0998-2.

Cited by

1
Data-driven frameworks to robustly predict solubility parameter of diverse polymers.
Sci Rep. 2025 Aug 25;15(1):31157. doi: 10.1038/s41598-025-12758-1.
2
Artificial Intelligence Models in Diagnosis and Treatment of Kidney Diseases: Current Status and Prospects.
Kidney Dis (Basel). 2025 Jun 12;11(1):491-507. doi: 10.1159/000546397. eCollection 2025 Jan-Dec.
3
New hybrid features extracted from US images for breast cancer classification.
Sci Rep. 2025 Jul 16;15(1):25690. doi: 10.1038/s41598-025-09554-2.
4
CT-based radiomics integrated model for brain metastases in stage III/IV ALK-positive lung adenocarcinoma patients.
Front Oncol. 2025 Jun 18;15:1585930. doi: 10.3389/fonc.2025.1585930. eCollection 2025.
5
The Role of Artificial Intelligence in Advancing Biosensor Technology: Past, Present, and Future Perspectives.
Adv Mater. 2025 Aug;37(34):e2504796. doi: 10.1002/adma.202504796. Epub 2025 Jun 16.
6
Improved Machine Learning Predictions of EC50s Using Uncertainty Estimation from Dose-Response Data.
J Chem Inf Model. 2025 Jun 9;65(11):5623-5634. doi: 10.1021/acs.jcim.5c00249. Epub 2025 May 19.
7
Benign and Malignant Breast Lesions: Differentiation Using Microstructural Metrics Derived from Time-Dependent Diffusion MRI.
Radiol Imaging Cancer. 2025 May;7(3):e240287. doi: 10.1148/rycan.240287.
8
Enhancing breast cancer diagnosis: transfer learning on DenseNet with neural hashing for histopathology fine-grained image classification.
Med Biol Eng Comput. 2025 Apr 6. doi: 10.1007/s11517-025-03346-6.
9
Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: a single-centre retrospective cohort study in China.
BMJ Open. 2025 Apr 3;15(4):e092463. doi: 10.1136/bmjopen-2024-092463.
10
Application of the Random Forest Algorithm for Accurate Bipolar Disorder Classification.
Life (Basel). 2025 Mar 3;15(3):394. doi: 10.3390/life15030394.