Suppr超能文献

从超声图像中提取的用于乳腺癌分类的新型混合特征。

New hybrid features extracted from US images for breast cancer classification.

作者信息

Tăbăcaru Gigi, Moldovanu Simona, Munteanu Dan, Barbu Marian

机构信息

Department of Automatic Control, Faculty of Automation, Computers, Electrical, Engineering and Electronics, "Dunărea de Jos" University of Galati, Galaţi, Romania.

Computer Science and Information Technology, Faculty of Automation, Computers, Electrical Engineering and Electronics, "Dunărea de Jos" University of Galati, Galaţi, Romania.

出版信息

Sci Rep. 2025 Jul 16;15(1):25690. doi: 10.1038/s41598-025-09554-2.

Abstract

Artificial intelligence (AI), and image processing fields play a vital role in classifying benign and malignant breast cancer (BC). The novelty of this paper lies in computing original hybrid features (HF) from textural and shape features of BC integrated into a polynomial regression, and their classification with two different Automated Machine Learning (AutoML). The obtained data are original; therefore, a previous analysis of them with violin graphs was needed. For computing of the hybrid features, the Haralick textural features and Hu moments were integrated in a polynomial regression way. In this context, two different AutoML, PyCaret and TPOT (Tree-based Pipeline Optimization Tool) were proposed, and the optimal model for hybrid features included in the classification process was identified during the tuning process. The experimental results indicated that the HF, composed of entropy and Hu moments, was selected by PyCaret using the AdaBoost Classifier (ADB) as the optimal classifier, achieving an accuracy of 91.4%. Additionally, TPOT employed a Multilayer Perceptron Classifier, which provided an accuracy of 90.6%. These findings identified the most effective features for classifying benign and malignant breast cancer (BC). Enhancing computational efficiency reduces the risk of overfitting; hence, the bagging, boosting, and stacking Ensemble Machine Learning (EML) techniques were proposed to validate the obtained results. The study's originality lies in the HF's capacity to accurately represent and capture the lesion's texture and shape, just like a physician makes a BC diagnosis.

摘要

人工智能(AI)和图像处理领域在乳腺癌(BC)良恶性分类中起着至关重要的作用。本文的新颖之处在于从乳腺癌的纹理和形状特征中计算出原始混合特征(HF),并将其整合到多项式回归中,然后使用两种不同的自动机器学习(AutoML)方法进行分类。所获得的数据是原始的;因此,需要用小提琴图对其进行先前的分析。为了计算混合特征,将哈勒克纹理特征和胡矩以多项式回归的方式进行整合。在此背景下,提出了两种不同的AutoML方法,即PyCaret和TPOT(基于树的管道优化工具),并在调优过程中确定了分类过程中包含的混合特征的最优模型。实验结果表明,由熵和胡矩组成的HF被PyCaret选择,使用AdaBoost分类器(ADB)作为最优分类器,准确率达到91.4%。此外,TPOT采用了多层感知器分类器,其准确率为90.6%。这些发现确定了用于乳腺癌(BC)良恶性分类的最有效特征。提高计算效率可降低过拟合风险;因此,提出了装袋、提升和堆叠集成机器学习(EML)技术来验证所得结果。该研究的独创性在于HF能够像医生诊断乳腺癌一样准确地表示和捕捉病变的纹理和形状。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aba6/12267442/6f14c7d18362/41598_2025_9554_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验