El Kati Yassine, Wang Shu-Lin, Taresh Mundher Mohammed, Ali Talal Ahmed Ali
College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China.
PeerJ Comput Sci. 2024 Dec 23;10:e2578. doi: 10.7717/peerj-cs.2578. eCollection 2024.
Breast cancer (BC) is one of the most common causes of mortality among women. However, early detection of BC can effectively improve the treatment outcomes. Computer-aided diagnosis (CAD) systems can be utilized clinical specialists for accurate diagnosis of BC in its early stages. Due to their superior classification performance, deep learning (DL) methods have been extensively used in CAD systems. The classification accuracy of a DL model mainly depends on the parameters, such as weights and biases, of the deep neural network (DNN), which are optimized during the training phase. The training of DL models has been carried out by gradient-based techniques, ., stochastic gradient descent with momentum (SGDM) and adaptive momentum estimation (ADAM), and metaheuristic techniques, ., genetic algorithms (GA) and particle swarm optimization (PSO). However, these techniques suffer from frequent stagnation in local optima due to the huge search space, which can lead to sub-optimal DL performance. This article proposes a hybrid optimization algorithm, based on incorporating a simple gradient search mechanism into a metaheuristic technique, multi-verse optimizer (MVO), to facilitate the search for global optimal solution in the high-dimensional search space of DL models. A DL model for BC diagnosis is developed based on a three-hidden-layer DNN whose parameters are optimized using the proposed hybrid optimizer. Experimental analysis is carried out on the Wisconsin breast cancer dataset (WBCD) and the Wisconsin Diagnosis Breast Cancer (WDBC) dataset, each is divided into 70% for training and 30% for testing. For comparison reasons, similar DL models trained using various optimizers, including gradient-based, metaheuristic, and recently-proposed hybrid optimization algorithms, are also analyzed. The results demonstrate the superior performance of our optimizer in terms of attaining the most accurate DL model in the fastest convergence rate. The proposed model achieves outstanding metrics, including accuracy at 93.5%, precision at 88.06%, specificity at 93.06%, sensitivity at 95.64%, F1 score at 91.67%, and Matthew's correlation coefficient (MCC) at 87.14% on WBCD, and accuracy at 96.73%, precision at 93.38%, specificity at 95.83%, sensitivity at 98.25%, F1 score at 95.75%, and MCC at 93.18% on WDBC, in just six epochs. This research significantly contributes to advancing CAD systems for BC, emphasizing the potential benefits of the proposed optimizer in medical classification domains.
乳腺癌(BC)是女性死亡的最常见原因之一。然而,早期发现乳腺癌可以有效改善治疗效果。计算机辅助诊断(CAD)系统可协助临床专家对乳腺癌进行早期准确诊断。由于其卓越的分类性能,深度学习(DL)方法已在CAD系统中广泛应用。DL模型的分类准确率主要取决于深度神经网络(DNN)的参数,如权重和偏差,这些参数在训练阶段进行优化。DL模型的训练通过基于梯度的技术(如带动量的随机梯度下降(SGDM)和自适应动量估计(ADAM))以及元启发式技术(如遗传算法(GA)和粒子群优化(PSO))来进行。然而,由于搜索空间巨大,这些技术经常陷入局部最优停滞,这可能导致DL性能次优。本文提出一种混合优化算法,该算法将简单梯度搜索机制融入元启发式技术——多宇宙优化器(MVO),以促进在DL模型的高维搜索空间中寻找全局最优解。基于具有三个隐藏层的DNN开发了一种用于乳腺癌诊断的DL模型,其参数使用所提出的混合优化器进行优化。在威斯康星乳腺癌数据集(WBCD)和威斯康星诊断乳腺癌(WDBC)数据集上进行了实验分析,每个数据集分为70%用于训练和30%用于测试。为作比较,还分析了使用各种优化器(包括基于梯度的、元启发式的以及最近提出的混合优化算法)训练的类似DL模型。结果表明,我们的优化器在以最快收敛速度获得最准确DL模型方面具有卓越性能。所提出的模型在WBCD上实现了出色的指标,包括准确率93.5%、精确率88.06%、特异性93.06%、灵敏度95.64%、F1分数91.67%以及马修斯相关系数(MCC)87.14%;在WDBC上,在仅六个训练轮次中就实现了准确率96.73%、精确率93.38%、特异性95.83%、灵敏度98.25%、F1分数95.75%以及MCC 93.18%。这项研究对推进乳腺癌CAD系统具有重要意义,强调了所提出的优化器在医学分类领域的潜在益处。