Hussain Mohammed, Thaher Thaer, Almourad Mohamed Basel, Mafarja Majdi
College of Technological Innovation, Zayed University, Dubai, United Arab Emirates.
Department of Computer Systems Engineering, Arab American University, Jenin, Palestine.
Sci Rep. 2024 Dec 30;14(1):31759. doi: 10.1038/s41598-024-82022-5.
Accurate classification of logos is a challenging task in image recognition due to variations in logo size, orientation, and background complexity. Deep learning models, such as VGG16, have demonstrated promising results in handling such tasks. However, their performance is highly dependent on optimal hyperparameter settings, whose fine-tuning is both labor-intensive and time-consuming. Swarm intelligence algorithms have been widely adopted to solve many highly nonlinear, multimodal problems and have succeeded significantly. The Hunger Games Search (HGS) is a recent swarm intelligence algorithm that has shown good performance across various applications. However, the standard HGS still faces limitations, such as restricted population diversity and a tendency to get trapped in local optima, which can hinder its effectiveness. In this paper, we propose an optimized deep learning architecture called EHGS-VGG16 designed based on the VGG16 model and boosted by an enhanced Hunger Games Search (EHGS) algorithm for hyperparameter tuning. The proposed enhancement to HGS involves modified search strategies, incorporating the concepts of "local best" and a "local escaping mechanism" to improve its exploration capability. To validate our approach, the evaluation is conducted in three folds. First, the EHGS algorithm is evaluated through 30 real-valued benchmark functions from the IEEE CEC2014 suite. Second, a custom-developed VGG16 model is tested on the Flickr-27 logo classification dataset and compared against state-of-the-art deep learning models such as ResNet50V2, InceptionV3, DenseNet121, EfficientNetB0, and MobileNetV2. Finally, EHGS is integrated into the VGG16 model to optimize its hyperparameters. The experimental results show that VGG16 outperformed the other counterparts with an accuracy of 0.956966, a precision of 0.957137, and a recall of 0.956966. Moreover, the integration of EHGS further improved classification quality by 3%. These findings highlight the potential of combining evolutionary optimization techniques with deep learning for enhanced accuracy in log classification tasks.
由于标志的大小、方向和背景复杂性存在差异,准确分类标志在图像识别中是一项具有挑战性的任务。深度学习模型,如VGG16,在处理此类任务方面已显示出有前景的结果。然而,它们的性能高度依赖于最优超参数设置,而超参数的微调既耗费人力又耗时。群体智能算法已被广泛用于解决许多高度非线性、多模态问题,并取得了显著成功。饥饿游戏搜索(HGS)是一种最近的群体智能算法,在各种应用中都表现出良好的性能。然而,标准的HGS仍然面临局限性,如群体多样性受限以及容易陷入局部最优,这可能会阻碍其有效性。在本文中,我们提出了一种优化的深度学习架构,称为EHGS-VGG16,它基于VGG16模型设计,并通过增强的饥饿游戏搜索(EHGS)算法进行超参数调整来提升性能。对HGS的改进包括修改搜索策略,融入“局部最优”和“局部逃逸机制”的概念,以提高其探索能力。为了验证我们的方法,进行了三方面的评估。首先,通过来自IEEE CEC2014套件的30个实值基准函数对EHGS算法进行评估。其次,在Flickr-27标志分类数据集上测试一个定制开发的VGG16模型,并与诸如ResNet50V2、InceptionV3、DenseNet121、EfficientNetB0和MobileNetV2等当前最先进的深度学习模型进行比较。最后,将EHGS集成到VGG16模型中以优化其超参数。实验结果表明,VGG16的准确率为0.956966,精确率为0.957137,召回率为0.956966,优于其他同类模型。此外,EHGS的集成进一步将分类质量提高了3%。这些发现凸显了将进化优化技术与深度学习相结合在提高标志分类任务准确性方面的潜力。