Khan Zeshan Aslam, Waqar Muhammad, Cheema Khalid Mehmood, Bakar Mahmood Ali Abu, Ain Quratul, Chaudhary Naveed Ishtiaq, Alshehri Abdullah, Alshamrani Sultan S, Zahoor Raja Muhammad Asif
International Graduate Institute of Artificial Intelligence, National Yunlin University of Science and Technology, 123 University Road, Section 3, Douliou, Yunlin, 64002, Taiwan, ROC.
Department of Electronic Engineering, Fatima Jinnah Women University, Rawalpindi, Pakistan.
Heliyon. 2024 Nov 30;10(23):e40820. doi: 10.1016/j.heliyon.2024.e40820. eCollection 2024 Dec 15.
The quality of vegetables and fruits are judged by their visual features. Misclassification of fruits and vegetables lead to a financial loss. To prevent the loss, superstores need to classify fruits and vegetables in terms of size, color and shape. To improve the accuracy of models for classifying fruits and vegetables, researchers have introduced various CNN architectures like VGG16, AlexNet, DenseNet-121 and different attention-based variants by compromising models' complexity leading to high computational cost and overlooking feature's contribution for interpretable predictions. Additionally, the major drawback in most of the existing works is the utilization of the limited dataset and number of classes for fruit and vegetable classification task instead of the benchmark dataset like Fruit-360 which comprises of 141 distinct fruit and vegetable classes. In this study, an explainable artificial intelligence (XAI) driven enhanced attention-CNN (EA-CNN) is proposed for accomplishing the fruit and vegetable classification task accurately and efficiently. The proposed EA-CNN model (a) detects the class of fruits and vegetable accurately through visual features by manipulating undiscover customized pooling technique and enhanced attention feature extraction mechanism, (b) classifies the fruits and vegetables efficiently through a simplified architecture, (c) provides interpretable predictions through XAI approach. By utilizing entire fruit-360 database, this research noteworthily enhances the model's capability to classify wide range of fruits and vegetables, thus providing an effective and reliable solution for practical applications. The proposed study outclasses baseline models with regard to accuracy and computational cost on fruit-360 benchmark dataset. EA-CNN achieves better accuracy of 98.1 % in smaller number of iterations as compared to benchmark models. Moreover, to further prove the efficiency, generalization ability, robustness, scalability and adaptability of this research, the architecture of proposed EA-CNN model is validated on another real-world dataset named as 'Fruit Recognition'. The experimental outcomes display that the proposed EA-CNN model produces substantial performance on Fruit-Recognition dataset as well by attaining a generalized accuracy of 96 %.
蔬菜和水果的质量通过其视觉特征来判断。水果和蔬菜的错误分类会导致经济损失。为防止损失,超市需要根据大小、颜色和形状对水果和蔬菜进行分类。为提高水果和蔬菜分类模型的准确性,研究人员引入了各种卷积神经网络(CNN)架构,如VGG16、AlexNet、DenseNet - 121以及不同的基于注意力的变体,但这些方法在模型复杂度上做出了妥协,导致计算成本高昂,且忽视了特征对可解释预测的贡献。此外,大多数现有工作的主要缺点是在水果和蔬菜分类任务中使用的数据集和类别数量有限,而不是像包含141个不同水果和蔬菜类别的Fruit - 360这样的基准数据集。在本研究中,提出了一种由可解释人工智能(XAI)驱动的增强注意力CNN(EA - CNN),以准确、高效地完成水果和蔬菜分类任务。所提出的EA - CNN模型:(a)通过操纵未发现的定制池化技术和增强注意力特征提取机制,通过视觉特征准确检测水果和蔬菜的类别;(b)通过简化架构高效地对水果和蔬菜进行分类;(c)通过XAI方法提供可解释的预测。通过利用整个Fruit - 360数据库,本研究显著提高了模型对各种水果和蔬菜进行分类的能力,从而为实际应用提供了有效且可靠的解决方案。在所提出的研究中,在Fruit - 360基准数据集上,EA - CNN在准确率和计算成本方面优于基线模型。与基准模型相比,EA - CNN在较少的迭代次数下实现了98.1%的更高准确率。此外,为进一步证明本研究的效率、泛化能力、鲁棒性、可扩展性和适应性,在所提出的EA - CNN模型架构在另一个名为“水果识别”的真实世界数据集上进行了验证。实验结果表明,所提出的EA - CNN模型在水果识别数据集上也取得了显著性能,达到了96%的泛化准确率。