Ahmad Naeem, Singh Shubham, AlAjmi Mohamed Fahad, Hussain Afzal, Raza Khalid
Department of Computer Applications, National Institute of Technology Raipur (NITR), Raipur, India.
Department of Pharmacognosy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia.
PeerJ Comput Sci. 2024 Jul 29;10:e2136. doi: 10.7717/peerj-cs.2136. eCollection 2024.
Classifying images is one of the most important tasks in computer vision. Recently, the best performance for image classification tasks has been shown by networks that are both deep and well-connected. These days, most datasets are made up of a fixed number of color images. The input images are taken in red green blue (RGB) format and classified without any changes being made to the original. It is observed that color spaces (basically changing original RGB images) have a major impact on classification accuracy, and we delve into the significance of color spaces. Moreover, datasets with a highly variable number of classes, such as the PlantVillage dataset utilizing a model that incorporates numerous color spaces inside the same model, achieve great levels of accuracy, and different classes of images are better represented in different color spaces. Furthermore, we demonstrate that this type of model, in which the input is preprocessed into many color spaces simultaneously, requires significantly fewer parameters to achieve high accuracy for classification. The proposed model basically takes an RGB image as input, turns it into seven separate color spaces at once, and then feeds each of those color spaces into its own Convolutional Neural Network (CNN) model. To lessen the load on the computer and the number of hyperparameters needed, we employ group convolutional layers in the proposed CNN model. We achieve substantial gains over the present state-of-the-art methods for the classification of crop disease.
图像分类是计算机视觉中最重要的任务之一。最近,深度且连接良好的网络在图像分类任务中展现出了最佳性能。如今,大多数数据集由固定数量的彩色图像组成。输入图像采用红绿蓝(RGB)格式,且在不对原始图像进行任何更改的情况下进行分类。据观察,颜色空间(基本上是对原始RGB图像进行变换)对分类准确率有重大影响,我们深入探讨了颜色空间的重要性。此外,具有高度可变类别数量的数据集,例如利用在同一模型中纳入多种颜色空间的模型的植物村数据集,能够达到很高的准确率,并且不同类别的图像在不同颜色空间中能得到更好的呈现。此外,我们证明了这种同时将输入预处理为多种颜色空间的模型,在实现高分类准确率时所需的参数要少得多。所提出的模型基本上以RGB图像作为输入,一次性将其转换为七个独立的颜色空间,然后将每个颜色空间输入到其各自的卷积神经网络(CNN)模型中。为了减轻计算机的负载以及所需超参数的数量,我们在所提出的CNN模型中采用了分组卷积层。在作物病害分类方面,我们相对于当前的先进方法取得了显著的进展。