Rimiru Richard M, Gateri Judy, Kimwele Micheal W
Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya.
PeerJ Comput Sci. 2022 Feb 25;8:e890. doi: 10.7717/peerj-cs.890. eCollection 2022.
Content-Based Image Retrieval (CBIR) is the cornerstone of today's image retrieval systems. The most distinctive retrieval approach used, involves the submission of an image-based query whereby the system is used in the extraction of visual characteristics like the shape, color, and texture from the images. Examination of the characteristics is done for ensuring the searching and retrieval of proportional images from the image database. Majority of the datasets utilized for retrieval lean towards to comprise colored images. The colored images are regarded as in RGB (Red, Green, Blue) form. Most colored images use the RGB image for classifying the images. The research presents the transformation of RGB to other color spaces, extraction of features using different color spaces techniques, Gabor filter and use Convolutional Neural Networks for retrieval to find the most efficient combination. The model is also known as Gabor Convolution Network. Even though the notion of the Gabor filter being induced in CNN has been suggested earlier, this work introduces an entirely different and very simple Gabor-based CNN which produces high recognition efficiency. In this paper, Gabor Convolutional Networks (GCNs or GaborNet), with different color spaces are used to examine which combination is efficient to retrieve natural images. An extensive experiment using Cifar 10 dataset was made and comparison of simple CNN, ResNet 50 and GCN model was also made. The models were evaluated through a several statistical analysis based on accuracy, precision, recall, F-Score, area under the curve (AUC), and receiving operating characteristic (ROC) curve. The results shows GaborNet model effectively retrieve images with 99.68% of AUC and 99.09% of Recall. The results also shows different images are effectively retrieved using different color space. Therefore research concluded it is very significance to transform images to different color space and use GaborNet for effective retrieval.
基于内容的图像检索(CBIR)是当今图像检索系统的基石。所使用的最具特色的检索方法涉及提交基于图像的查询,通过该查询系统用于从图像中提取形状、颜色和纹理等视觉特征。对这些特征进行检查是为了确保从图像数据库中搜索和检索相似的图像。用于检索的大多数数据集倾向于包含彩色图像。彩色图像被视为RGB(红、绿、蓝)格式。大多数彩色图像使用RGB图像对图像进行分类。该研究提出了将RGB转换为其他颜色空间,使用不同颜色空间技术提取特征,使用Gabor滤波器,并使用卷积神经网络进行检索以找到最有效的组合。该模型也被称为Gabor卷积网络。尽管之前已经有人提出在卷积神经网络中引入Gabor滤波器的概念,但这项工作引入了一种完全不同且非常简单的基于Gabor的卷积神经网络,其具有很高的识别效率。在本文中,使用不同颜色空间的Gabor卷积网络(GCNs或GaborNet)来研究哪种组合在检索自然图像时是有效的。使用Cifar 10数据集进行了广泛的实验,并对简单卷积神经网络、ResNet 50和GCN模型进行了比较。通过基于准确率、精确率、召回率、F值、曲线下面积(AUC)和接收操作特征(ROC)曲线的多种统计分析对模型进行评估。结果表明,GaborNet模型能够有效地检索图像,并具有99.68%的AUC和99.09%的召回率。结果还表明,使用不同的颜色空间可以有效地检索不同的图像。因此,研究得出结论,将图像转换为不同的颜色空间并使用GaborNet进行有效检索具有非常重要的意义。