Department of ICT Education, University of Education, Winneba, Ghana.
Department of Computer Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana.
Comput Intell Neurosci. 2022 Sep 27;2022:1189509. doi: 10.1155/2022/1189509. eCollection 2022.
Computer vision is the science that enables computers and machines to see and perceive image content on a semantic level. It combines concepts, techniques, and ideas from various fields such as digital image processing, pattern matching, artificial intelligence, and computer graphics. A computer vision system is designed to model the human visual system on a functional basis as closely as possible. Deep learning and Convolutional Neural Networks (CNNs) in particular which are biologically inspired have significantly contributed to computer vision studies. This research develops a computer vision system that uses CNNs and handcrafted filters from Log-Gabor filters to identify medicinal plants based on their leaf textural features in an ensemble manner. The system was tested on a dataset developed from the Centre of Plant Medicine Research, Ghana (MyDataset) consisting of forty-nine (49) plant species. Using the concept of transfer learning, ten pretrained networks including Alexnet, GoogLeNet, DenseNet201, Inceptionv3, Mobilenetv2, Restnet18, Resnet50, Resnet101, vgg16, and vgg19 were used as feature extractors. The DenseNet201 architecture resulted with the best outcome of 87% accuracy and GoogLeNet with 79% preforming the worse averaged across six supervised learning algorithms. The proposed model (OTAMNet), created by fusing a Log-Gabor layer into the transition layers of the DenseNet201 architecture achieved 98% accuracy when tested on MyDataset. OTAMNet was tested on other benchmark datasets; Flavia, Swedish Leaf, MD2020, and the Folio dataset. The Flavia dataset achieved 99%, Swedish Leaf 100%, MD2020 99%, and the Folio dataset 97%. A false-positive rate of less than 0.1% was achieved in all cases.
计算机视觉是一门使计算机和机器能够在语义层面上感知图像内容的科学。它结合了数字图像处理、模式匹配、人工智能和计算机图形学等领域的概念、技术和思想。计算机视觉系统旨在在功能基础上尽可能紧密地模拟人类视觉系统。特别是受生物启发的深度学习和卷积神经网络(CNN)极大地促进了计算机视觉研究。本研究开发了一种计算机视觉系统,该系统使用 CNN 和从 Log-Gabor 滤波器提取的手工滤波器,以纹理特征的方式对药用植物进行识别。该系统在加纳植物医学研究中心(MyDataset)开发的数据集上进行了测试,该数据集包含四十九(49)种植物物种。使用迁移学习的概念,使用包括 Alexnet、GoogLeNet、DenseNet201、Inceptionv3、Mobilenetv2、Restnet18、Resnet50、Resnet101、vgg16 和 vgg19 在内的十个预训练网络作为特征提取器。DenseNet201 架构的结果是准确率最高的 87%,而 GoogLeNet 的平均准确率最差,为 79%,这是在六个监督学习算法中得出的。所提出的模型(OTAMNet)通过将 Log-Gabor 层融合到 DenseNet201 架构的过渡层中,在 MyDataset 上测试时达到了 98%的准确率。OTAMNet 在其他基准数据集 Flavia、Swedish Leaf、MD2020 和 Folio 上进行了测试。Flavia 数据集的准确率为 99%,Swedish Leaf 为 100%,MD2020 为 99%,Folio 数据集为 97%。在所有情况下,假阳性率都低于 0.1%。