Suppr超能文献

基于视觉注意的深度学习方法的花卉识别。

Visual attentional-driven deep learning method for flower recognition.

机构信息

School of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China.

Nanjing University of Information Science and Technology, Nanjing 210044, China.

出版信息

Math Biosci Eng. 2021 Feb 25;18(3):1981-1991. doi: 10.3934/mbe.2021103.

Abstract

As a typical fine-grained image recognition task, flower category recognition is one of the most popular research topics in the field of computer vision and forestry informatization. Although the image recognition method based on Deep Convolutional Neural Network (DCNNs) has achieved acceptable performance on natural scene image, there are still shortcomings such as lack of training samples, intra-class similarity and low accuracy in flowers category recognition. In this paper, we study deep learning-based flowers' category recognition problem, and propose a novel attention-driven deep learning model to solve it. Specifically, since training the deep learning model usually requires massive training samples, we perform image augmentation for the training sample by using image rotation and cropping. The augmented images and the original image are merged as a training set. Then, inspired by the mechanism of human visual attention, we propose a visual attention-driven deep residual neural network, which is composed of multiple weighted visual attention learning blocks. Each visual attention learning block is composed by a residual connection and an attention connection to enhance the learning ability and discriminating ability of the whole network. Finally, the model is training in the fusion training set and recognize flowers in the testing set. We verify the performance of our new method on public Flowers 17 dataset and it achieves the recognition accuracy of 85.7%.

摘要

作为典型的细粒度图像识别任务,花卉分类识别是计算机视觉和林业信息化领域最热门的研究课题之一。虽然基于深度卷积神经网络(DCNN)的图像识别方法在自然场景图像上取得了可接受的性能,但在花卉分类识别中仍然存在训练样本不足、类内相似度低和准确率低等缺点。在本文中,我们研究了基于深度学习的花卉分类识别问题,并提出了一种新的基于注意力的深度学习模型来解决该问题。具体来说,由于训练深度学习模型通常需要大量的训练样本,我们通过图像旋转和裁剪对训练样本进行图像增强。增强后的图像和原始图像合并为一个训练集。然后,受人类视觉注意力机制的启发,我们提出了一种视觉注意力驱动的深度残差神经网络,它由多个加权视觉注意力学习块组成。每个视觉注意力学习块由一个残差连接和一个注意力连接组成,以增强整个网络的学习能力和辨别能力。最后,在融合的训练集上对模型进行训练,并在测试集上识别花卉。我们在公共 Flowers 17 数据集上验证了我们新方法的性能,其识别准确率达到 85.7%。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验