School of Electronic Information, Wuhan University, Wuhan 430072, China.
School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China.
Sensors (Basel). 2021 Jan 8;21(2):392. doi: 10.3390/s21020392.
In production, due to natural conditions or process peculiarities, a single product often may exhibit more than one type of defect. The accurate identification of all defects has an important guiding significance and practical value to improve the planting and production processes. Concerning the surface defect classification task, convolutional neural networks can be implemented as a powerful instrument. However, a typical convolutional neural network tends to consider an image as an inseparable entity and a single instance when extracting features; moreover, it may overlook semantic correlations between different labels. To address these limitations, in the present paper, we proposed a feature-wise attention-based relation network (FAR-Net) for multilabel jujube defect classification. The network included four different modules designed for (1) image feature extraction, (2) label-wise feature aggregation, (3) feature activation and deactivation, and (4) correlation learning among labels. To evaluate the proposed method, a unique multilabel jujube defect dataset was constructed as a benchmark for the multilabel classification task of the jujube defect images. The results of experiments show that owing to the relation learning mechanism, the average precision of the three main composite defects in the dataset increases by 5.77%, 4.07%, and 3.50%, respectively, compared to the backbone of our network, namely Inception v3, which indicated that the proposed FAR-Net effectively facilitated the learning of correlation between labels and eventually, improved the multilabel classification accuracy.
在生产中,由于自然条件或工艺特点,单一产品往往可能表现出多种类型的缺陷。准确识别所有缺陷对改进种植和生产工艺具有重要的指导意义和实用价值。对于表面缺陷分类任务,可以使用卷积神经网络作为一种强大的工具。然而,典型的卷积神经网络在提取特征时往往将图像视为不可分割的实体和单个实例,并且可能忽略不同标签之间的语义相关性。为了解决这些限制,在本文中,我们提出了一种基于特征的注意力关系网络(FAR-Net),用于多标签红枣缺陷分类。该网络包括四个不同的模块,分别用于(1)图像特征提取,(2)标签特征聚合,(3)特征激活和去激活,以及(4)标签之间的相关性学习。为了评估所提出的方法,构建了一个独特的多标签红枣缺陷数据集,作为红枣缺陷图像多标签分类任务的基准。实验结果表明,由于关系学习机制,与我们网络的主干网络(即 Inception v3)相比,数据集的三个主要复合缺陷的平均精度分别提高了 5.77%、4.07%和 3.50%,这表明所提出的 FAR-Net 有效地促进了标签之间的相关性学习,最终提高了多标签分类精度。