Al-Sahaf Harith, Al-Sahaf Ausama, Xue Bing, Zhang Mengjie
School of Engineering and Computer Science, Victoria University of Wellington, PO Box 600, Wellington 6140, New Zealand
Evol Comput. 2021 Sep 1;29(3):331-366. doi: 10.1162/evco_a_00284.
The performance of image classification is highly dependent on the quality of the extracted features that are used to build a model. Designing such features usually requires prior knowledge of the domain and is often undertaken by a domain expert who, if available, is very costly to employ. Automating the process of designing such features can largely reduce the cost and efforts associated with this task. Image descriptors, such as local binary patterns, have emerged in computer vision, and aim at detecting keypoints, for example, corners, line-segments, and shapes, in an image and extracting features from those keypoints. In this article, genetic programming (GP) is used to automatically evolve an image descriptor using only two instances per class by utilising a multitree program representation. The automatically evolved descriptor operates directly on the raw pixel values of an image and generates the corresponding feature vector. Seven well-known datasets were adapted to the few-shot setting and used to assess the performance of the proposed method and compared against six handcrafted and one evolutionary computation-based image descriptor as well as three convolutional neural network (CNN) based methods. The experimental results show that the new method has significantly outperformed the competitor image descriptors and CNN-based methods. Furthermore, different patterns have been identified from analysing the evolved programs.
图像分类的性能高度依赖于用于构建模型的提取特征的质量。设计此类特征通常需要领域先验知识,并且通常由领域专家进行,而领域专家如果有的话,聘请成本非常高。自动化设计此类特征的过程可以在很大程度上降低与此任务相关的成本和工作量。图像描述符,如局部二值模式,已出现在计算机视觉中,旨在检测图像中的关键点,例如角点、线段和形状,并从这些关键点提取特征。在本文中,遗传编程(GP)用于通过利用多树程序表示,仅使用每个类的两个实例来自动演化图像描述符。自动演化的描述符直接对图像的原始像素值进行操作,并生成相应的特征向量。七个著名的数据集被调整为少样本设置,并用于评估所提出方法的性能,并与六个手工制作的和一个基于进化计算的图像描述符以及三种基于卷积神经网络(CNN)的方法进行比较。实验结果表明,新方法显著优于竞争的图像描述符和基于CNN的方法。此外,通过分析演化后的程序识别出了不同的模式。