IEEE Trans Image Process. 2021;30:7228-7240. doi: 10.1109/TIP.2021.3104163. Epub 2021 Aug 20.
The rotation, scale and translation invariance of extracted features have a high significance in image recognition. Local binary pattern (LBP) and LBP-based descriptors have been widely used in image recognition due to feature discrimination and computational efficiency. However, most of the existing LBP-based descriptors have been designed to achieve rotation invariance while fail to achieve scale invariance. Moreover, it is usually difficult to achieve a good trade-off between the feature discrimination and the feature dimension. In this work, a learning 2D co-occurrence LBP termed 2D-LCoLBP is proposed to address these issues. Firstly, a weighted joint histogram is constructed in different neighborhoods and scales of an image to represent the multi-neighborhood and multi-scale LBP (2D-MLBP) and achieve the rotation invariance. A feature learning strategy is then designed to learn the compact and robust descriptor (2D-LCoLBP) from LBP pattern pairs across different scales in the extracted 2D-MLBP to characterize the most stable local structures and achieve the scale invariance, as well as decrease the feature dimension and improve the noise robustness. Finally, a linear SVM classifier is employed for recognition. We applied the proposed 2D-LCoLBP on four image recognition tasks-texture, object, face and food recognition with ten image databases. Experimental results show that 2D-LCoLBP has obviously low feature dimension but outperforms the state-of-the-art LBP-based descriptors in terms of recognition accuracy under noise-free, Gaussian noise and JPEG compression conditions.
提取特征的旋转、缩放和平移不变性在图像识别中具有重要意义。局部二值模式(LBP)及其基于 LBP 的描述符由于具有特征判别和计算效率高的特点,已被广泛应用于图像识别。然而,现有的大多数基于 LBP 的描述符都是为了实现旋转不变性而设计的,而不能实现尺度不变性。此外,在特征判别和特征维数之间通常很难取得良好的折衷。在这项工作中,提出了一种学习的二维共生局部二值模式,称为 2D-LCoLBP,以解决这些问题。首先,在图像的不同邻域和尺度上构建加权联合直方图来表示多邻域和多尺度 LBP(2D-MLBP),从而实现旋转不变性。然后,设计了一种特征学习策略,从提取的 2D-MLBP 中不同尺度的 LBP 模式对中学习紧凑和鲁棒的描述符(2D-LCoLBP),以描述最稳定的局部结构,实现尺度不变性,并降低特征维数,提高噪声鲁棒性。最后,采用线性 SVM 分类器进行识别。我们将所提出的 2D-LCoLBP 应用于纹理、目标、人脸和食物识别的四个图像识别任务,使用十个图像数据库进行实验。实验结果表明,2D-LCoLBP 的特征维度明显较低,但在无噪声、高斯噪声和 JPEG 压缩条件下,识别精度优于现有的基于 LBP 的描述符。