Liu Li, Chen Jie, Zhao Guoying, Fieguth Paul, Chen Xilin, Pietikainen Matti
IEEE Trans Image Process. 2019 Aug;28(8):3910-3922. doi: 10.1109/TIP.2019.2903300. Epub 2019 Mar 8.
Research in texture recognition often concentrates on recognizing textures with intraclass variations, such as illumination, rotation, viewpoint, and small-scale changes. In contrast, in real-world applications, a change in scale can have a dramatic impact on texture appearance to the point of changing completely from one texture category to another. As a result, texture variations due to changes in scale are among the hardest to handle. In this paper, we conduct the first study of classifying textures with extreme variations in scale. To address this issue, we first propose and then reduce scale proposals on the basis of dominant texture patterns. Motivated by the challenges posed by this problem, we propose a new GANet network where we use a genetic algorithm to change the filters in the hidden layers during network training in order to promote the learning of more informative semantic texture patterns. Finally, we adopt a Fisher vector pooling of a convolutional neural network filter bank feature encoder for global texture representation. Because extreme scale variations are not necessarily present in most standard texture databases, to support the proposed extreme-scale aspects of texture understanding, we are developing a new dataset, the extreme scale variation textures (ESVaT), to test the performance of our framework. It is demonstrated that the proposed framework significantly outperforms the gold-standard texture features by more than 10% on ESVaT. We also test the performance of our proposed approach on the KTHTIPS2b and OS datasets and a further dataset synthetically derived from Forrest, showing the superior performance compared with the state-of-the-art.
纹理识别研究通常集中于识别具有类内变化的纹理,如光照、旋转、视角和小尺度变化。相比之下,在实际应用中,尺度变化会对纹理外观产生巨大影响,甚至可能导致纹理类别完全改变。因此,因尺度变化而产生的纹理差异是最难处理的。在本文中,我们首次对具有极端尺度变化的纹理分类进行了研究。为解决这一问题,我们首先提出并基于主导纹理模式减少尺度提议。受此问题所带来挑战的启发,我们提出了一种新的GANet网络,在网络训练期间使用遗传算法来改变隐藏层中的滤波器,以促进对更具信息性的语义纹理模式的学习。最后,我们采用卷积神经网络滤波器组特征编码器的Fisher向量池化来进行全局纹理表示。由于大多数标准纹理数据库中不一定存在极端尺度变化,为支持所提出的纹理理解的极端尺度方面,我们正在开发一个新的数据集,即极端尺度变化纹理(ESVaT),以测试我们框架的性能。结果表明,所提出的框架在ESVaT上比黄金标准纹理特征显著高出10%以上。我们还在KTHTIPS2b和OS数据集以及一个综合源自Forrest的进一步数据集上测试了我们提出方法的性能,显示出与现有技术相比的卓越性能。