IEEE Trans Image Process. 2017 Nov;26(11):5519-5530. doi: 10.1109/TIP.2017.2738560. Epub 2017 Aug 10.
This paper develops a novel method to address the structural noise in samples for image classification. Recently, regression-related classification methods have shown promising results when facing the pixelwise noise. However, they become weak in coping with the structural noise due to ignoring of relationships between pixels of noise image. Meanwhile, most of them need to implement the iterative process for computing representation coefficients, which leads to the high time consumption. To overcome these problems, we exploit a latent pattern model called low-rank latent pattern approximation (LLPA) to reconstruct the test image having structural noise. The rank function is applied to characterize the structure of the reconstruction residual between test image and the corresponding latent pattern. Simultaneously, the error between the latent pattern and the reference image is constrained by Frobenius norm to prevent overfitting. LLPA involves a closed-form solution by the virtue of a singular value thresholding operator. The provided theoretic analysis demonstrates that LLPA indeed removes the structural noise during classification task. Additionally, LLPA is further extended to the form of matrix regression by connecting multiple training samples, and alternating direction of multipliers method with Gaussian back substitution algorithm is used to solve the extended LLPA. Experimental results on several popular data sets validate that the proposed methods are more robust to image classification with occlusion and illumination changes, as compared to some existing state-of-the-art reconstruction-based methods and one deep neural network-based method.
本文提出了一种新的方法来解决图像分类中样本的结构噪声问题。最近,回归相关的分类方法在处理像素噪声方面表现出了很有前景的结果。然而,由于忽略了噪声图像像素之间的关系,它们在处理结构噪声方面变得很弱。同时,它们中的大多数需要执行迭代过程来计算表示系数,这导致了高的时间消耗。为了克服这些问题,我们利用一种称为低秩潜在模式逼近(LLPA)的潜在模式模型来重建具有结构噪声的测试图像。秩函数用于描述测试图像和相应潜在模式之间的重建残差的结构。同时,通过 Frobenius 范数约束潜在模式和参考图像之间的误差,以防止过拟合。LLPA 通过奇异值阈值算子的闭式解来实现。提供的理论分析表明,LLPA 确实可以在分类任务中去除结构噪声。此外,通过连接多个训练样本,将 LLPA 进一步扩展到矩阵回归的形式,并使用交替方向乘子法和高斯后向替换算法来求解扩展的 LLPA。在几个流行的数据集上的实验结果验证了与一些现有的基于重建的方法和一种基于深度神经网络的方法相比,所提出的方法在具有遮挡和光照变化的图像分类中更具鲁棒性。