Zhao Tianle, Blu Thierry
IEEE Trans Image Process. 2020 May 1. doi: 10.1109/TIP.2020.2990483.
Computing the convolution between a 2D signal and a corresponding filter with variable orientations is a basic problem that arises in various tasks ranging from low level image processing (e.g. ridge/edge detection) to high level computer vision (e.g. pattern recognition). Through decades of research, there still lacks an efficient method for solving this problem. In this paper, we investigate this problem from the perspective of approximation by considering the following problem: what is the optimal basis for approximating all rotated versions of a given bivariate function? Surprisingly, solely minimising the L2-approximation-error leads to a rotation-covariant linear expansion, which we name Fourier-Argand representation. This representation presents two major advantages: 1) rotation-covariance of the basis, which implies a "strong steerability" - rotating by an angle α corresponds to multiplying each basis function by a complex scalar e-ikα; 2) optimality of the Fourier-Argand basis, which ensures a few number of basis functions suffice to accurately approximate complicated patterns and highly direction-selective filters. We show the relation between the Fourier-Argand representation and the Radon transform, leading to an efficient implementation of the decomposition for digital filters. We also show how to retrieve accurate orientation of local structures/patterns using a fast frequency estimation algorithm.
计算二维信号与具有可变方向的相应滤波器之间的卷积是一个基本问题,它出现在从低级图像处理(如脊线/边缘检测)到高级计算机视觉(如图像识别)等各种任务中。经过数十年的研究,仍然缺乏解决这个问题的有效方法。在本文中,我们通过考虑以下问题从近似的角度来研究这个问题:逼近给定二元函数的所有旋转版本的最优基是什么?令人惊讶的是,仅最小化L2逼近误差会导致一种旋转协变线性展开,我们将其命名为傅里叶 - 阿尔冈德表示。这种表示具有两个主要优点:1)基的旋转协变性,这意味着“强可控性”——旋转角度α相当于将每个基函数乘以复标量e-ikα;2)傅里叶 - 阿尔冈德基的最优性,这确保了少量的基函数就足以准确逼近复杂模式和高度方向选择性滤波器。我们展示了傅里叶 - 阿尔冈德表示与拉东变换之间的关系,从而实现了数字滤波器分解的高效实现。我们还展示了如何使用快速频率估计算法来检索局部结构/模式的准确方向。