Center for Automation Research, University of Maryland, College Park, MD 20742-3275, USA.
IEEE Trans Image Process. 2002;11(11):1209-26. doi: 10.1109/TIP.2002.800896.
We propose an approach to accurately detecting two-dimensional (2-D) shapes. The cross section of the shape boundary is modeled as a step function. We first derive a one-dimensional (1-D) optimal step edge operator, which minimizes both the noise power and the mean squared error between the input and the filter output. This operator is found to be the derivative of the double exponential (DODE) function, originally derived by Ben-Arie and Rao. We define an operator for shape detection by extending the DODE filter along the shape's boundary contour. The responses are accumulated at the centroid of the operator to estimate the likelihood of the presence of the given shape. This method of detecting a shape is in fact a natural extension of the task of edge detection at the pixel level to the problem of global contour detection. This simple filtering scheme also provides a tool for a systematic analysis of edge-based shape detection. We investigate how the error is propagated by the shape geometry. We have found that, under general assumptions, the operator is locally linear at the peak of the response. We compute the expected shape of the response and derive some of its statistical properties. This enables us to predict both its localization and detection performance and adjust its parameters according to imaging conditions and given performance specifications. Applications to the problem of vehicle detection in aerial images, human facial feature detection, and contour tracking in video are presented.
我们提出了一种准确检测二维(2-D)形状的方法。形状边界的横截面被建模为阶跃函数。我们首先推导出一维(1-D)最优阶跃边缘算子,该算子最小化输入和滤波器输出之间的噪声功率和均方误差。该算子被发现是双指数(DODE)函数的导数,最初由 Ben-Arie 和 Rao 推导得出。我们通过沿形状边界轮廓扩展 DODE 滤波器来定义用于形状检测的算子。响应在算子的质心处累积,以估计给定形状存在的可能性。这种检测形状的方法实际上是将像素级边缘检测任务扩展到全局轮廓检测问题的自然延伸。这种简单的滤波方案还为基于边缘的形状检测提供了一种系统分析的工具。我们研究了误差如何通过形状几何传播。我们发现,在一般假设下,算子在响应的峰值处是局部线性的。我们计算响应的预期形状,并推导其一些统计特性。这使我们能够预测其定位和检测性能,并根据成像条件和给定的性能规范调整其参数。将该方法应用于航空图像中的车辆检测、人脸特征检测和视频中的轮廓跟踪问题。