Xu Ziyue, Saha Punam K, Dasgupta Soura
Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, United States.
Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, United States ; Department of Radiology, University of Iowa, Iowa City, IA 52242, United States.
Comput Vis Image Underst. 2012 Oct 1;116(10):1060-1075. doi: 10.1016/j.cviu.2012.05.006.
Scale is a widely used notion in computer vision and image understanding that evolved in the form of scale-space theory where the key idea is to represent and analyze an image at various resolutions. Recently, we introduced a notion of local morphometric scale referred to as "tensor scale" using an ellipsoidal model that yields a unified representation of structure size, orientation and anisotropy. In the previous work, tensor scale was described using a 2-D algorithmic approach and a precise analytic definition was missing. Also, the application of tensor scale in 3-D using the previous framework is not practical due to high computational complexity. In this paper, an analytic definition of tensor scale is formulated for -dimensional (-D) images that captures local structure size, orientation and anisotropy. Also, an efficient computational solution in 2- and 3-D using several novel differential geometric approaches is presented and the accuracy of results is experimentally examined. Also, a matrix representation of tensor scale is derived facilitating several operations including tensor field smoothing to capture larger contextual knowledge. Finally, the applications of tensor scale in image filtering and -linear interpolation are presented and the performance of their results is examined in comparison with respective state-of-art methods. Specifically, the performance of tensor scale based image filtering is compared with gradient and Weickert's structure tensor based diffusive filtering algorithms. Also, the performance of tensor scale based -linear interpolation is evaluated in comparison with standard -linear and windowed-sinc interpolation methods.
尺度是计算机视觉和图像理解中广泛使用的概念,它以尺度空间理论的形式发展而来,其核心思想是在不同分辨率下表示和分析图像。最近,我们引入了一种局部形态尺度的概念,称为“张量尺度”,使用椭球模型,该模型能对结构大小、方向和各向异性进行统一表示。在之前的工作中,张量尺度是用二维算法方法描述的,缺少精确的解析定义。此外,由于计算复杂度高,使用先前框架在三维中应用张量尺度并不实际。本文为n维(n-D)图像制定了张量尺度的解析定义,该定义捕捉了局部结构大小、方向和各向异性。此外,还提出了一种使用几种新颖微分几何方法在二维和三维中的高效计算解决方案,并通过实验检验了结果的准确性。此外,还推导了张量尺度的矩阵表示,便于进行包括张量场平滑在内的多种操作,以获取更大的上下文知识。最后,介绍了张量尺度在图像滤波和n线性插值中的应用,并与各自的现有方法相比,检验了其结果的性能。具体而言,将基于张量尺度的图像滤波性能与基于梯度和魏克特结构张量的扩散滤波算法进行了比较。此外,还将基于张量尺度的n线性插值性能与标准n线性和加窗 sinc 插值方法进行了评估。