Department of Computer Science and Engineeringand Centre for Vision Research (CVR), York University, CSB 1003, 4700 Keele Street, Toronto, ON M3J 1P3, Canada.
IEEE Trans Pattern Anal Mach Intell. 2010 Jul;32(7):1310-6. doi: 10.1109/TPAMI.2010.64.
A theoretical investigation of the frequency structure of multiplicative image motion signals is presented, e.g., as associated with translucency phenomena. Previous work has claimed that the multiplicative composition of visual signals generally results in the annihilation of oriented structure in the spectral domain. As a result, research has focused on multiplicative signals in highly specialized scenarios where highly structured spectral signatures are prevalent, or introduced a nonlinearity to transform the multiplicative image signal to an additive one. In contrast, in this paper, it is shown that oriented structure is present in multiplicative cases when natural domain constraints are taken into account. This analysis suggests that the various instances of naturally occurring multiple motion structures can be treated in a unified manner. As an example application of the developed theory, a multiple motion estimator previously proposed for translation, additive transparency, and occlusion is adapted to multiplicative image motions. This estimator is shown to yield superior performance over the alternative practice of introducing a nonlinear preprocessing step.
本文对乘性图像运动信号的频率结构进行了理论研究,例如与半透明现象相关的信号。先前的工作声称,视觉信号的乘性组合通常会导致谱域中定向结构的消失。因此,研究主要集中在谱特征高度结构化的高度专业化场景中的乘性信号上,或者引入非线性将乘性图像信号转换为加性信号。相比之下,在本文中,当考虑到自然域约束时,表明乘性情况下存在定向结构。这种分析表明,可以以统一的方式处理各种自然出现的多重运动结构实例。作为所提出理论的一个示例应用,针对平移、加性透明度和遮挡提出的先前的多运动估计器被适用于乘性图像运动。结果表明,该估计器在引入非线性预处理步骤的替代方法中表现出更好的性能。