Zhang Jun, Liu Yamei, Zhang Shengping, Poppe Ronald, Wang Meng
IEEE Trans Image Process. 2020 Feb 5. doi: 10.1109/TIP.2020.2970529.
Light field imaging presents an attractive alternative to RGB imaging because of the recording of the direction of the incoming light. The detection of salient regions in a light field image benefits from the additional modeling of angular patterns. For RGB imaging, methods using CNNs have achieved excellent results on a range of tasks, including saliency detection. However, it is not trivial to use CNN-based methods for saliency detection on light field images because these methods are not specifically designed for processing light field inputs. In addition, current light field datasets are not sufficiently large to train CNNs. To overcome these issues, we present a new Lytro Illum dataset, which contains 640 light fields and their corresponding ground-truth saliency maps. Compared to current publicly available light field saliency datasets [1], [2], our new dataset is larger, of higher quality, contains more variation and more types of light field inputs. This makes our dataset suitable for training deeper networks and benchmarking. Furthermore, we propose a novel end-to-end CNN-based framework for light field saliency detection. Specifically, we propose three novel MAC (Model Angular Changes) blocks to process light field micro-lens images. We systematically study the impact of different architecture variants and compare light field saliency with regular 2D saliency. Our extensive comparisons indicate that our novel network significantly outperforms state-of-the-art methods on the proposed dataset and has desired generalization abilities on other existing datasets.
由于能够记录入射光的方向,光场成像为RGB成像提供了一种有吸引力的替代方案。光场图像中显著区域的检测受益于角度模式的额外建模。对于RGB成像,使用卷积神经网络(CNN)的方法在包括显著性检测在内的一系列任务上取得了优异的成果。然而,将基于CNN的方法用于光场图像的显著性检测并非易事,因为这些方法并非专门为处理光场输入而设计。此外,当前的光场数据集规模不足够大,无法训练CNN。为了克服这些问题,我们提出了一个新的Lytro Illum数据集,它包含640个光场及其相应的地面真值显著性图。与当前公开可用的光场显著性数据集[1][2]相比,我们的新数据集更大、质量更高,包含更多变化和更多类型的光场输入。这使得我们的数据集适合训练更深的网络并进行基准测试。此外,我们提出了一种新颖的基于端到端CNN的光场显著性检测框架。具体来说,我们提出了三个新颖的MAC(Model Angular Changes,模型角度变化)块来处理光场微透镜图像。我们系统地研究了不同架构变体的影响,并将光场显著性与常规二维显著性进行了比较。我们广泛的比较表明,我们的新颖网络在所提出的数据集上显著优于现有方法,并在其他现有数据集上具有理想的泛化能力。