IEEE Trans Image Process. 2019 Nov;28(11):5296-5307. doi: 10.1109/TIP.2019.2913549. Epub 2019 May 2.
Artificial visual attention has been an active research area for over two decades. Especially, the concept of saliency has been implemented in many different ways. Early approaches aimed at closely modeling saliency processing with concepts from biological attention to provide (at least in the long run) general-purpose attention for technical systems. More recent approaches have departed from this agenda, turning to more specific attention-guided tasks, most notably the accurate extraction of salient objects, for which large-scale ground truth datasets make it possible to quantify progress. While the first type of models is troubled by weak performance in these specific tasks, the second type, as we show with a new benchmark, has lost the ability to predict saliency in the original sense, which may be an important factor for future general-purpose attention systems. Here, we describe a new approach using growing neural gas to obtain pre-attentional structures for a scene at an early processing stage. On this basis, traditional saliency concepts can be applied while at the same time they can be linked to mechanisms that make models successful in salient object detection. The model shows high performance at predicting traditional saliency and makes substantial progress toward salient object detection, although it cannot reach the top-level performance of some specialized methods. We discuss the important implications of our findings.
人工视觉注意已经是一个活跃的研究领域超过二十年了。特别是,显着性的概念已经以许多不同的方式实现。早期的方法旨在通过生物注意力的概念紧密模拟显着性处理,为技术系统提供(至少从长远来看)通用注意力。最近的方法已经偏离了这个议程,转向更具体的注意力引导任务,最显著的是突出对象的准确提取,对于这些任务,大规模的地面实况数据集使得量化进展成为可能。虽然第一种类型的模型在这些特定任务中的表现不佳,但正如我们在新基准中所示,第二种类型已经失去了预测原始意义上的显着性的能力,这可能是未来通用注意力系统的一个重要因素。在这里,我们描述了一种新的方法,使用生长神经网络气体在早期处理阶段获得场景的预注意结构。在此基础上,可以应用传统的显着性概念,同时可以将它们与使模型在显着性目标检测中成功的机制联系起来。该模型在预测传统显着性方面表现出很高的性能,并在显着性目标检测方面取得了实质性的进展,尽管它无法达到一些专门方法的顶级性能。我们讨论了我们发现的重要意义。