School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, Hubei, China.
Hubei Luojia Laboratory, Wuhan, Hubei, China.
Cogn Res Princ Implic. 2024 Mar 26;9(1):17. doi: 10.1186/s41235-024-00541-1.
Previous work has demonstrated similarities and differences between aerial and terrestrial image viewing. Aerial scene categorization, a pivotal visual processing task for gathering geoinformation, heavily depends on rotation-invariant information. Aerial image-centered research has revealed effects of low-level features on performance of various aerial image interpretation tasks. However, there are fewer studies of viewing behavior for aerial scene categorization and of higher-level factors that might influence that categorization. In this paper, experienced subjects' eye movements were recorded while they were asked to categorize aerial scenes. A typical viewing center bias was observed. Eye movement patterns varied among categories. We explored the relationship of nine image statistics to observers' eye movements. Results showed that if the images were less homogeneous, and/or if they contained fewer or no salient diagnostic objects, viewing behavior became more exploratory. Higher- and object-level image statistics were predictive at both the image and scene category levels. Scanpaths were generally organized and small differences in scanpath randomness could be roughly captured by critical object saliency. Participants tended to fixate on critical objects. Image statistics included in this study showed rotational invariance. The results supported our hypothesis that the availability of diagnostic objects strongly influences eye movements in this task. In addition, this study provides supporting evidence for Loschky et al.'s (Journal of Vision, 15(6), 11, 2015) speculation that aerial scenes are categorized on the basis of image parts and individual objects. The findings were discussed in relation to theories of scene perception and their implications for automation development.
先前的研究已经证明了航空影像和地面影像在观察方面的相似性和差异性。航空场景分类是获取地理信息的关键视觉处理任务,它严重依赖于旋转不变信息。以航空图像为中心的研究揭示了低水平特征对各种航空图像解释任务性能的影响。然而,对于航空场景分类的观察行为以及可能影响这种分类的更高层次因素的研究较少。在本文中,当经验丰富的被试者被要求对航空场景进行分类时,记录了他们的眼动。观察到了典型的注视中心偏向。眼动模式在类别之间有所不同。我们探讨了 9 种图像统计数据与观察者眼动之间的关系。结果表明,如果图像的同质性较低,或者如果它们包含较少或没有明显的诊断对象,那么观察行为就会变得更加探索性。较高层次和对象层次的图像统计数据在图像和场景类别级别都具有预测性。扫视路径通常是有组织的,并且通过关键对象的显著度可以大致捕捉到扫视路径随机性的微小差异。参与者倾向于注视关键对象。本研究中包含的图像统计数据显示出旋转不变性。研究结果支持了我们的假设,即诊断对象的可用性强烈影响了该任务中的眼动。此外,本研究为 Loschky 等人(《视觉杂志》,15(6),11,2015)的推测提供了支持证据,即航空场景是基于图像部分和单个对象进行分类的。研究结果与场景感知理论及其对自动化发展的影响进行了讨论。