Einhäuser Wolfgang, Mundhenk T Nathan, Baldi Pierre, Koch Christof, Itti Laurent
Division of Biology, California Institute of Technology, Pasadena, CA, USA.
J Vis. 2007 Jul 20;7(10):6.1-13. doi: 10.1167/7.10.6.
Humans demonstrate a peculiar ability to detect complex targets in rapidly presented natural scenes. Recent studies suggest that (nearly) no focal attention is required for overall performance in such tasks. Little is known, however, of how detection performance varies from trial to trial and which stages in the processing hierarchy limit performance: bottom-up visual processing (attentional selection and/or recognition) or top-down factors (e.g., decision-making, memory, or alertness fluctuations)? To investigate the relative contribution of these factors, eight human observers performed an animal detection task in natural scenes presented at 20 Hz. Trial-by-trial performance was highly consistent across observers, far exceeding the prediction of independent errors. This consistency demonstrates that performance is not primarily limited by idiosyncratic factors but by visual processing. Two statistical stimulus properties, contrast variation in the target image and the information-theoretical measure of "surprise" in adjacent images, predict performance on a trial-by-trial basis. These measures are tightly related to spatial attention, demonstrating that spatial attention and rapid target detection share common mechanisms. To isolate the causal contribution of the surprise measure, eight additional observers performed the animal detection task in sequences that were reordered versions of those all subjects had correctly recognized in the first experiment. Reordering increased surprise before and/or after the target while keeping the target and distractors themselves unchanged. Surprise enhancement impaired target detection in all observers. Consequently, and contrary to several previously published findings, our results demonstrate that attentional limitations, rather than target recognition alone, affect the detection of targets in rapidly presented visual sequences.
人类展现出一种独特的能力,能够在快速呈现的自然场景中检测复杂目标。最近的研究表明,在这类任务中,总体表现(几乎)不需要集中注意力。然而,对于每次试验中检测性能如何变化,以及处理层次结构中的哪些阶段限制了性能,我们却知之甚少:是自下而上的视觉处理(注意力选择和/或识别),还是自上而下的因素(例如决策、记忆或警觉性波动)?为了研究这些因素的相对贡献,八名人类观察者在以20赫兹呈现的自然场景中执行了动物检测任务。观察者之间每次试验的表现高度一致,远远超出了独立错误的预测。这种一致性表明,性能并非主要受个体因素限制,而是受视觉处理限制。两种统计刺激属性,即目标图像中的对比度变化以及相邻图像中“惊奇”的信息理论度量,能够逐次预测性能。这些度量与空间注意力密切相关,表明空间注意力和快速目标检测共享共同机制。为了分离惊奇度量的因果贡献,另外八名观察者在序列中执行动物检测任务,这些序列是所有受试者在第一个实验中正确识别的序列的重新排序版本。重新排序增加了目标之前和/或之后的惊奇,同时保持目标和干扰物本身不变。惊奇增强损害了所有观察者的目标检测。因此,与之前发表的几项研究结果相反,我们的结果表明,注意力限制而非仅仅目标识别,会影响在快速呈现的视觉序列中对目标的检测。