Yuan Yichen, Van der Stoep Nathan, Gayet Surya
Department of Experimental Psychology, Helmholtz Institute, Utrecht University.
J Exp Psychol Gen. 2025 May;154(5):1351-1367. doi: 10.1037/xge0001725. Epub 2025 Jan 27.
Predicting the location of moving objects in noisy environments is essential to everyday behavior, like when participating in traffic. Although many objects provide multisensory information, it remains unknown how humans use multisensory information to localize moving objects, and how this depends on expected sensory interference (e.g., occlusion). In four experiments, we systematically investigated localization for auditory, visual, and audiovisual targets (AV). Performance for audiovisual targets was compared to performance predicted by maximum likelihood estimation (MLE). In Experiment 1A, moving targets were occluded by an audiovisual occluder, and their final locations had to be inferred from target speed and occlusion duration. Participants relied exclusively on the visual component of the audiovisual target, even though the auditory component demonstrably provided useful location information when presented in isolation. In contrast, when a visual-only occluder was used in Experiment 1B, participants relied exclusively on the auditory component of the audiovisual target, even though the visual component demonstrably provided useful location information when presented in isolation. In Experiment 2, although localization estimates were in line with MLE predictions, no multisensory precision benefits were found when participants localized moving audiovisual target. In Experiment 3, a substantial multisensory benefit was found when participants localized static audiovisual target, showing near-MLE integration. In sum, observers use both hearing and vision when localizing static objects, but use only unisensory input when localizing moving objects and predicting motion under occlusion. Moreover, observers can flexibly prioritize one sense over the other, in anticipation of modality-specific interference. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
预测嘈杂环境中移动物体的位置对于日常行为至关重要,比如参与交通时。尽管许多物体提供多感官信息,但人类如何利用多感官信息来定位移动物体,以及这如何依赖于预期的感官干扰(如遮挡),仍然未知。在四项实验中,我们系统地研究了听觉、视觉和视听目标(AV)的定位。将视听目标的表现与最大似然估计(MLE)预测的表现进行了比较。在实验1A中,移动物体被视听遮挡物遮挡,其最终位置必须根据目标速度和遮挡持续时间来推断。参与者完全依赖视听目标的视觉成分,尽管听觉成分单独呈现时明显提供了有用的位置信息。相比之下,在实验1B中使用仅视觉遮挡物时,参与者完全依赖视听目标的听觉成分,尽管视觉成分单独呈现时明显提供了有用的位置信息。在实验2中,尽管定位估计与MLE预测一致,但参与者定位移动视听目标时未发现多感官精度优势。在实验3中,参与者定位静态视听目标时发现了显著的多感官优势,显示出接近MLE的整合。总之,观察者在定位静态物体时使用听觉和视觉,但在定位移动物体和预测遮挡下的运动时仅使用单感官输入。此外,观察者可以根据特定模态的干扰,灵活地优先选择一种感官而非另一种。(PsycInfo数据库记录(c)2025美国心理学会,保留所有权利)