Zahorik Pavel
Department of Otolaryngology and Communicative Disorders, Heuser Hearing Institute, University of Louisville, Louisville, KY, United States.
Department of Psychological and Brain Sciences, University of Louisville, Louisville, KY, United States.
Front Neurosci. 2022 Sep 1;16:958577. doi: 10.3389/fnins.2022.958577. eCollection 2022.
Visual capture describes the tendency of a sound to be mislocalized to the location of a plausible visual target. This effect, also known as the ventriloquist effect, has been extensively studied in humans, but primarily for mismatches in the angular direction between auditory and visual targets. Here, visual capture was examined in the distance dimension using a single visual target (an un-energized loudspeaker) and invisible virtual sound sources presented over headphones. The sound sources were synthesized from binaural impulse-response measurements at distances ranging from 1 to 5 m (0.25 m steps) in the semi-reverberant room (7.7 × 4.2 × 2.7 m) in which the experiment was conducted. Listeners ( = 11) were asked whether or not the auditory target appeared to be at the same distance as the visual target. Within a block of trials, the visual target was placed at a fixed distance of 1.5, 3, or 4.5 m, and the auditory target varied randomly from trial-to-trial over the sample of measurement distances. The resulting psychometric functions were generally consistent with visual capture in distance, but the capture was asymmetric: Sound sources behind the visual target were more strongly captured than sources in front of the visual target. This asymmetry is consistent with previous reports in the literature, and is shown here to be well predicted by a simple model of sensory integration and decision in which perceived auditory space is compressed logarithmically in distance and has lower resolution than perceived visual space.
视觉捕捉描述了一种声音被错误定位到一个看似合理的视觉目标位置的倾向。这种效应,也被称为腹语效应,已在人类中得到广泛研究,但主要是针对听觉和视觉目标之间角度方向的不匹配。在这里,使用单个视觉目标(一个未通电的扬声器)和通过耳机呈现的不可见虚拟声源,在距离维度上研究了视觉捕捉。声源是根据在进行实验的半混响室(7.7×4.2×2.7米)中距离从1到5米(步长0.25米)的双耳脉冲响应测量合成的。要求听众(n = 11)判断听觉目标是否看起来与视觉目标在同一距离。在一组试验中,视觉目标被放置在1.5、3或4.5米的固定距离处,听觉目标在每次试验中在测量距离样本上随机变化。由此产生的心理测量函数总体上与距离上的视觉捕捉一致,但捕捉是不对称的:视觉目标后面的声源比视觉目标前面的声源被更强地捕捉。这种不对称与文献中先前的报告一致,并且在这里通过一个简单的感觉整合和决策模型得到了很好的预测,在该模型中,感知到的听觉空间在距离上按对数方式压缩,并且比感知到的视觉空间具有更低的分辨率。