Department of Psychology, University of Muenster, Muenster, Germany.
Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany.
PLoS One. 2021 Jul 22;16(7):e0253130. doi: 10.1371/journal.pone.0253130. eCollection 2021.
Auditory and visual percepts are integrated even when they are not perfectly temporally aligned with each other, especially when the visual signal precedes the auditory signal. This window of temporal integration for asynchronous audiovisual stimuli is relatively well examined in the case of speech, while other natural action-induced sounds have been widely neglected. Here, we studied the detection of audiovisual asynchrony in three different whole-body actions with natural action-induced sounds-hurdling, tap dancing and drumming. In Study 1, we examined whether audiovisual asynchrony detection, assessed by a simultaneity judgment task, differs as a function of sound production intentionality. Based on previous findings, we expected that auditory and visual signals should be integrated over a wider temporal window for actions creating sounds intentionally (tap dancing), compared to actions creating sounds incidentally (hurdling). While percentages of perceived synchrony differed in the expected way, we identified two further factors, namely high event density and low rhythmicity, to induce higher synchrony ratings as well. Therefore, we systematically varied event density and rhythmicity in Study 2, this time using drumming stimuli to exert full control over these variables, and the same simultaneity judgment tasks. Results suggest that high event density leads to a bias to integrate rather than segregate auditory and visual signals, even at relatively large asynchronies. Rhythmicity had a similar, albeit weaker effect, when event density was low. Our findings demonstrate that shorter asynchronies and visual-first asynchronies lead to higher synchrony ratings of whole-body action, pointing to clear parallels with audiovisual integration in speech perception. Overconfidence in the naturally expected, that is, synchrony of sound and sight, was stronger for intentional (vs. incidental) sound production and for movements with high (vs. low) rhythmicity, presumably because both encourage predictive processes. In contrast, high event density appears to increase synchronicity judgments simply because it makes the detection of audiovisual asynchrony more difficult. More studies using real-life audiovisual stimuli with varying event densities and rhythmicities are needed to fully uncover the general mechanisms of audiovisual integration.
听觉和视觉感知即使在彼此不完全同步的情况下也会被整合,特别是当视觉信号先于听觉信号出现时。在语音的情况下,这种异步视听刺激的时间整合窗口已经得到了很好的研究,而其他自然动作引发的声音则被广泛忽视。在这里,我们研究了三种不同的全身动作(跨越障碍物、踢踏舞和击鼓)与自然动作引发的声音一起的视听异步检测。在研究 1 中,我们考察了视听异步检测是否会根据声音产生的意图而有所不同,评估方法是通过同时性判断任务。基于先前的研究结果,我们预期与产生声音的动作(踢踏舞)相比,产生声音的动作(跨越障碍物)的听觉和视觉信号应在更宽的时间窗口内被整合。虽然感知同步的百分比以预期的方式有所不同,但我们还确定了另外两个因素,即高事件密度和低节奏性,它们也会导致更高的同步率评分。因此,在研究 2 中,我们系统地改变了事件密度和节奏性,这次使用击鼓刺激来对这些变量进行全面控制,并使用相同的同时性判断任务。结果表明,即使在较大的异步情况下,高事件密度也会导致对听觉和视觉信号的整合而不是分离的偏向。当事件密度较低时,节奏性也会产生类似的但较弱的影响。我们的研究结果表明,较短的异步和视觉优先异步会导致更高的全身动作同步率评分,这与语音感知中的视听整合有明显的相似之处。对声音和视觉自然预期的过度自信,即声音和视觉的同步性,对于有意(与偶然)的声音产生更强,对于高(与低)节奏性的运动更强,这可能是因为两者都鼓励预测过程。相比之下,高事件密度似乎只是因为它增加了视听异步检测的难度而增加了同步性判断。需要进行更多使用具有不同事件密度和节奏性的真实视听刺激的研究,以充分揭示视听整合的一般机制。