Ye Chengjia, McQueen James M, Bosker Hans Rutger
Donders Institute for Brain, Cognition and Behaviour, Radboud University, Thomas Van Aquinostraat 4, 6525 GD, Nijmegen, The Netherlands.
Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.
Atten Percept Psychophys. 2025 Apr 30. doi: 10.3758/s13414-025-03072-z.
Speech is often accompanied by gestures. Since beat gestures (simple nonreferential up-and-down hand movements) frequently co-occur with prosodic prominence, they can indicate stress in a word and hence influence spoken-word recognition. However, little is known about the reverse influence of auditory speech on visual perception. The current study investigated whether lexical stress has an effect on the perceived timing of hand beats. We used videos in which a disyllabic word, embedded in a carrier sentence (Experiment 1) or presented in isolation (Experiment 2), was coupled with an up-and-down hand beat, while varying the degree of asynchrony between word and beat. Results from Experiment 1, a novel beat timing estimation task, revealed that gestures were estimated to occur closer in time to the pitch peak in a stressed syllable than they actually did, reducing the perceived temporal distance between gestures and stress by around 60%. Using a forced-choice task, Experiment 2 further demonstrated that listeners tended to perceive a gesture falling midway between two syllables as occurring on the syllable that received the stronger cues to stress, and this auditory effect was greater when gestural timing was most ambiguous. Our findings suggest that f0 and intensity are the driving force behind the temporal attraction effect of stress on perceived gestural timing. This study provides new evidence for auditory influences on visual perception, supporting bidirectionality in audiovisual interaction between speech-related signals that occur in everyday face-to-face communication.