University of Hamburg, Germany.
J Cogn Neurosci. 2011 Aug;23(8):1845-54. doi: 10.1162/jocn.2010.21462. Epub 2010 Mar 4.
During face-to-face communication, one does not only hear speech but also see a speaker's communicative hand movements. It has been shown that such hand gestures play an important role in communication where the two modalities influence each other's interpretation. A gesture typically temporally overlaps with coexpressive speech, but the gesture is often initiated before (but not after) the coexpressive speech. The present ERP study investigated what degree of asynchrony in the speech and gesture onsets are optimal for semantic integration of the concurrent gesture and speech. Videos of a person gesturing were combined with speech segments that were either semantically congruent or incongruent with the gesture. Although gesture and speech always overlapped in time, gesture and speech were presented with three different degrees of asynchrony. In the SOA 0 condition, the gesture onset and the speech onset were simultaneous. In the SOA 160 and 360 conditions, speech was delayed by 160 and 360 msec, respectively. ERPs time locked to speech onset showed a significant difference between semantically congruent versus incongruent gesture-speech combinations on the N400 for the SOA 0 and 160 conditions. No significant difference was found for the SOA 360 condition. These results imply that speech and gesture are integrated most efficiently when the differences in onsets do not exceed a certain time span because of the fact that iconic gestures need speech to be disambiguated in a way relevant to the speech context.
在面对面交流中,人们不仅听到言语,还看到说话者的交际手势。已经表明,这些手势在交流中起着重要作用,两种模态相互影响彼此的解释。手势通常与共现的言语在时间上重叠,但手势通常在共现的言语之前(但不在之后)开始。本研究通过 ERP 技术来探讨在言语和手势开始时存在多大程度的异步性,才能实现手势和言语的最佳语义整合。将一个人打手势的视频与语义上与手势一致或不一致的语音片段相结合。尽管手势和言语总是在时间上重叠,但手势和言语以三种不同的异步性呈现。在 SOA0 条件下,手势和语音同时开始。在 SOA160 和 360 条件下,语音分别延迟了 160 和 360 毫秒。ERP 时间锁定在语音开始时,对于 SOA0 和 160 条件,语义一致与不一致的手势-语音组合在 N400 上显示出显著差异。对于 SOA360 条件,则未发现显著差异。这些结果表明,当开始时间的差异不超过一定时间范围时,语音和手势的整合效率最高,因为手势需要言语以与言语上下文相关的方式进行消歧。