The perception of visible speech: estimation of speech rate and detection of time reversals.

Affiliation

Laboratory of Neuromotor Physiology, Santa Lucia Foundation, via Ardeatina, 306, 00179, Rome, Italy.

Publication information

Exp Brain Res. 2011 Nov;215(2):141-61. doi: 10.1007/s00221-011-2883-9. Epub 2011 Oct 11.

Abstract

Four experiments investigated the perception of visible speech. Experiment 1 addressed the perception of speech rate. Observers were shown video-clips of the lower face of actors speaking at their spontaneous rate. Then, they were shown muted versions of the video-clips, which were either accelerated or decelerated. The task (scaling) was to compare visually the speech rate of the stimulus to the spontaneous rate of the actor being shown. Rate estimates were accurate when the video-clips were shown in the normal direction (forward mode). In contrast, speech rate was underestimated when the video-clips were shown in reverse (backward mode). Experiments 2-4 (2AFC) investigated how accurately one discriminates forward and backward speech movements. Unlike in Experiment 1, observers were never exposed to the sound track of the video-clips. Performance was well above chance when playback mode was crossed with rate modulation, and the number of repetitions of the stimuli allowed some amount of speechreading to take place in forward mode (Experiment 2). In Experiment 3, speechreading was made much more difficult by using a different and larger set of muted video-clips. Yet, accuracy decreased only slightly with respect to Experiment 2. Thus, kinematic rather than speechreading cues are most important for discriminating movement direction. Performance worsened but remained above chance level when the stimuli of Experiment 3 were rotated upside down (Experiment 4). We argue that the results are in keeping with the hypothesis that visual perception taps into implicit motor competence. Thus, lawful instances of biological movements (forward stimuli) are processed differently from backward stimuli representing movements that the observer cannot perform.

