可见语音的感知：语速估计和时间反转检测。

The perception of visible speech: estimation of speech rate and detection of time reversals.

机构信息

Laboratory of Neuromotor Physiology, Santa Lucia Foundation, via Ardeatina, 306, 00179, Rome, Italy.

出版信息

Exp Brain Res. 2011 Nov;215(2):141-61. doi: 10.1007/s00221-011-2883-9. Epub 2011 Oct 11.

PMID:21986668

Abstract

Four experiments investigated the perception of visible speech. Experiment 1 addressed the perception of speech rate. Observers were shown video-clips of the lower face of actors speaking at their spontaneous rate. Then, they were shown muted versions of the video-clips, which were either accelerated or decelerated. The task (scaling) was to compare visually the speech rate of the stimulus to the spontaneous rate of the actor being shown. Rate estimates were accurate when the video-clips were shown in the normal direction (forward mode). In contrast, speech rate was underestimated when the video-clips were shown in reverse (backward mode). Experiments 2-4 (2AFC) investigated how accurately one discriminates forward and backward speech movements. Unlike in Experiment 1, observers were never exposed to the sound track of the video-clips. Performance was well above chance when playback mode was crossed with rate modulation, and the number of repetitions of the stimuli allowed some amount of speechreading to take place in forward mode (Experiment 2). In Experiment 3, speechreading was made much more difficult by using a different and larger set of muted video-clips. Yet, accuracy decreased only slightly with respect to Experiment 2. Thus, kinematic rather then speechreading cues are most important for discriminating movement direction. Performance worsened, but remained above chance level when the same stimuli of Experiment 3 were rotated upside down (Experiment 4). We argue that the results are in keeping with the hypothesis that visual perception taps into implicit motor competence. Thus, lawful instances of biological movements (forward stimuli) are processed differently from backward stimuli representing movements that the observer cannot perform.

摘要

四个实验研究了可见语音的感知。实验 1 探讨了语速感知。观察者观看演员自然语速说话的下半张脸的视频片段。然后，他们观看视频片段的静音版本，这些版本被加速或减速。任务（缩放）是通过视觉比较刺激的语速与正在展示的演员的自然语速。当视频片段以正常方向（正向模式）播放时，估计的语速是准确的。相比之下，当视频片段以反向（反向模式）播放时，语速会被低估。实验 2-4（2AFC）研究了人们如何准确地区分正向和反向语音运动。与实验 1 不同，观察者从未接触过视频片段的音轨。当回放模式与速率调制交叉时，表现明显优于机会水平，并且允许刺激的重复次数在正向模式下进行一定程度的语音阅读（实验 2）。在实验 3 中，通过使用不同且更大的静音视频片段集，使语音阅读变得更加困难。然而，与实验 2 相比，准确性仅略有下降。因此，对于区分运动方向，运动学线索而不是语音阅读线索更为重要。当实验 3 的相同刺激被上下颠倒（实验 4）时，性能会恶化，但仍保持在机会水平之上。我们认为，这些结果与假设相符，即视觉感知利用了隐含的运动能力。因此，正向刺激代表的合法生物运动实例与观察者无法执行的反向刺激所代表的运动不同。

相似文献

The perception of visible speech: estimation of speech rate and detection of time reversals.

Exp Brain Res. 2011 Nov;215(2):141-61. doi: 10.1007/s00221-011-2883-9. Epub 2011 Oct 11.

Audiovisual synchrony perception for music, speech, and object actions.

Brain Res. 2006 Sep 21;1111(1):134-42. doi: 10.1016/j.brainres.2006.05.078. Epub 2006 Jul 31.

Temporal recalibration during asynchronous audiovisual speech perception.

Exp Brain Res. 2007 Jul;181(1):173-81. doi: 10.1007/s00221-007-0918-z. Epub 2007 Mar 13.

Evaluating the influence of frame rate on the temporal aspects of audiovisual speech perception.

Neurosci Lett. 2006 Sep 11;405(1-2):132-6. doi: 10.1016/j.neulet.2006.06.041. Epub 2006 Jul 18.

Audiovisual temporal adaptation of speech: temporal order versus simultaneity judgments.

Exp Brain Res. 2008 Mar;185(3):521-9. doi: 10.1007/s00221-007-1168-9. Epub 2007 Oct 26.

How Are Audiovisual Simultaneity Judgments Affected by Multisensory Complexity and Speech Specificity?

Multisens Res. 2020 Jul 28;34(1):49-68. doi: 10.1163/22134808-bja10031.

The use of visible speech cues for improving auditory detection of spoken sentences.

J Acoust Soc Am. 2000 Sep;108(3 Pt 1):1197-208. doi: 10.1121/1.1288668.

Audiovisual synchrony perception for speech and music assessed using a temporal order judgment task.

Neurosci Lett. 2006 Jan 23;393(1):40-4. doi: 10.1016/j.neulet.2005.09.032. Epub 2005 Oct 6.

Investigating the effects of inversion on configural processing with an audiovisual temporal-order judgment task.

Perception. 2008;37(1):143-60. doi: 10.1068/p5648.

The change in perceptual synchrony between auditory and visual speech after exposure to asynchronous speech.

Neuroreport. 2011 Oct 5;22(14):684-8. doi: 10.1097/WNR.0b013e32834a2724.

引用本文的文献

Sensitivity of occipito-temporal cortex, premotor and Broca's areas to visible speech gestures in a familiar language.

PLoS One. 2020 Jun 19;15(6):e0234695. doi: 10.1371/journal.pone.0234695. eCollection 2020.

How long did it last? You would better ask a human.

Front Neurorobot. 2014 Jan 27;8:2. doi: 10.3389/fnbot.2014.00002. eCollection 2014.

Speech through ears and eyes: interfacing the senses with the supramodal brain.

Front Psychol. 2013 Jul 12;4:388. doi: 10.3389/fpsyg.2013.00388. eCollection 2013.

Detecting temporal reversals in human locomotion.

Exp Brain Res. 2011 Sep;214(1):93-103. doi: 10.1007/s00221-011-2809-6. Epub 2011 Aug 4.

本文引用的文献

Perceptual-cognitive universals as reflections of the world.

Psychon Bull Rev. 1994 Mar;1(1):2-28. doi: 10.3758/BF03200759.

Detecting temporal reversals in human locomotion.

Exp Brain Res. 2011 Sep;214(1):93-103. doi: 10.1007/s00221-011-2809-6. Epub 2011 Aug 4.

Experts see it all: configural effects in action observation.

Psychol Res. 2010 Jul;74(4):400-6. doi: 10.1007/s00426-009-0262-y. Epub 2009 Oct 25.

Lipreading, processing speed, and working memory in younger and older adults.

J Speech Lang Hear Res. 2009 Dec;52(6):1555-65. doi: 10.1044/1092-4388(2009/08-0137). Epub 2009 Aug 28.

Motor representations of articulators contribute to categorical perception of speech sounds.

J Neurosci. 2009 Aug 5;29(31):9819-25. doi: 10.1523/JNEUROSCI.6018-08.2009.

Perceptuomotor compatibility effects in speech.

Atten Percept Psychophys. 2009 Jul;71(5):1138-49. doi: 10.3758/APP.71.5.1138.

Acceleration carries the local inversion effect in biological motion perception.

J Vis. 2009 Jan 16;9(1):19.1-17. doi: 10.1167/9.1.19.

Obligatory Broca's area modulation associated with passive speech perception.

Neuroreport. 2009 Mar 25;20(5):492-6. doi: 10.1097/WNR.0b013e32832940a0.

Perception of animacy and direction from local biological motion signals.

J Vis. 2008 May 7;8(5):3.1-10. doi: 10.1167/8.5.3.

Motor speech perception modulates the cortical language areas.

Neuroimage. 2008 Jun;41(2):605-13. doi: 10.1016/j.neuroimage.2008.02.046. Epub 2008 Mar 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

可见语音的感知：语速估计和时间反转检测。

The perception of visible speech: estimation of speech rate and detection of time reversals.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献