Alignment to visual speech information.

Author information

Miller Rachel M, Sanchez Kauyumari, Rosenblum Lawrence D

Affiliations

University of California, Riverside, California, USA.

Publication information

Atten Percept Psychophys. 2010 Aug;72(6):1614-25. doi: 10.3758/APP.72.6.1614.

Abstract

Speech alignment is the tendency for interlocutors to unconsciously imitate one another's speaking style. Alignment also occurs when a talker is asked to shadow recorded words (e.g., Shockley, Sabadini, & Fowler, 2004). In two experiments, we examined whether alignment could be induced with visual (lipread) speech and with auditory speech. In Experiment 1, we asked subjects to lipread and shadow out loud a model silently uttering words. The results indicate that shadowed utterances sounded more similar to the model's utterances than did subjects' nonshadowed read utterances. This suggests that speech alignment can be based on visual speech. In Experiment 2, we tested whether raters could perceive alignment across modalities. Raters were asked to judge the relative similarity between a model's visual (silent video) utterance and subjects' audio utterances. The subjects' shadowed utterances were again judged as more similar to the model's than were read utterances, suggesting that raters are sensitive to cross-modal similarity between aligned words.
