2276 个英语单词的口面部信息量常模。

Mouth and facial informativeness norms for 2276 English words.

机构信息

Department of Experimental Psychology, University College London, 26 Bedford Way, London, WC1H, 0AP, UK.

出版信息

Behav Res Methods. 2024 Aug;56(5):4786-4801. doi: 10.3758/s13428-023-02216-z. Epub 2023 Aug 21.

DOI:10.3758/s13428-023-02216-z

PMID:37604959

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11289175/

Abstract

Mouth and facial movements are part and parcel of face-to-face communication. The primary way of assessing their role in speech perception has been by manipulating their presence (e.g., by blurring the area of a speaker's lips) or by looking at how informative different mouth patterns are for the corresponding phonemes (or visemes; e.g., /b/ is visually more salient than /g/). However, moving beyond informativeness of single phonemes is challenging due to coarticulation and language variations (to name just a few factors). Here, we present mouth and facial informativeness (MaFI) for words, i.e., how visually informative words are based on their corresponding mouth and facial movements. MaFI was quantified for 2276 English words, varying in length, frequency, and age of acquisition, using phonological distance between a word and participants' speechreading guesses. The results showed that MaFI norms capture well the dynamic nature of mouth and facial movements per word, with words containing phonemes with roundness and frontness features, as well as visemes characterized by lower lip tuck, lip rounding, and lip closure being visually more informative. We also showed that the more of these features there are in a word, the more informative it is based on mouth and facial movements. Finally, we demonstrated that the MaFI norms generalize across different variants of English language. The norms are freely accessible via Open Science Framework ( https://osf.io/mna8j/ ) and can benefit any language researcher using audiovisual stimuli (e.g., to control for the effect of speech-linked mouth and facial movements).

摘要

口面部运动是面对面交流的重要组成部分。评估它们在言语感知中的作用的主要方法是通过操纵它们的存在（例如，模糊说话者嘴唇的区域），或者通过观察不同口型模式对相应音素（或视位；例如，/b/ 在视觉上比/g/更突出）的信息含量。然而，由于协同发音和语言变化（仅举几例），超越单个音素的信息含量具有挑战性。在这里，我们提出了单词的口面部信息量（MaFI），即基于相应的口部和面部运动，单词在视觉上的信息量。使用单词与参与者的语音猜测之间的语音距离，对 2276 个英语单词的 MaFI 进行了量化，这些单词的长度、频率和习得年龄各不相同。结果表明，MaFI 规范很好地捕捉了每个单词的口部和面部运动的动态性质，其中包含圆唇和前元音特征的音素以及以下唇回缩、唇圆化和唇闭为特征的视位的单词在视觉上更具信息量。我们还表明，一个单词中包含的这些特征越多，基于口部和面部运动的信息量就越大。最后，我们证明了 MaFI 规范在不同英语变体中具有通用性。这些规范可通过开放科学框架（https://osf.io/mna8j/）免费获取，任何使用视听刺激的语言研究人员都可以从中受益（例如，控制与言语相关的口部和面部运动的影响）。

相似文献

Mouth and facial informativeness norms for 2276 English words.

Behav Res Methods. 2024 Aug;56(5):4786-4801. doi: 10.3758/s13428-023-02216-z. Epub 2023 Aug 21.

The role of iconic gestures and mouth movements in face-to-face communication.

Psychon Bull Rev. 2022 Apr;29(2):600-612. doi: 10.3758/s13423-021-02009-5. Epub 2021 Oct 20.

Automatic audiovisual integration in speech perception.

Exp Brain Res. 2005 Nov;167(1):66-75. doi: 10.1007/s00221-005-0008-z. Epub 2005 Oct 29.

Lip-Reading Enables the Brain to Synthesize Auditory Features of Unknown Silent Speech.

J Neurosci. 2020 Jan 29;40(5):1053-1065. doi: 10.1523/JNEUROSCI.1101-19.2019. Epub 2019 Dec 30.

Read my lips: asymmetries in the visual expression and perception of speech revealed through the McGurk effect.

Psychol Sci. 2004 Feb;15(2):138-41. doi: 10.1111/j.0963-7214.2004.01502011.x.

Increasing audiovisual speech integration in autism through enhanced attention to mouth.

Dev Sci. 2023 Jul;26(4):e13348. doi: 10.1111/desc.13348. Epub 2022 Dec 1.

Listening to talking faces: motor cortical activation during speech perception.

Neuroimage. 2005 Mar;25(1):76-89. doi: 10.1016/j.neuroimage.2004.11.006. Epub 2005 Jan 8.

Does dynamic information about the speaker's face contribute to semantic speech processing? ERP evidence.

Cortex. 2018 Jul;104:12-25. doi: 10.1016/j.cortex.2018.03.031. Epub 2018 Apr 9.

[Development and evaluation of a deep learning algorithm for German word recognition from lip movements].

HNO. 2022 Jun;70(6):456-465. doi: 10.1007/s00106-021-01143-9. Epub 2022 Jan 13.

Lip movements affect infants' audiovisual speech perception.

Psychol Sci. 2013 May;24(5):603-12. doi: 10.1177/0956797612458802. Epub 2013 Mar 28.

引用本文的文献

The role of multimodal cues in second language comprehension.

Sci Rep. 2023 Nov 27;13(1):20824. doi: 10.1038/s41598-023-47643-2.

本文引用的文献

More than words: word predictability, prosody, gesture and mouth movements in natural language comprehension.

Proc Biol Sci. 2021 Jul 28;288(1955):20210500. doi: 10.1098/rspb.2021.0500. Epub 2021 Jul 21.

Degree of Language Experience Modulates Visual Attention to Visible Speech and Iconic Gestures During Clear and Degraded Speech Comprehension.

Cogn Sci. 2019 Oct;43(10):e12789. doi: 10.1111/cogs.12789.

Native language status of the listener modulates the neural integration of speech and iconic gestures in clear and adverse listening conditions.

Brain Lang. 2018 Feb-Mar;177-178:7-17. doi: 10.1016/j.bandl.2018.01.003.

Visual Cues Contribute Differentially to Audiovisual Perception of Consonants and Vowels in Improving Recognition and Reducing Cognitive Demands in Listeners With Hearing Impairment Using Hearing Aids.

J Speech Lang Hear Res. 2017 Sep 18;60(9):2687-2703. doi: 10.1044/2016_JSLHR-H-16-0160.

Visual Context Enhanced: The Joint Contribution of Iconic Gestures and Visible Speech to Degraded Speech Comprehension.

J Speech Lang Hear Res. 2017 Jan 1;60(1):212-222. doi: 10.1044/2016_JSLHR-H-16-0101.

Prediction and constraint in audiovisual speech perception.

Cortex. 2015 Jul;68:169-81. doi: 10.1016/j.cortex.2015.03.006. Epub 2015 Mar 20.

Concreteness ratings for 40 thousand generally known English word lemmas.

Behav Res Methods. 2014 Sep;46(3):904-11. doi: 10.3758/s13428-013-0403-5.

Age-of-acquisition ratings for 30,000 English words.

Behav Res Methods. 2012 Dec;44(4):978-90. doi: 10.3758/s13428-012-0210-4.

Effects of lips and hands on auditory learning of second-language speech sounds.

J Speech Lang Hear Res. 2010 Apr;53(2):298-310. doi: 10.1044/1092-4388(2009/08-0243). Epub 2010 Mar 10.

Spoken word recognition by eye.

Scand J Psychol. 2009 Oct;50(5):419-25. doi: 10.1111/j.1467-9450.2009.00751.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

2276 个英语单词的口面部信息量常模。

Mouth and facial informativeness norms for 2276 English words.

机构信息

Department of Experimental Psychology, University College London, 26 Bedford Way, London, WC1H, 0AP, UK.

出版信息

Behav Res Methods. 2024 Aug;56(5):4786-4801. doi: 10.3758/s13428-023-02216-z. Epub 2023 Aug 21.

DOI:10.3758/s13428-023-02216-z

PMID:37604959

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11289175/

Abstract

摘要

2276 个英语单词的口面部信息量常模。

Mouth and facial informativeness norms for 2276 English words.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

2276 个英语单词的口面部信息量常模。

Mouth and facial informativeness norms for 2276 English words.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献