• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

猴子和人类对面部/声音整合有着共同的计算。

Monkeys and humans share a common computation for face/voice integration.

机构信息

Neuroscience Institute, Princeton University, Princeton, NJ, USA.

出版信息

PLoS Comput Biol. 2011 Sep;7(9):e1002165. doi: 10.1371/journal.pcbi.1002165. Epub 2011 Sep 29.

DOI:10.1371/journal.pcbi.1002165
PMID:21998576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3182859/
Abstract

Speech production involves the movement of the mouth and other regions of the face resulting in visual motion cues. These visual cues enhance intelligibility and detection of auditory speech. As such, face-to-face speech is fundamentally a multisensory phenomenon. If speech is fundamentally multisensory, it should be reflected in the evolution of vocal communication: similar behavioral effects should be observed in other primates. Old World monkeys share with humans vocal production biomechanics and communicate face-to-face with vocalizations. It is unknown, however, if they, too, combine faces and voices to enhance their perception of vocalizations. We show that they do: monkeys combine faces and voices in noisy environments to enhance their detection of vocalizations. Their behavior parallels that of humans performing an identical task. We explored what common computational mechanism(s) could explain the pattern of results we observed across species. Standard explanations or models such as the principle of inverse effectiveness and a "race" model failed to account for their behavior patterns. Conversely, a "superposition model", positing the linear summation of activity patterns in response to visual and auditory components of vocalizations, served as a straightforward but powerful explanatory mechanism for the observed behaviors in both species. As such, it represents a putative homologous mechanism for integrating faces and voices across primates.

摘要

言语产生涉及嘴和面部其他区域的运动,从而产生视觉运动线索。这些视觉线索提高了听觉言语的可理解性和检测能力。因此,面对面的言语本质上是一种多感觉现象。如果言语本质上是多感觉的,那么它应该反映在声音通讯的进化中:在其他灵长类动物中应该观察到类似的行为效应。旧大陆猴与人类共享发声生物力学,并通过发声进行面对面交流。然而,它们是否也将面部和声音结合起来以增强对声音的感知,尚不清楚。我们表明它们确实如此:猴子在嘈杂的环境中将面部和声音结合起来,以提高对声音的检测能力。它们的行为与人类执行相同任务的行为相似。我们探讨了什么样的共同计算机制可以解释我们在不同物种中观察到的结果模式。标准的解释或模型,如逆有效性原则和“竞赛”模型,无法解释它们的行为模式。相反,“叠加模型”假设活动模式对声音的视觉和听觉成分的线性叠加,作为一种简单但强大的解释机制,适用于两种物种的观察行为。因此,它代表了一种在灵长类动物中整合面部和声音的假定同源机制。

相似文献

1
Monkeys and humans share a common computation for face/voice integration.猴子和人类对面部/声音整合有着共同的计算。
PLoS Comput Biol. 2011 Sep;7(9):e1002165. doi: 10.1371/journal.pcbi.1002165. Epub 2011 Sep 29.
2
Dynamic faces speed up the onset of auditory cortical spiking responses during vocal detection.动态面孔在声音检测过程中加快听觉皮层的尖峰反应。
Proc Natl Acad Sci U S A. 2013 Nov 26;110(48):E4668-77. doi: 10.1073/pnas.1312518110. Epub 2013 Nov 11.
3
Interactions between the superior temporal sulcus and auditory cortex mediate dynamic face/voice integration in rhesus monkeys.颞上沟与听觉皮层之间的相互作用介导了恒河猴动态的面部/声音整合。
J Neurosci. 2008 Apr 23;28(17):4457-69. doi: 10.1523/JNEUROSCI.0541-08.2008.
4
Mouth and Voice: A Relationship between Visual and Auditory Preference in the Human Superior Temporal Sulcus.嘴巴与声音:人类颞上沟中视觉与听觉偏好之间的关系。
J Neurosci. 2017 Mar 8;37(10):2697-2708. doi: 10.1523/JNEUROSCI.2914-16.2017. Epub 2017 Feb 8.
5
Neural correlates of perceptual narrowing in cross-species face-voice matching.跨物种面孔-声音匹配中感知变窄的神经关联。
Dev Sci. 2012 Nov;15(6):830-9. doi: 10.1111/j.1467-7687.2012.01179.x. Epub 2012 Sep 7.
6
Eye Can Hear Clearly Now: Inverse Effectiveness in Natural Audiovisual Speech Processing Relies on Long-Term Crossmodal Temporal Integration.现在眼睛能“听清”了:自然视听言语处理中的反向有效性依赖于长期跨模态时间整合。
J Neurosci. 2016 Sep 21;36(38):9888-95. doi: 10.1523/JNEUROSCI.1396-16.2016.
7
Different neural frequency bands integrate faces and voices differently in the superior temporal sulcus.不同的神经频段在颞上沟对面部和声音的整合方式不同。
J Neurophysiol. 2009 Feb;101(2):773-88. doi: 10.1152/jn.90843.2008. Epub 2008 Nov 26.
8
Neural responses elicited to face motion and vocalization pairings.对面部运动和发声配对所引发的神经反应。
Neuropsychologia. 2007 Jan 7;45(1):93-106. doi: 10.1016/j.neuropsychologia.2006.04.017.
9
A "voice patch" system in the primate brain for processing vocal information?灵长类动物大脑中用于处理声音信息的“语音补丁”系统?
Hear Res. 2018 Sep;366:65-74. doi: 10.1016/j.heares.2018.04.010. Epub 2018 May 7.
10
Facilitation of multisensory integration by the "unity effect" reveals that speech is special.“统一效应”对多感官整合的促进作用表明言语具有特殊性。
J Vis. 2008 Jul 29;8(9):14.1-11. doi: 10.1167/8.9.14.

引用本文的文献

1
Distinct audio and visual accumulators co-activate motor preparation for multisensory detection.不同的听觉和视觉累加器共同激活多感官检测的运动准备。
Nat Hum Behav. 2025 Aug 15. doi: 10.1038/s41562-025-02280-9.
2
Monkeys can identify pictures from words.猴子能够从单词中识别图片。
PLoS One. 2025 Feb 12;20(2):e0317183. doi: 10.1371/journal.pone.0317183. eCollection 2025.
3
Representation of Expression and Identity by Ventral Prefrontal Neurons.腹侧前额叶神经元的表达和身份表征。

本文引用的文献

1
Behavioral Indices of Multisensory Integration: Orientation to Visual Cues is Affected by Auditory Stimuli.多感觉整合的行为指标:对视觉线索的定向受听觉刺激的影响。
J Cogn Neurosci. 1989 Winter;1(1):12-24. doi: 10.1162/jocn.1989.1.1.12.
2
Multisensory interactions in early evoked brain activity follow the principle of inverse effectiveness.早期诱发脑活动中的多感觉交互遵循反效性原则。
Neuroimage. 2011 Jun 15;56(4):2200-8. doi: 10.1016/j.neuroimage.2011.03.075. Epub 2011 Apr 8.
3
Spontaneous voice-face identity matching by rhesus monkeys for familiar conspecifics and humans.
Neuroscience. 2022 Aug 1;496:243-260. doi: 10.1016/j.neuroscience.2022.05.033. Epub 2022 May 30.
4
Honey bees respond to multimodal stimuli following the principle of inverse effectiveness.蜜蜂对多模态刺激的反应遵循反效性原则。
J Exp Biol. 2022 May 15;225(10). doi: 10.1242/jeb.243832. Epub 2022 May 24.
5
Faces and Voices Processing in Human and Primate Brains: Rhythmic and Multimodal Mechanisms Underlying the Evolution and Development of Speech.人类和灵长类动物大脑中的面孔与声音处理:言语进化与发展背后的节律性和多模态机制
Front Psychol. 2022 Mar 30;13:829083. doi: 10.3389/fpsyg.2022.829083. eCollection 2022.
6
Formant-Based Recognition of Words and Other Naturalistic Sounds in Rhesus Monkeys.基于共振峰的恒河猴单词及其他自然声音识别
Front Neurosci. 2021 Oct 29;15:728686. doi: 10.3389/fnins.2021.728686. eCollection 2021.
7
Audiovisual detection at different intensities and delays.在不同强度和延迟条件下的视听检测。
J Math Psychol. 2019 Aug;91:159-175. doi: 10.1016/j.jmp.2019.05.001. Epub 2019 Jul 2.
8
Differing Roles of the Face and Voice in Early Human Communication: Roots of Language in Multimodal Expression.面部和声音在早期人类交流中的不同作用:多模态表达中的语言根源。
Front Commun (Lausanne). 2017;2. doi: 10.3389/fcomm.2017.00010. Epub 2017 Sep 15.
9
Primary Generators of Visually Evoked Field Potentials Recorded in the Macaque Auditory Cortex.猕猴听觉皮层中记录到的视觉诱发电位的主要发生器。
J Neurosci. 2017 Oct 18;37(42):10139-10153. doi: 10.1523/JNEUROSCI.3800-16.2017. Epub 2017 Sep 18.
10
The variability of multisensory processes of natural stimuli in human and non-human primates in a detection task.人类和非人类灵长类动物在检测任务中自然刺激的多感官过程的变异性。
PLoS One. 2017 Feb 17;12(2):e0172480. doi: 10.1371/journal.pone.0172480. eCollection 2017.
恒河猴对熟悉的同种个体和人类的自发声音面孔身份匹配。
Proc Natl Acad Sci U S A. 2011 Jan 25;108(4):1735-40. doi: 10.1073/pnas.1008169108. Epub 2011 Jan 10.
4
Redundancy gains in simple responses and go/no-go tasks.简单反应和“是/否”任务中的冗余增益。
Atten Percept Psychophys. 2010 Aug;72(6):1692-709. doi: 10.3758/APP.72.6.1692.
5
Dynamic, rhythmic facial expressions and the superior temporal sulcus of macaque monkeys: implications for the evolution of audiovisual speech.猕猴动态、有节奏的面部表情与颞上沟:对视听言语进化的启示
Eur J Neurosci. 2010 May;31(10):1807-17. doi: 10.1111/j.1460-9568.2010.07209.x.
6
Semantic confusion regarding the development of multisensory integration: a practical solution.关于多感官整合发展的语义混淆:一种实用的解决方案。
Eur J Neurosci. 2010 May;31(10):1713-20. doi: 10.1111/j.1460-9568.2010.07206.x.
7
The optimal time window of visual-auditory integration: a reaction time analysis.视觉-听觉整合的最佳时间窗口:反应时间分析。
Front Integr Neurosci. 2010 May 11;4:11. doi: 10.3389/fnint.2010.00011. eCollection 2010.
8
A permutation test for the race model inequality.种族模型不等式的排列检验。
Behav Res Methods. 2010 Feb;42(1):23-8. doi: 10.3758/BRM.42.1.23.
9
Multisensory facilitation of behavior in monkeys: effects of stimulus intensity.多感官促进猴子的行为:刺激强度的影响。
J Cogn Neurosci. 2010 Dec;22(12):2850-63. doi: 10.1162/jocn.2010.21423.
10
Neural processing of asynchronous audiovisual speech perception.异步视听语音感知的神经处理。
Neuroimage. 2010 Feb 15;49(4):3308-18. doi: 10.1016/j.neuroimage.2009.12.001.