Remarks on Multimodality: Grammatical Interactions in the Parallel Architecture.

Authors

Cohn Neil, Schilperoord Joost

Affiliation

Department of Communication and Cognition, Tilburg School of Humanities and Digital Sciences, Tilburg University, Tilburg, Netherlands.

Publication information

Front Artif Intell. 2022 Jan 4;4:778060. doi: 10.3389/frai.2021.778060. eCollection 2021.

DOI:10.3389/frai.2021.778060
PMID:35059636
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8764459/
Abstract

Language is typically embedded in multimodal communication, yet models of linguistic competence do not often incorporate this complexity. Meanwhile, speech, gesture, and/or pictures are each considered as indivisible components of multimodal messages. Here, we argue that multimodality should not be characterized by whole interacting behaviors, but by interactions of similar substructures which permeate across expressive behaviors. These structures comprise a unified architecture and align within Jackendoff's Parallel Architecture: a modality, meaning, and grammar. Because this tripartite architecture persists across modalities, interactions can manifest within each of these substructures. Interactions between modalities alone create correspondences in time (ex. speech with gesture) or space (ex. writing with pictures) of the sensory signals, while multimodal meaning-making balances how modalities carry "semantic weight" for the gist of the whole expression. Here we focus primarily on interactions between grammars, which contrast across two variables: symmetry, related to the complexity of the grammars, and allocation, related to the relative independence of interacting grammars. While independent allocations keep grammars separate, substitutive allocation inserts expressions from one grammar into those of another. We show that substitution operates in interactions between all three natural modalities (vocal, bodily, graphic), and also in unimodal contexts within and between languages, as in codeswitching. Altogether, we argue that unimodal and multimodal expressions arise as emergent interactive states from a unified cognitive architecture, heralding a reconsideration of the "language faculty" itself.
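
The idea of substitutive allocation described in the abstract can be illustrated with a toy sketch. This is not the authors' formalism: the `Unit` class, the `substitute` function, the category labels, and the speech-plus-gesture example are all hypothetical names chosen for illustration. The sketch only assumes that a unit from one modality can fill a grammatical slot in a sequence built by another modality's grammar.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """One expressive unit (hypothetical representation, for illustration)."""
    form: str       # surface form, e.g. a spoken word or a described gesture
    modality: str   # "vocal", "bodily", or "graphic"
    category: str   # grammatical slot the unit fills in the host sequence


def substitute(host, index, guest_form, guest_modality):
    """Return a copy of `host` with the unit at `index` replaced by a guest
    unit from another modality. The guest inherits the slot's category,
    so the host grammar's structure is left unchanged."""
    slot = host[index]
    guest = Unit(guest_form, guest_modality, slot.category)
    return host[:index] + [guest] + host[index + 1:]


def modalities(sequence):
    """Set of modalities that contribute units to a sequence."""
    return {unit.modality for unit in sequence}


# A fully spoken (unimodal) utterance.
spoken = [
    Unit("the", "vocal", "Det"),
    Unit("fish", "vocal", "N"),
    Unit("was", "vocal", "V"),
    Unit("huge", "vocal", "Adj"),
]

# The adjective slot is filled by a gesture instead of a word.
mixed = substitute(spoken, 3, "<hands held wide apart>", "bodily")

for unit in mixed:
    print(f"{unit.category:>3}  {unit.modality:<6}  {unit.form}")
```

In this sketch the mixed utterance keeps the same sequence of categories as the spoken one; only the modality of one slot changes, which is the sense in which one grammar hosts an expression from another.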

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/9c63d5b23439/frai-04-778060-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/2f8765510f24/frai-04-778060-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/cd10aa91dc7c/frai-04-778060-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/353ae7b7f501/frai-04-778060-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/d10c0da42e86/frai-04-778060-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/5faa0f127b3a/frai-04-778060-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/afa250b44181/frai-04-778060-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/d7ac7c8644f5/frai-04-778060-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4663/8764459/9ad3d0ff645e/frai-04-778060-g0009.jpg
