Visual Complexity and Its Effects on Referring Expression Generation.

Authors

Micha Elsner, Alasdair Clarke, Hannah Rohde

Affiliations

Department of Linguistics, The Ohio State University.

Department of Psychology, University of Essex.

Publication information

Cogn Sci. 2018 Jun;42 Suppl 4:940-973. doi: 10.1111/cogs.12507. Epub 2017 Jun 26.

DOI: 10.1111/cogs.12507
PMID: 28649757

Abstract

Speakers' perception of a visual scene influences the language they use to describe it: which objects they choose to mention and how they characterize the relationships between them. We show that visual complexity can either delay or facilitate description generation, depending on how much disambiguating information is required and how useful the scene's complexity can be in providing, for example, helpful landmarks. To do so, we measure speech onset times, eye gaze, and utterance content in a reference production experiment in which the target object is either unique or non-unique in a visual scene of varying size and complexity. Speakers delay speech onset if the target object is non-unique and requires disambiguation, and we argue that this reflects the cost of deciding on a high-level strategy for describing it. The eye-tracking data demonstrate that these delays increase when speakers are able to conduct an extensive early visual search, implying that when speakers scan too little of the scene early on, they may decide to begin speaking before becoming aware that their description is underspecified. Speakers' content choices reflect the visual makeup of the scene: the number of distractors present and the availability of useful landmarks. Our results highlight the complex role of visual perception in reference production, showing that speakers can make good use of complexity in ways that reflect their visual processing of the scene.

Similar articles

1. Visual Complexity and Its Effects on Referring Expression Generation. Cogn Sci. 2018 Jun;42 Suppl 4:940-973. doi: 10.1111/cogs.12507. Epub 2017 Jun 26.
2. Realistic About Reference Production: Testing the Effects of Domain Size and Saturation. Cogn Sci. 2024 Jun;48(6):e13473. doi: 10.1111/cogs.13473.
3. Lateralized electrical brain activity reveals covert attention allocation during speaking. Neuropsychologia. 2017 Jan 27;95:101-110. doi: 10.1016/j.neuropsychologia.2016.12.013. Epub 2016 Dec 8.
4. Integrating mechanisms of visual guidance in naturalistic language production. Cogn Process. 2015 May;16(2):131-50. doi: 10.1007/s10339-014-0642-0. Epub 2014 Nov 23.
5. Reference Production as Search: The Impact of Domain Size on the Production of Distinguishing Descriptions. Cogn Sci. 2017 May;41 Suppl 6:1457-1492. doi: 10.1111/cogs.12375. Epub 2016 Jun 6.
6. Modulation of scene consistency and task demand on language-driven eye movements for audio-visual integration. Acta Psychol (Amst). 2016 Nov;171:1-16. doi: 10.1016/j.actpsy.2016.09.004. Epub 2016 Sep 15.
7. Variation in dual-task performance reveals late initiation of speech planning in turn-taking. Cognition. 2015 Mar;136:304-24. doi: 10.1016/j.cognition.2014.10.008. Epub 2014 Dec 15.
8. Objective eye-gaze behaviour during face-to-face communication with proficient alaryngeal speakers: a preliminary study. Int J Lang Commun Disord. 2011 Sep-Oct;46(5):535-49. doi: 10.1111/j.1460-6984.2011.00005.x. Epub 2011 Mar 7.
9. Looking at a contrast object before speaking boosts referential informativeness, but is not essential. Acta Psychol (Amst). 2017 Jul;178:87-99. doi: 10.1016/j.actpsy.2017.06.001. Epub 2017 Jun 16.
10. Anticipation in Real-World Scenes: The Role of Visual Context and Visual Memory. Cogn Sci. 2016 Nov;40(8):1995-2024. doi: 10.1111/cogs.12313. Epub 2015 Oct 30.

Cited by

1. Reevaluating pragmatic reasoning in language games. PLoS One. 2021 Mar 17;16(3):e0248388. doi: 10.1371/journal.pone.0248388. eCollection 2021.
2. On Visually-Grounded Reference Production: Testing the Effects of Perceptual Grouping and 2D/3D Presentation Mode. Front Psychol. 2019 Oct 1;10:2247. doi: 10.3389/fpsyg.2019.02247. eCollection 2019.