• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

她会根据学生的情况进行调整:一位专业的语用学专家会根据外行人听众的情况调整她的指称表达。

She adapts to her student: An expert pragmatic speaker tailoring her referring expressions to the Layman listener.

作者信息

Greco Claudio, Bagade Diksha, Le Dieu-Thu, Bernardi Raffaella

机构信息

CIMeC, University of Trento, Rovereto, TN, Italy.

Amazon Alexa AI, Berlin, Germany.

出版信息

Front Artif Intell. 2023 Mar 9;6:1017204. doi: 10.3389/frai.2023.1017204. eCollection 2023.

DOI:10.3389/frai.2023.1017204
PMID:36967832
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10034353/
Abstract

Communication is a dynamic process through which interlocutors adapt to each other. In the development of conversational agents, this core aspect has been put aside for several years since the main challenge was to obtain conversational neural models able to produce utterances and dialogues that at least at the surface level are human-like. Now that this milestone has been achieved, the importance of paying attention to the dynamic and adaptive interactive aspects of language has been advocated in several position papers. In this paper, we focus on how a Speaker adapts to an interlocutor with different background knowledge. Our models undergo a pre-training phase, through which they acquire grounded knowledge by learning to describe an image, and an adaptive phase through which a Speaker and a Listener play a repeated reference game. Using a similar setting, previous studies focus on how conversational models create new conventions; we are interested, instead, in studying whether the Speaker learns from the Listener's mistakes to adapt to his background knowledge. We evaluate models based on Rational Speech Act (RSA), a likelihood loss, and a combination of the two. We show that RSA could indeed work as a backbone to drive the Speaker toward the Listener: in the combined model, apart from the improved Listener's accuracy, the language generated by the Speaker features the changes that signal adaptation to the Listener's background knowledge. Specifically, captions to unknown object categories contain more adjectives and less direct reference to the unknown objects.

摘要

交流是一个动态过程,在此过程中对话者相互适应。在对话代理的发展过程中,这一核心方面在数年里都被搁置一旁,因为主要挑战是获得能够生成至少在表面上类似人类的话语和对话的对话神经模型。既然这一里程碑已经达成,若干立场文件都主张了关注语言动态和适应性交互方面的重要性。在本文中,我们聚焦于说话者如何适应具有不同背景知识的对话者。我们的模型经历一个预训练阶段,通过该阶段它们通过学习描述图像来获取有根据的知识,以及一个适应阶段,通过该阶段说话者和倾听者进行重复的指称游戏。使用类似的设置,先前的研究聚焦于对话模型如何创造新的惯例;相反,我们感兴趣的是研究说话者是否从倾听者的错误中学习以适应其背景知识。我们基于理性言语行为(RSA)、似然损失以及两者的组合来评估模型。我们表明RSA确实可以作为驱使说话者趋向倾听者的主干:在组合模型中,除了提高倾听者的准确率之外,说话者生成的语言具有表明适应倾听者背景知识的变化。具体而言,针对未知对象类别的字幕包含更多形容词,并且对未知对象的直接指称更少。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/6f1015935310/frai-06-1017204-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/be025fdf0690/frai-06-1017204-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/bcb2824cd4f6/frai-06-1017204-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/30af1b269290/frai-06-1017204-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/12d50c135f76/frai-06-1017204-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/19e9debb19ea/frai-06-1017204-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/efdf3200efa0/frai-06-1017204-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/198c30dc3fb0/frai-06-1017204-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/612c51f03297/frai-06-1017204-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/6f1015935310/frai-06-1017204-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/be025fdf0690/frai-06-1017204-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/bcb2824cd4f6/frai-06-1017204-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/30af1b269290/frai-06-1017204-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/12d50c135f76/frai-06-1017204-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/19e9debb19ea/frai-06-1017204-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/efdf3200efa0/frai-06-1017204-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/198c30dc3fb0/frai-06-1017204-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/612c51f03297/frai-06-1017204-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c525/10034353/6f1015935310/frai-06-1017204-g0009.jpg

相似文献

1
She adapts to her student: An expert pragmatic speaker tailoring her referring expressions to the Layman listener.她会根据学生的情况进行调整:一位专业的语用学专家会根据外行人听众的情况调整她的指称表达。
Front Artif Intell. 2023 Mar 9;6:1017204. doi: 10.3389/frai.2023.1017204. eCollection 2023.
2
Reevaluating pragmatic reasoning in language games.重新评估语言游戏中的语用推理。
PLoS One. 2021 Mar 17;16(3):e0248388. doi: 10.1371/journal.pone.0248388. eCollection 2021.
3
[What factors play a role in a listener's feelings evoked by irony?: the effect of listeners' personality traits and relationship with the speaker].[哪些因素会影响听众对反讽的感受?:听众性格特质及与说话者关系的影响]
Shinrigaku Kenkyu. 2011 Oct;82(4):370-8. doi: 10.4992/jjpsy.82.370.
4
Speaker-Listener Neural Coupling Reveals an Adaptive Mechanism for Speech Comprehension in a Noisy Environment.说话者-倾听者神经耦合揭示了嘈杂环境中语音理解的一种自适应机制。
Cereb Cortex. 2021 Aug 26;31(10):4719-4729. doi: 10.1093/cercor/bhab118.
5
Limits to the Rational Production of Discourse Connectives.话语连接词合理生成的局限性。
Front Psychol. 2021 May 28;12:660730. doi: 10.3389/fpsyg.2021.660730. eCollection 2021.
6
Top-down effect of dialogue coherence on perceived speaker identity.对话连贯性对感知说话人身份的自上而下影响。
Sci Rep. 2023 Mar 1;13(1):3458. doi: 10.1038/s41598-023-30435-z.
7
Warm (for Winter): Inferring Comparison Classes in Communication.温暖(冬季):在交流中推断比较类。
Cogn Sci. 2022 Mar;46(3):e13095. doi: 10.1111/cogs.13095.
8
Do face-to-face interactions support 6-month-olds' understanding of the communicative function of speech?面对面的互动是否支持 6 个月大的婴儿理解言语的交际功能?
Infancy. 2023 Mar;28(2):240-256. doi: 10.1111/infa.12507. Epub 2022 Sep 21.
9
Effects of Disfluency in Online Interpretation of Deception.网络欺骗解读中不流畅性的影响。
Cogn Sci. 2017 May;41 Suppl 6:1434-1456. doi: 10.1111/cogs.12378. Epub 2016 Jun 1.
10
Speaker separation in realistic noise environments with applications to a cognitively-controlled hearing aid.在现实噪声环境中的说话人分离及其在认知控制助听器中的应用。
Neural Netw. 2021 Aug;140:136-147. doi: 10.1016/j.neunet.2021.02.020. Epub 2021 Mar 4.

本文引用的文献

1
From partners to populations: A hierarchical Bayesian account of coordination and convention.从伙伴到人群:协调和惯例的层次贝叶斯解释。
Psychol Rev. 2023 Jul;130(4):977-1016. doi: 10.1037/rev0000348. Epub 2022 Apr 14.
2
Speakers Align With Their Partner's Overspecification During Interaction.说话者在互动中与他们的伙伴的过度指定保持一致。
Cogn Sci. 2021 Dec;45(12):e13065. doi: 10.1111/cogs.13065.
3
Characterizing the Dynamics of Learning in Repeated Reference Games.描述重复参考博弈中学习的动态。
Cogn Sci. 2020 Jun;44(6):e12845. doi: 10.1111/cogs.12845.
4
Shared understanding of narratives is correlated with shared neural responses.叙事的共同理解与共同的神经反应相关。
Neuroimage. 2019 Jan 1;184:161-170. doi: 10.1016/j.neuroimage.2018.09.010. Epub 2018 Sep 12.
5
Brains in dialogue: decoding neural preparation of speaking to a conversational partner.大脑对话:解码与对话伙伴交谈的神经准备过程
Soc Cogn Affect Neurosci. 2017 Jun 1;12(6):871-880. doi: 10.1093/scan/nsx018.
6
Pragmatic Language Interpretation as Probabilistic Inference.语用语言阐释作为概率推理。
Trends Cogn Sci. 2016 Nov;20(11):818-829. doi: 10.1016/j.tics.2016.08.005. Epub 2016 Sep 28.
7
Revisiting the Memory-Based Processing Approach to Common Ground.重新审视基于记忆的共同基础处理方法。
Top Cogn Sci. 2016 Oct;8(4):780-795. doi: 10.1111/tops.12216. Epub 2016 Aug 17.
8
Predicting pragmatic reasoning in language games.预测语言游戏中的语用推理。
Science. 2012 May 25;336(6084):998. doi: 10.1126/science.1218633.
9
Brain-to-brain coupling: a mechanism for creating and sharing a social world.脑脑耦合:创造和分享社会世界的一种机制。
Trends Cogn Sci. 2012 Feb;16(2):114-21. doi: 10.1016/j.tics.2011.12.007. Epub 2012 Jan 3.
10
Why is conversation so easy?为什么交谈如此轻松?
Trends Cogn Sci. 2004 Jan;8(1):8-11. doi: 10.1016/j.tics.2003.10.016.