• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GenPADS:在端到端对话系统中增强礼貌。

GenPADS: Reinforcing politeness in an end-to-end dialogue system.

机构信息

Department of Computer Science and Engineering, Indian Institute of Technology Patna, Bihta, Bihar, India.

出版信息

PLoS One. 2023 Jan 6;18(1):e0278323. doi: 10.1371/journal.pone.0278323. eCollection 2023.

DOI:10.1371/journal.pone.0278323
PMID:36607963
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9821416/
Abstract

In a task-oriented dialogue setting, user's mood and demands can change in an ongoing dialogue, which may lead to a non-informative conversation or may result in conversation drop-off. To rectify such scenarios, a conversational agent should be able to learn the user's behaviour online, and form informative, empathetic and interactive responses. To incorporate these three aspects, we propose a novel end-to-end dialogue system GenPADS. First, we build and train two models, viz. a politeness classifier to extract polite information present in user's and agent's utterances and a generation model (G) to generate varying but semantically correct responses. We then incorporate both of these models in a reinforcement learning (RL) setting using two different politeness oriented reward algorithms to adapt and generate polite responses. To train our politeness classifier, we annotate recently released Taskmaster dataset into four fine-grained classes depicting politeness and impoliteness. Further, to train our generator model, we prepare a GenDD dataset using the same Taskmaster dataset. Lastly, we train GenPADS and perform automatic and human evaluation by building seven different user simulators. Detailed analysis reveals that GenPADS performs better than the two considered baselines,viz. a transformer based seq2seq generator model for user's and agent's utterance and a retrieval based politeness adaptive dialogue system (PADS).

摘要

在面向任务的对话环境中,用户的情绪和需求会在持续的对话中发生变化,这可能导致非信息性的对话或导致对话中断。为了解决这些情况,对话代理应该能够在线学习用户的行为,并形成信息丰富、富有同理心和互动性的响应。为了结合这三个方面,我们提出了一种新颖的端到端对话系统 GenPADS。首先,我们构建并训练了两个模型,即礼貌分类器,用于提取用户和代理话语中存在的礼貌信息,以及生成模型(G),用于生成不同但语义正确的响应。然后,我们在强化学习(RL)设置中使用两种不同的礼貌导向奖励算法将这两个模型结合起来,以适应和生成礼貌响应。为了训练我们的礼貌分类器,我们将最近发布的 Taskmaster 数据集标注为四个细粒度的类别,描绘了礼貌和不礼貌。此外,为了训练我们的生成器模型,我们使用相同的 Taskmaster 数据集准备了 GenDD 数据集。最后,我们训练了 GenPADS,并通过构建七个不同的用户模拟器进行了自动和人工评估。详细分析表明,GenPADS 优于我们考虑的两个基线,即基于转换器的 seq2seq 生成器模型,用于用户和代理的话语,以及基于检索的礼貌自适应对话系统(PADS)。

相似文献

1
GenPADS: Reinforcing politeness in an end-to-end dialogue system.GenPADS:在端到端对话系统中增强礼貌。
PLoS One. 2023 Jan 6;18(1):e0278323. doi: 10.1371/journal.pone.0278323. eCollection 2023.
2
A dynamic goal adapted task oriented dialogue agent.动态目标适应任务导向对话代理。
PLoS One. 2021 Apr 1;16(4):e0249030. doi: 10.1371/journal.pone.0249030. eCollection 2021.
3
Adapting conversational strategies in information-giving human-agent interaction.在信息提供型人机交互中调整对话策略。
Front Artif Intell. 2022 Oct 25;5:1029340. doi: 10.3389/frai.2022.1029340. eCollection 2022.
4
More to diverse: Generating diversified responses in a task oriented multimodal dialog system.更加多样化:在面向任务的多模态对话系统中生成多样化的响应。
PLoS One. 2020 Nov 5;15(11):e0241271. doi: 10.1371/journal.pone.0241271. eCollection 2020.
5
Reinforcing personalized persuasion in task-oriented virtual sales assistant.强化面向任务的虚拟销售助手的个性化说服。
PLoS One. 2023 Jan 5;18(1):e0275750. doi: 10.1371/journal.pone.0275750. eCollection 2023.
6
An emotion-sensitive dialogue policy for task-oriented dialogue system.面向任务的对话系统中的情感敏感对话策略。
Sci Rep. 2024 Aug 26;14(1):19759. doi: 10.1038/s41598-024-70463-x.
7
Chat agents respond more empathetically by using hearsay experience.聊天机器人通过运用传闻经验做出更具同理心的回应。
Front Robot AI. 2023 Jul 25;10:960087. doi: 10.3389/frobt.2023.960087. eCollection 2023.
8
Should robots be polite? Expectations about politeness in human-robot interaction.机器人应该有礼貌吗?对人机交互中礼貌的期望。
Front Robot AI. 2023 Nov 30;10:1242127. doi: 10.3389/frobt.2023.1242127. eCollection 2023.
9
A Hybrid Online Off-Policy Reinforcement Learning Agent Framework Supported by Transformers.基于 Transformer 的混合在线非策略强化学习代理框架。
Int J Neural Syst. 2023 Dec;33(12):2350065. doi: 10.1142/S012906572350065X. Epub 2023 Oct 20.
10
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning.基于自监督学习的强化学习可解释特征发现。
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2712-2724. doi: 10.1109/TPAMI.2020.3037898. Epub 2022 Apr 1.

本文引用的文献

1
Interrater reliability: the kappa statistic.组内一致性:kappa 统计量。
Biochem Med (Zagreb). 2012;22(3):276-82.