• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化面向任务的虚拟销售助手的个性化说服。

Reinforcing personalized persuasion in task-oriented virtual sales assistant.

机构信息

Dept. of Computer Science, Ramakrishna Mission Vivekananda Educational and Research Institute, Belur, Howrah, India.

Dept. of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, Bihar, India.

出版信息

PLoS One. 2023 Jan 5;18(1):e0275750. doi: 10.1371/journal.pone.0275750. eCollection 2023.

DOI:10.1371/journal.pone.0275750
PMID:36602995
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9815581/
Abstract

PURPOSE

Existing task-oriented virtual agents can assist users with simple tasks like ticket booking, hotel reservations, etc. effectively and with high confidence. These virtual assistants, however, assume specific, predictable end-user behavior, such as predefined/servable objectives, which results in conversation failures in challenging situations, such as when goals are unavailable.

METHODOLOGY

Inspired by the practice and its efficacy, we propose an end-to-end framework for task-oriented persuasive dialogue generation that combines pre-training and reinforcement learning for generating context-aware persuasive responses. We utilize four novel rewards to improve consistency and repetitiveness in generated responses. Additionally, a meta-learning strategy has also been utilized to make the model parameters better for domain adaptation. Furthermore, we also curate a personalized persuasive dialogue (PPD) corpus, which contains utterance-level intent, slot, sentiment, and persuasion strategy annotation.

FINDINGS

The obtained results and detailed analysis firmly establish the effectiveness of the proposed persuasive virtual assistant over traditional task-oriented virtual assistants. The proposed framework considerably increases the quality of dialogue generation in terms of consistency and repetitiveness. Additionally, our experiment with a few shot and zero-shot settings proves that our meta-learned model learns to quickly adopt new domains with a few or even zero no. of training epochs. It outperforms the non-meta-learning-based approaches keeping the base model constant.

ORIGINALITY

To the best of our knowledge, this is the first effort to improve a task-oriented virtual agent's persuasiveness and domain adaptation.

摘要

目的

现有的面向任务的虚拟助手可以有效地帮助用户完成简单的任务,例如订票、预订酒店等,并且具有很高的信心。然而,这些虚拟助手假设特定的、可预测的最终用户行为,例如预定义/可服务的目标,这导致在具有挑战性的情况下(例如目标不可用时)会出现对话失败。

方法

受实践及其功效的启发,我们提出了一种端到端的面向任务的有说服力的对话生成框架,该框架结合了预训练和强化学习,用于生成上下文感知的有说服力的响应。我们利用四个新颖的奖励来提高生成响应的一致性和重复性。此外,还利用元学习策略使模型参数更适合领域自适应。此外,我们还精心制作了一个个性化的有说服力的对话(PPD)语料库,其中包含话语级意图、插槽、情感和说服策略注释。

发现

所获得的结果和详细分析坚定地确立了所提出的有说服力的虚拟助手相对于传统面向任务的虚拟助手的有效性。所提出的框架在一致性和重复性方面大大提高了对话生成的质量。此外,我们在少量和零样本设置下的实验证明,我们的元学习模型可以快速学习采用新的领域,只需几个甚至零个训练时期。它优于保持基础模型不变的非元学习方法。

原创性

据我们所知,这是首次努力提高面向任务的虚拟代理的说服力和领域适应性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/158d79cca69f/pone.0275750.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/1c2a1723576a/pone.0275750.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/c2636f11b152/pone.0275750.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/b343942e04d6/pone.0275750.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/d1539f26247d/pone.0275750.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/fb43c1694681/pone.0275750.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/513b530ccd24/pone.0275750.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/f8b8fbc77d8a/pone.0275750.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/d2272ceb72b1/pone.0275750.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/0bf0743cc9ae/pone.0275750.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/ebc0ac92f571/pone.0275750.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/75985cfc221a/pone.0275750.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/16b707ca75b4/pone.0275750.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/158d79cca69f/pone.0275750.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/1c2a1723576a/pone.0275750.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/c2636f11b152/pone.0275750.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/b343942e04d6/pone.0275750.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/d1539f26247d/pone.0275750.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/fb43c1694681/pone.0275750.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/513b530ccd24/pone.0275750.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/f8b8fbc77d8a/pone.0275750.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/d2272ceb72b1/pone.0275750.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/0bf0743cc9ae/pone.0275750.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/ebc0ac92f571/pone.0275750.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/75985cfc221a/pone.0275750.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/16b707ca75b4/pone.0275750.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/def1/9815581/158d79cca69f/pone.0275750.g013.jpg

相似文献

1
Reinforcing personalized persuasion in task-oriented virtual sales assistant.强化面向任务的虚拟销售助手的个性化说服。
PLoS One. 2023 Jan 5;18(1):e0275750. doi: 10.1371/journal.pone.0275750. eCollection 2023.
2
A dynamic goal adapted task oriented dialogue agent.动态目标适应任务导向对话代理。
PLoS One. 2021 Apr 1;16(4):e0249030. doi: 10.1371/journal.pone.0249030. eCollection 2021.
3
Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning.基于分层强化学习的多意图对话中基于情感的对话策略学习。
PLoS One. 2020 Jul 2;15(7):e0235367. doi: 10.1371/journal.pone.0235367. eCollection 2020.
4
GenPADS: Reinforcing politeness in an end-to-end dialogue system.GenPADS:在端到端对话系统中增强礼貌。
PLoS One. 2023 Jan 6;18(1):e0278323. doi: 10.1371/journal.pone.0278323. eCollection 2023.
5
The effectiveness of internet-based e-learning on clinician behavior and patient outcomes: a systematic review protocol.基于互联网的电子学习对临床医生行为和患者结局的有效性:一项系统评价方案。
JBI Database System Rev Implement Rep. 2015 Jan;13(1):52-64. doi: 10.11124/jbisrir-2015-1919.
6
FewJoint: few-shot learning for joint dialogue understanding.FewJoint:用于联合对话理解的少样本学习
Int J Mach Learn Cybern. 2022;13(11):3409-3423. doi: 10.1007/s13042-022-01604-9. Epub 2022 Jul 18.
7
An emotion-sensitive dialogue policy for task-oriented dialogue system.面向任务的对话系统中的情感敏感对话策略。
Sci Rep. 2024 Aug 26;14(1):19759. doi: 10.1038/s41598-024-70463-x.
8
IvCDS: An End-to-End Driver Simulator for Personal In-Vehicle Conversational Assistant.IvCDS:面向车载个人会话助理的端到端驾驶员模拟器。
Int J Environ Res Public Health. 2022 Nov 22;19(23):15493. doi: 10.3390/ijerph192315493.
9
Multitask Learning and Reinforcement Learning for Personalized Dialog Generation: An Empirical Study.用于个性化对话生成的多任务学习与强化学习:一项实证研究
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):49-62. doi: 10.1109/TNNLS.2020.2975035. Epub 2021 Jan 4.
10
The Intersection of Persuasive System Design and Personalization in Mobile Health: Statistical Evaluation.移动健康中的说服系统设计与个性化交叉:统计评估。
JMIR Mhealth Uhealth. 2022 Sep 14;10(9):e40576. doi: 10.2196/40576.

本文引用的文献

1
A dynamic goal adapted task oriented dialogue agent.动态目标适应任务导向对话代理。
PLoS One. 2021 Apr 1;16(4):e0249030. doi: 10.1371/journal.pone.0249030. eCollection 2021.
2
Meta Networks.元网络
Proc Mach Learn Res. 2017 Aug;70:2554-2563.