• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过渐进式澄清使语音助手对痴呆症患者更友好。

: making voice assistants more dementia-friendly with incremental clarification.

作者信息

Addlesee Angus, Eshghi Arash

机构信息

Interaction Lab, Heriot-Watt University, Edinburgh, United Kingdom.

Alana AI, Edinburgh, United Kingdom.

出版信息

Front Dement. 2024 Mar 12;3:1343052. doi: 10.3389/frdem.2024.1343052. eCollection 2024.

DOI:10.3389/frdem.2024.1343052
PMID:39081607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11285561/
Abstract

In spontaneous conversation, speakers seldom have a full plan of what they are going to say in advance: they need to conceptualise and plan as they articulate each word in turn. This often leads to long pauses mid-utterance. Listeners either wait out the pause, offer a possible completion, or respond with an incremental clarification request (iCR), intended to recover the rest of the truncated turn. The ability to generate iCRs in response to pauses is therefore important in building and everyday voice assistants (EVA) such as Amazon Alexa. This becomes crucial with people with dementia (PwDs) as a target user group since they are known to pause longer and more frequently, with current state-of-the-art EVAs interrupting them prematurely, leading to frustration and breakdown of the interaction. In this article, we first use two existing corpora of truncated utterances to establish the generation of clarification requests as an effective strategy for recovering from interruptions. We then proceed to report on, analyse, and release SLUICE-CR: a new corpus of 3,000 crowdsourced, human-produced iCRs, the first of its kind. We use this corpus to probe the incremental processing capability of a number of state-of-the-art large language models (LLMs) by evaluating (1) the quality of the model's generated iCRs in response to incomplete questions and (2) the ability of the said LLMs to respond correctly the users response to the generated iCR. For (1), our experiments show that the ability to generate contextually appropriate iCRs only emerges at larger LLM sizes and only when prompted with example iCRs from our corpus. For (2), our results are in line with (1), that is, that larger LLMs interpret incremental clarificational exchanges more effectively. Overall, our results indicate that autoregressive language models (LMs) are, in principle, able to both understand and generate language incrementally and that LLMs can be configured to handle speech phenomena more commonly produced by PwDs, mitigating frustration with today's EVAs by improving their accessibility.

摘要

在自然对话中,说话者很少会提前有一个完整的要说内容的计划:他们需要在依次说出每个单词时进行概念化和规划。这常常导致话语中间出现长时间停顿。倾听者要么等待停顿结束,提供一个可能的补充内容,要么以递增式澄清请求(iCR)做出回应,旨在找回被截断话轮的其余部分。因此,在构建像亚马逊Alexa这样的日常语音助手(EVA)时,针对停顿生成iCR的能力很重要。当以患有痴呆症的人(PwD)作为目标用户群体时,这一点变得至关重要,因为众所周知他们停顿的时间更长且更频繁,而当前最先进的EVA会过早打断他们,导致沮丧情绪并使互动中断。在本文中,我们首先使用两个现有的截断话语语料库来确定澄清请求的生成是从打断中恢复的有效策略。然后我们继续报告、分析并发布SLUICE - CR:一个由3000个众包的、人工生成的iCR组成的新语料库,这是同类中的第一个。我们使用这个语料库通过评估(1)模型针对不完整问题生成的iCR的质量,以及(2)上述大语言模型正确回应用户对生成的iCR的回应的能力,来探究一些最先进的大语言模型(LLM)的递增处理能力。对于(1),我们的实验表明,仅在较大规模的LLM中,并且仅当用我们语料库中的示例iCR进行提示时,才会出现生成上下文合适的iCR的能力。对于(2),我们的结果与(1)一致,即较大的LLM能更有效地解释递增式澄清交流。总体而言,我们的结果表明自回归语言模型(LM)原则上能够逐步理解和生成语言,并且可以配置LLM来处理PwD更常见产生的语音现象,通过提高其可及性来减轻对当今EVA的沮丧情绪。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/8fbefd1d5e9b/frdem-03-1343052-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/8dbc36998e72/frdem-03-1343052-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/0a5e99ce9cd8/frdem-03-1343052-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/ae4689030d25/frdem-03-1343052-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/4969a1a6634a/frdem-03-1343052-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/18ede999ec45/frdem-03-1343052-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/07ccac5c6f2f/frdem-03-1343052-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/98be78be77d4/frdem-03-1343052-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/8fbefd1d5e9b/frdem-03-1343052-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/8dbc36998e72/frdem-03-1343052-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/0a5e99ce9cd8/frdem-03-1343052-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/ae4689030d25/frdem-03-1343052-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/4969a1a6634a/frdem-03-1343052-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/18ede999ec45/frdem-03-1343052-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/07ccac5c6f2f/frdem-03-1343052-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/98be78be77d4/frdem-03-1343052-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f164/11285561/8fbefd1d5e9b/frdem-03-1343052-g0008.jpg

相似文献

1
: making voice assistants more dementia-friendly with incremental clarification.通过渐进式澄清使语音助手对痴呆症患者更友好。
Front Dement. 2024 Mar 12;3:1343052. doi: 10.3389/frdem.2024.1343052. eCollection 2024.
2
Combined effects of age and hearing impairment on utterances and requests for clarification in spontaneous conversation and a referential communication task.年龄和听力障碍对自发对话和参照交流任务中话语和澄清请求的综合影响。
Int J Lang Commun Disord. 2024 Jan-Feb;59(1):293-303. doi: 10.1111/1460-6984.12940. Epub 2023 Aug 17.
3
FROST: Fallback Voice Apps Recommendation for Unhandled Voice Commands in Intelligent Personal Assistants.弗罗斯特:智能个人助理中未处理语音命令的备用语音应用推荐
Front Big Data. 2022 Apr 25;5:867251. doi: 10.3389/fdata.2022.867251. eCollection 2022.
4
Introduction to Large Language Models (LLMs) for dementia care and research.用于痴呆症护理和研究的大语言模型介绍
Front Dement. 2024 May 14;3:1385303. doi: 10.3389/frdem.2024.1385303. eCollection 2024.
5
Large Language Models and User Trust: Consequence of Self-Referential Learning Loop and the Deskilling of Health Care Professionals.大语言模型与用户信任:自我参照学习循环的后果及医疗保健专业人员的技能退化
J Med Internet Res. 2024 Apr 25;26:e56764. doi: 10.2196/56764.
6
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare.ChatGPT及其他对话式大语言模型在医疗保健领域的系统评价
medRxiv. 2024 Apr 27:2024.04.26.24306390. doi: 10.1101/2024.04.26.24306390.
7
Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants.Alexa、Siri、Cortana 及其他:语音助手介绍。
Med Ref Serv Q. 2018 Jan-Mar;37(1):81-88. doi: 10.1080/02763869.2018.1404391.
8
Steering the conversation: A linguistic exploration of natural language interactions with a digital assistant during simulated driving.引导对话:对模拟驾驶期间与数字助理进行自然语言交互的语言探索。
Appl Ergon. 2017 Sep;63:53-61. doi: 10.1016/j.apergo.2017.04.003. Epub 2017 Apr 12.
9
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
10
Health and Fitness Apps for Hands-Free Voice-Activated Assistants: Content Analysis.适用于免提语音激活助手的健康与健身应用程序:内容分析
JMIR Mhealth Uhealth. 2018 Sep 24;6(9):e174. doi: 10.2196/mhealth.9705.

本文引用的文献

1
Understanding Barriers and Design Opportunities to Improve Healthcare and QOL for Older Adults through Voice Assistants.通过语音助手理解改善老年人医疗保健和生活质量的障碍与设计机会。
ASSETS. 2021 Oct;2021. doi: 10.1145/3441852.3471218. Epub 2021 Oct 17.
2
Evaluating Voice-Assistant Commands for Dementia Detection.评估用于痴呆症检测的语音助手指令
Comput Speech Lang. 2022 Mar;72. doi: 10.1016/j.csl.2021.101297. Epub 2021 Sep 22.
3
Perceptual prioritization of self-associated voices.自我相关声音的感知优先级。
Br J Psychol. 2021 Aug;112(3):585-610. doi: 10.1111/bjop.12479. Epub 2020 Oct 17.
4
Investigating the Accessibility of Voice Assistants With Impaired Users: Mixed Methods Study.调查残障用户对语音助手的可及性:混合方法研究。
J Med Internet Res. 2020 Sep 25;22(9):e18431. doi: 10.2196/18431.
5
A Personalized Voice-Based Diet Assistant for Caregivers of Alzheimer Disease and Related Dementias: System Development and Validation.基于个性化语音的阿尔茨海默病和相关痴呆症照护者饮食助手:系统开发与验证。
J Med Internet Res. 2020 Sep 21;22(9):e19897. doi: 10.2196/19897.
6
Qualitative study of affective identities in dementia patients for the design of cognitive assistive technologies.针对认知辅助技术设计的痴呆症患者情感身份质性研究。
J Rehabil Assist Technol Eng. 2017 Jan 1;4:2055668316685038. doi: 10.1177/2055668316685038. eCollection 2017 Jan-Dec.
7
Speech-based markers for posttraumatic stress disorder in US veterans.基于语音的美国退伍军人创伤后应激障碍标志物。
Depress Anxiety. 2019 Jul;36(7):607-616. doi: 10.1002/da.22890. Epub 2019 Apr 22.
8
Connected Speech Features from Picture Description in Alzheimer's Disease: A Systematic Review.阿尔茨海默病患者图片描述中的连续语音特征:系统评价。
J Alzheimers Dis. 2018;65(2):519-542. doi: 10.3233/JAD-170881.
9
Nouns slow down speech across structurally and culturally diverse languages.名词会减缓各种结构和文化差异语言的语速。
Proc Natl Acad Sci U S A. 2018 May 29;115(22):5720-5725. doi: 10.1073/pnas.1800708115. Epub 2018 May 14.
10
Running Repairs: Coordinating Meaning in Dialogue.运行修复:对话中的协调意义。
Top Cogn Sci. 2018 Apr;10(2):367-388. doi: 10.1111/tops.12336. Epub 2018 Apr 24.