Angus Addlesee, Arash Eshghi
Interaction Lab, Heriot-Watt University, Edinburgh, United Kingdom.
Alana AI, Edinburgh, United Kingdom.
Front Dement. 2024 Mar 12;3:1343052. doi: 10.3389/frdem.2024.1343052. eCollection 2024.
In spontaneous conversation, speakers seldom have a full plan of what they are going to say in advance: they need to conceptualise and plan as they articulate each word in turn. This often leads to long pauses mid-utterance. Listeners either wait out the pause, offer a possible completion, or respond with an incremental clarification request (iCR), intended to recover the rest of the truncated turn. The ability to generate iCRs in response to pauses is therefore important in building everyday voice assistants (EVAs) such as Amazon Alexa. This becomes crucial with people with dementia (PwDs) as a target user group since they are known to pause longer and more frequently, with current state-of-the-art EVAs interrupting them prematurely, leading to frustration and breakdown of the interaction. In this article, we first use two existing corpora of truncated utterances to establish the generation of clarification requests as an effective strategy for recovering from interruptions. We then proceed to report on, analyse, and release SLUICE-CR: a new corpus of 3,000 crowdsourced, human-produced iCRs, the first of its kind. We use this corpus to probe the incremental processing capability of a number of state-of-the-art large language models (LLMs) by evaluating (1) the quality of the model's generated iCRs in response to incomplete questions and (2) the ability of these LLMs to respond correctly to the user's response to the generated iCR. For (1), our experiments show that the ability to generate contextually appropriate iCRs only emerges at larger LLM sizes and only when prompted with example iCRs from our corpus. For (2), our results are in line with (1), that is, that larger LLMs interpret incremental clarificational exchanges more effectively.
Overall, our results indicate that autoregressive language models (LMs) are, in principle, able to both understand and generate language incrementally and that LLMs can be configured to handle speech phenomena more commonly produced by PwDs, mitigating frustration with today's EVAs by improving their accessibility.
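The abstract notes that contextually appropriate iCRs only emerge when larger LLMs are prompted with example iCRs from the corpus, i.e. via few-shot prompting. The following is a minimal, hypothetical sketch of how such a few-shot prompt might be assembled; the example pairs and the prompt wording are invented for illustration and are not drawn from SLUICE-CR or the paper's actual setup.

```python
# Hypothetical few-shot prompt construction for iCR generation.
# Each example pairs a truncated user turn with a possible iCR;
# these pairs are illustrative only, not from the SLUICE-CR corpus.
FEWSHOT_EXAMPLES = [
    ("What is the capital of ...", "The capital of where?"),
    ("How many moons does ...", "How many moons does what have?"),
]

def build_icr_prompt(truncated_utterance, examples=FEWSHOT_EXAMPLES):
    """Assemble a few-shot prompt asking an LLM to produce an
    incremental clarification request (iCR) for a truncated turn."""
    lines = [
        "A speaker pauses mid-utterance. Ask a short clarification "
        "question that recovers the missing part of their turn.",
        "",
    ]
    for partial, icr in examples:
        lines.append(f"Speaker: {partial}")
        lines.append(f"Assistant: {icr}")
        lines.append("")
    # The model would complete the final "Assistant:" line with an iCR.
    lines.append(f"Speaker: {truncated_utterance}")
    lines.append("Assistant:")
    return "\n".join(lines)

prompt = build_icr_prompt("When was the Eiffel Tower ...")
print(prompt)
```

The resulting string would then be sent to an autoregressive LLM, whose completion is the candidate iCR evaluated for contextual appropriateness.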