• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

思考FAST:一种评估对话式人工智能健康教练对话中的保真度、准确性、安全性和语气的新颖框架。

Think FAST: a novel framework to evaluate fidelity, accuracy, safety, and tone in conversational AI health coach dialogues.

作者信息

Neary Martha, Fulton Emily, Rogers Victoria, Wilson Julia, Griffiths Zoe, Chuttani Ram, Sacher Paul M

机构信息

Vir Health Ltd., London, United Kingdom.

Allurion Technologies, Natick, MA, United States.

出版信息

Front Digit Health. 2025 Jun 18;7:1460236. doi: 10.3389/fdgth.2025.1460236. eCollection 2025.

DOI:10.3389/fdgth.2025.1460236
PMID:40607190
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12216977/
Abstract

Developments in Machine Learning based Conversational and Generative Artificial Intelligence (GenAI) have created opportunities for sophisticated Conversational Agents to augment elements of healthcare. While not a replacement for professional care, AI offers opportunities for scalability, cost effectiveness, and automation of many aspects of patient care. However, to realize these opportunities and deliver AI-enabled support safely, interactions between patients and AI must be continuously monitored and evaluated against an agreed upon set of performance criteria. This paper presents one such set of criteria which was developed to evaluate interactions with an AI Health Coach designed to support patients receiving obesity treatment and deployed with an active patient user base. The evaluation framework evolved through an iterative process of development, testing, refining, training, reviewing and supervision. The framework evaluates at both individual message and overall conversation level, rating interactions as Acceptable or Unacceptable in four domains: Fidelity, Accuracy, Safety, and Tone (FAST), with a series of questions to be considered with respect to each domain. Processes to ensure consistent evaluation quality were established and additional patient safety procedures were defined for escalations to healthcare providers based on clinical risk. The framework can be implemented by trained evaluators and offers a method by which healthcare settings deploying AI to support patients can review quality and safety, thus ensuring safe adoption.

摘要

基于机器学习的对话式和生成式人工智能(GenAI)的发展为先进的对话代理创造了机会,以增强医疗保健的各个方面。虽然人工智能不能替代专业护理,但它为扩大规模、提高成本效益以及实现患者护理诸多方面的自动化提供了机会。然而,为了实现这些机会并安全地提供人工智能支持,必须根据一套商定的性能标准,持续监测和评估患者与人工智能之间的互动。本文提出了这样一套标准,该标准是为评估与人工智能健康教练的互动而制定的,该教练旨在支持接受肥胖治疗的患者,并已在活跃的患者用户群体中部署。评估框架通过开发、测试、完善、培训、审查和监督的迭代过程不断演变。该框架在单个消息和整体对话层面进行评估,在四个领域将互动评定为可接受或不可接受:保真度、准确性、安全性和语气(FAST),每个领域都有一系列需要考虑的问题。建立了确保评估质量一致的流程,并根据临床风险定义了向医疗保健提供者升级的额外患者安全程序。该框架可由经过培训的评估人员实施,并提供一种方法,通过该方法,部署人工智能以支持患者的医疗机构可以审查质量和安全性,从而确保安全采用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/d86a20ef9f7b/fdgth-07-1460236-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/f5b043cbe0fa/fdgth-07-1460236-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/3bc2711b74e9/fdgth-07-1460236-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/20f53ad2ed65/fdgth-07-1460236-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/c765192c9a2a/fdgth-07-1460236-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/d86a20ef9f7b/fdgth-07-1460236-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/f5b043cbe0fa/fdgth-07-1460236-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/3bc2711b74e9/fdgth-07-1460236-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/20f53ad2ed65/fdgth-07-1460236-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/c765192c9a2a/fdgth-07-1460236-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cd3/12216977/d86a20ef9f7b/fdgth-07-1460236-g005.jpg

相似文献

1
Think FAST: a novel framework to evaluate fidelity, accuracy, safety, and tone in conversational AI health coach dialogues.思考FAST:一种评估对话式人工智能健康教练对话中的保真度、准确性、安全性和语气的新颖框架。
Front Digit Health. 2025 Jun 18;7:1460236. doi: 10.3389/fdgth.2025.1460236. eCollection 2025.
2
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
3
How to Implement Digital Clinical Consultations in UK Maternity Care: the ARM@DA Realist Review.如何在英国产科护理中实施数字临床会诊:ARM@DA实证主义综述
Health Soc Care Deliv Res. 2025 May 21:1-77. doi: 10.3310/WQFV7425.
4
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
5
Safety and User Experience of a Generative Artificial Intelligence Digital Mental Health Intervention: Exploratory Randomized Controlled Trial.生成式人工智能数字心理健康干预的安全性与用户体验:探索性随机对照试验
J Med Internet Res. 2025 May 23;27:e67365. doi: 10.2196/67365.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
8
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
AI for IMPACTS Framework for Evaluating the Long-Term Real-World Impacts of AI-Powered Clinician Tools: Systematic Review and Narrative Synthesis.用于评估人工智能驱动的临床医生工具长期现实世界影响的AI for IMPACTS框架:系统评价与叙述性综合分析
J Med Internet Res. 2025 Feb 5;27:e67485. doi: 10.2196/67485.

本文引用的文献

1
Examining the Accuracy and Reproducibility of Responses to Nutrition Questions Related to Inflammatory Bowel Disease by Generative Pre-trained Transformer-4.通过生成式预训练变换器-4检验与炎症性肠病相关营养问题回答的准确性和可重复性。
Crohns Colitis 360. 2025 Feb 19;7(1):otae077. doi: 10.1093/crocol/otae077. eCollection 2025 Jan.
2
Leveraging large language models for generating responses to patient messages-a subjective analysis.利用大型语言模型生成对患者信息的回复——主观分析。
J Am Med Inform Assoc. 2024 May 20;31(6):1367-1379. doi: 10.1093/jamia/ocae052.
3
ChatGPT and mental healthcare: balancing benefits with risks of harms.
ChatGPT 与精神健康保健:平衡获益与风险。
BMJ Ment Health. 2023 Nov;26(1). doi: 10.1136/bmjment-2023-300884.
4
A review of the explainability and safety of conversational agents for mental health to identify avenues for improvement.对用于心理健康的对话代理的可解释性和安全性进行综述,以确定改进途径。
Front Artif Intell. 2023 Oct 12;6:1229805. doi: 10.3389/frai.2023.1229805. eCollection 2023.
5
Defining digital coaching: a qualitative inductive approach.定义数字辅导:一种定性归纳法。
Front Psychol. 2023 Oct 5;14:1148243. doi: 10.3389/fpsyg.2023.1148243. eCollection 2023.
6
Clinical science and practice in the age of large language models and generative artificial intelligence.大语言模型和生成式人工智能时代的临床科学与实践。
J Consult Clin Psychol. 2023 Oct;91(10):559-561. doi: 10.1037/ccp0000848.
7
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
8
Systematic review and meta-analysis of the effectiveness of chatbots on lifestyle behaviours.聊天机器人对生活方式行为有效性的系统评价与荟萃分析。
NPJ Digit Med. 2023 Jun 23;6(1):118. doi: 10.1038/s41746-023-00856-1.
9
AI in health: keeping the human in the loop.健康领域的人工智能:让人类参与其中。
J Am Med Inform Assoc. 2023 Jun 20;30(7):1225-1226. doi: 10.1093/jamia/ocad091.
10
Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum.比较医生和人工智能聊天机器人对发布在公共社交媒体论坛上的患者问题的回复。
JAMA Intern Med. 2023 Jun 1;183(6):589-596. doi: 10.1001/jamainternmed.2023.1838.