Protocol for the development of the Chatbot Assessment Reporting Tool (CHART) for clinical advice.

Publication information

BMJ Open. 2024 May 21;14(5):e081155. doi: 10.1136/bmjopen-2023-081155.

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11110548/

Abstract

INTRODUCTION

Large language model (LLM)-linked chatbots are being increasingly applied in healthcare due to their impressive functionality and public availability. Studies have assessed the ability of LLM-linked chatbots to provide accurate clinical advice. However, the methods applied in these Chatbot Assessment Studies are inconsistent due to the lack of reporting standards available, which obscures the interpretation of their study findings. This protocol outlines the development of the Chatbot Assessment Reporting Tool (CHART) reporting guideline.

METHODS AND ANALYSIS

The development of the CHART reporting guideline will consist of three phases, led by the Steering Committee. During phase one, the team will identify relevant reporting guidelines with artificial intelligence extensions that are published or in development by searching preprint servers, protocol databases, and the Enhancing the Quality and Transparency of health research Network. During phase two, we will conduct a scoping review to identify studies that have addressed the performance of LLM-linked chatbots in summarising evidence and providing clinical advice. The Steering Committee will identify methodology used in previous Chatbot Assessment Studies. Finally, the study team will use checklist items from prior reporting guidelines and findings from the scoping review to develop a draft reporting checklist. We will then perform a Delphi consensus and host two synchronous consensus meetings with an international, multidisciplinary group of stakeholders to refine reporting checklist items and develop a flow diagram.

ETHICS AND DISSEMINATION

We will publish the final CHART reporting guideline in peer-reviewed journals and will present findings at peer-reviewed meetings. Ethical approval was submitted to the Hamilton Integrated Research Ethics Board and deemed "not required" in accordance with the Tri-Council Policy Statement (TCPS2) for the development of the CHART reporting guideline (#17025).

REGISTRATION

This study protocol is preregistered with Open Science Framework: https://doi.org/10.17605/OSF.IO/59E2Q.
