Reliability of artificial intelligence chatbot responses to frequently asked questions in breast surgical oncology.

Affiliations

Department of Surgery, Breast Surgical Oncology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA.

School of Medicine and Dentistry, University of Rochester, Rochester, New York, USA.

Publication information

J Surg Oncol. 2024 Aug;130(2):188-203. doi: 10.1002/jso.27715. Epub 2024 Jun 4.

DOI:10.1002/jso.27715

PMID:38837375

Abstract

INTRODUCTION

Artificial intelligence (AI)-driven chatbots, capable of simulating human-like conversations, are becoming more prevalent in healthcare. While this technology offers potential benefits in patient engagement and information accessibility, it raises concerns about potential misuse, misinformation, inaccuracies, and ethical challenges.

METHODS

This study evaluated a publicly available AI chatbot, ChatGPT, in its responses to nine questions related to breast cancer surgery selected from the American Society of Breast Surgeons' frequently asked questions (FAQ) patient education website. Four breast surgical oncologists assessed the responses for accuracy and reliability using a five-point Likert scale and the Patient Education Materials Assessment (PEMAT) Tool.

RESULTS

The average reliability score for ChatGPT in answering breast cancer surgery questions was 3.98 out of 5.00. Surgeons unanimously found the responses understandable and actionable per the PEMAT criteria. The consensus found ChatGPT's overall performance was appropriate, with minor or no inaccuracies.
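
As an illustration of how an average score like the one reported could be computed, the sketch below pools five-point Likert ratings across questions and raters and takes their mean. The abstract does not state how the scores were aggregated, so equal-weight pooling is an assumption, the function name `mean_likert` is hypothetical, and the ratings are made-up placeholders rather than study data.

```python
def mean_likert(ratings):
    """Return the mean of all ratings (rows = questions, columns = raters).

    Assumption: every rating is weighted equally; the study's actual
    aggregation method is not described in the abstract.
    """
    pooled = [score for question in ratings for score in question]
    if not pooled:
        raise ValueError("no ratings supplied")
    if any(not 1 <= score <= 5 for score in pooled):
        raise ValueError("ratings must lie on the 1-5 Likert scale")
    return sum(pooled) / len(pooled)


# Placeholder ratings for 3 questions x 4 raters. NOT data from the study.
example = [
    [4, 4, 5, 3],
    [5, 4, 4, 4],
    [3, 4, 4, 4],
]

print(round(mean_likert(example), 2))
```

With the study's design, the input would instead hold nine rows (questions) of four ratings each (one per surgeon).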

CONCLUSION

ChatGPT demonstrates good reliability in responding to breast cancer surgery queries, with minor, nonharmful inaccuracies. Its answers are accurate, clear, and easy to comprehend. Notably, ChatGPT acknowledged its informational role and did not attempt to replace medical advice or discourage users from seeking input from a healthcare professional.

Similar articles

Assessing artificial intelligence responses to common patient questions regarding inflatable penile prostheses using a publicly available natural language processing tool (ChatGPT).

Can J Urol. 2024 Jun;31(3):11880-11885.

Evaluating ChatGPT to test its robustness as an interactive information database of radiation oncology and to assess its responses to common queries from radiotherapy patients: A single institution investigation.

Cancer Radiother. 2024 Jun;28(3):258-264. doi: 10.1016/j.canrad.2023.11.005. Epub 2024 Jun 12.

An Artificial Intelligence Chatbot is an Accurate and Useful Online Patient Resource Prior to Total Knee Arthroplasty.

J Arthroplasty. 2024 Aug;39(8S1):S358-S362. doi: 10.1016/j.arth.2024.02.005. Epub 2024 Feb 11.

An assessment of ChatGPT's responses to frequently asked questions about cervical and breast cancer.

BMC Womens Health. 2024 Sep 2;24(1):482. doi: 10.1186/s12905-024-03320-8.

Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer.

JAMA Oncol. 2023 Oct 1;9(10):1437-1440. doi: 10.1001/jamaoncol.2023.2947.

Evaluating Chatbot Efficacy for Answering Frequently Asked Questions in Plastic Surgery: A ChatGPT Case Study Focused on Breast Augmentation.

Aesthet Surg J. 2023 Sep 14;43(10):1126-1135. doi: 10.1093/asj/sjad140.

Exploring the Potential of ChatGPT-4 in Responding to Common Questions About Abdominoplasty: An AI-Based Case Study of a Plastic Surgery Consultation.

Aesthetic Plast Surg. 2024 Apr;48(8):1571-1583. doi: 10.1007/s00266-023-03660-0. Epub 2023 Sep 28.

Talking technology: exploring chatbots as a tool for cataract patient education.

Clin Exp Optom. 2025 Jan;108(1):56-64. doi: 10.1080/08164622.2023.2298812. Epub 2024 Jan 9.

Assessing ChatGPT vs. Standard Medical Resources for Endoscopic Sleeve Gastroplasty Education: A Medical Professional Evaluation Study.

Obes Surg. 2024 Jul;34(7):2718-2724. doi: 10.1007/s11695-024-07283-5. Epub 2024 May 17.

Cited by

Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer.

BMC Oral Health. 2025 Aug 23;25(1):1358. doi: 10.1186/s12903-025-06726-4.

Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics.

Patient Prefer Adherence. 2025 Jul 31;19:2227-2249. doi: 10.2147/PPA.S527922. eCollection 2025.

The impact of an AI-focused ethics education program on nursing students' ethical awareness, moral sensitivity, attitudes, and generative AI adoption intention: a quasi-experimental study.

BMC Nurs. 2025 Jul 1;24(1):720. doi: 10.1186/s12912-025-03458-2.

Ethical Challenges and Opportunities of AI in End-of-Life Palliative Care: Integrative Review.

Interact J Med Res. 2025 May 14;14:e73517. doi: 10.2196/73517.

ChatGPT's Agreement with the Recommendations from the 18th St. Gallen International Consensus Conference on the Treatment of Early Breast Cancer.

Cancers (Basel). 2024 Dec 13;16(24):4163. doi: 10.3390/cancers16244163.

Language discrepancies in the performance of generative artificial intelligence models: an examination of infectious disease queries in English and Arabic.

BMC Infect Dis. 2024 Aug 8;24(1):799. doi: 10.1186/s12879-024-09725-y.
