Suppr超能文献

激励与责任:快速工程如何影响 ChatGPT-4 在放射科考试中的表现。

Encouragement vs. liability: How prompt engineering influences ChatGPT-4's radiology exam performance.

机构信息

University of Massachusetts Chan Medical School, Worcester, MA, United States of America.

Department of Radiology, University of Massachusetts Chan Medical School, Worcester, MA, United States of America.

出版信息

Clin Imaging. 2024 Nov;115:110276. doi: 10.1016/j.clinimag.2024.110276. Epub 2024 Sep 6.

Abstract

Large Language Models (LLM) like ChatGPT-4 hold significant promise in medical application, especially in the field of radiology. While previous studies have shown the promise of ChatGTP-4 in textual-based scenarios, its performance on image-based response remains suboptimal. This study investigates the impact of prompt engineering on ChatGPT-4's accuracy on the 2022 American College of Radiology In Training Test Questions for Diagnostic Radiology Residents that include textual and visual-based questions. Four personas were created, each with unique prompts, and evaluated using ChatGPT-4. Results indicate that encouraging prompts and those disclaiming responsibility led to higher overall accuracy (number of questions answered correctly) compared to other personas. Personas that threaten the LLM with legal action or mounting clinical responsibility were not only found to score less, but also refrain of answering questions at a higher rate. These findings highlight the importance of prompt context in optimizing LLM responses and the need for further research to integrate AI responsibly into medical practice.

摘要

大型语言模型(LLM),如 ChatGPT-4,在医学应用中具有巨大的潜力,尤其是在放射学领域。虽然之前的研究已经表明 ChatGPT-4 在基于文本的场景中具有很大的潜力,但其在基于图像的响应方面的性能仍不理想。本研究探讨了提示工程对 ChatGPT-4 在包括基于文本和基于图像的问题的 2022 年美国放射学学院住院医师培训测试问题中的准确性的影响。创建了四个角色,每个角色都有独特的提示,并使用 ChatGPT-4 进行评估。结果表明,与其他角色相比,鼓励提示和免责提示导致了更高的整体准确性(正确回答的问题数量)。那些用法律行动或临床责任威胁 LLM 的角色不仅得分较低,而且回答问题的比例也更高。这些发现强调了提示上下文在优化 LLM 响应中的重要性,以及需要进一步研究将人工智能负责任地整合到医学实践中的必要性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验