Department of Surgery, Townsville Hospital, Townsville, Queensland, Australia.
Faculty of Medicine and Health, Central Clinical School, The University of Sydney, Sydney, New South Wales, Australia.
ANZ J Surg. 2024 Mar;94(3):342-352. doi: 10.1111/ans.18736. Epub 2023 Oct 19.
Appendicitis is a common surgical condition that requires urgent medical attention. Recent advances in artificial intelligence and large language models, such as ChatGPT, have shown potential to support healthcare management and scientific research. This study aims to evaluate the accuracy and comprehensiveness of ChatGPT's knowledge of appendicitis management.
Six questions on appendicitis management were written by experienced RACS-qualified general surgeons to assess ChatGPT's ability to provide accurate information. The accuracy of ChatGPT's answers was assessed against current healthcare guidelines for appendicitis and by the subjective evaluation of two RACS-qualified general surgeons. ChatGPT was then asked to provide five high-level evidence references to support its responses.
ChatGPT provided clinically relevant information on appendicitis management, but it did so inconsistently and often gave only superficial information. ChatGPT also had difficulty generating relevant references: some were non-existent and others were incorrect.
ChatGPT has the potential to provide timely and comprehensible medical information on appendicitis management to laypersons. However, its inaccuracies and its production of non-existent or erroneous references pose a risk for researchers and clinicians, who may inadvertently rely on such information in their research or clinical practice. Clinicians should therefore exercise caution when using ChatGPT for these purposes.