Robinson Alexander, Aggarwal Shaurya
General Surgery, Mid and South Essex NHS (National Health Service) Foundation Trust, Chelmsford, GBR.
Cureus. 2023 Jun 17;15(6):e40546. doi: 10.7759/cureus.40546. eCollection 2023 Jun.
ChatGPT (Chatbot Generative Pre-Trained Transformer) is an artificial intelligence with several potential applications in the field of medicine. As a large language model, it is particularly good at generating text. This study investigates the use of ChatGPT in constructing operation notes for laparoscopic appendicectomy, one of the most common surgical procedures in the UK. We prompted ChatGPT-4, the latest generation of ChatGPT, to produce operation notes for laparoscopic appendicectomy, which were then evaluated against 'Getting It Right First Time' (GIRFT) recommendations. GIRFT is an organisation that has collaborated with the National Health Service (NHS) to improve surgical documentation guidelines. Excluding certain items documented elsewhere in patient records, the generated notes were assessed against 30 key points in GIRFT recommendations. This process was repeated three times to obtain an average score. Our results showed that ChatGPT generated operation notes in seconds, with an average coverage of 78.8% (23.66 out of 30 points) of the GIRFT guidelines, surpassing average compliance with similar guidelines from the Royal College of Surgeons (RCS). However, the quality of ChatGPT's output was found to be dependent on the quality of the prompt, highlighting the need for verification of the generated content. Additionally, secure integration with electronic health records is required before ChatGPT can be adopted into the NHS.
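The scoring arithmetic described above (marking each generated note against 30 GIRFT points, then averaging over three repetitions) can be sketched as follows. This is a minimal illustration, not the authors' method: the function name `coverage_percent` and the per-run scores are hypothetical, since the abstract reports only the averaged result.

```python
def coverage_percent(scores, total_points=30):
    """Return (mean score, mean coverage in percent) across repeated runs.

    scores: points awarded to each generated operation note.
    total_points: number of checklist items assessed (30 in the abstract).
    """
    if not scores:
        raise ValueError("at least one run is required")
    mean_score = sum(scores) / len(scores)
    return mean_score, 100.0 * mean_score / total_points


# Hypothetical per-run scores for illustration only; the abstract does not
# report the individual scores of the three runs.
example_scores = [23, 24, 24]
mean_score, pct = coverage_percent(example_scores)
print(f"mean score: {mean_score:.2f} / 30, coverage: {pct:.1f}%")
```

Averaging the raw scores before converting to a percentage gives the same result as averaging the per-run percentages, because every run is marked against the same 30 points.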