Hsu Hsing-Yu, Hsu Kai-Cheng, Hou Shih-Yen, Wu Ching-Lung, Hsieh Yow-Wen, Cheng Yih-Dih
Department of Pharmacy, China Medical University Hospital, Taichung, Taiwan.
Graduate Institute of Clinical Pharmacy, College of Medicine, National Taiwan University, Taipei, Taiwan.
JMIR Med Educ. 2023 Aug 21;9:e48433. doi: 10.2196/48433.
Since its release by OpenAI, ChatGPT has garnered significant attention for its strong capability in handling natural language tasks and its user-friendly interface.
A prospective analysis is required to evaluate the accuracy and appropriateness of medication consultation responses generated by ChatGPT.
A prospective cross-sectional study was conducted by the pharmacy department of a medical center in Taiwan. Two distinct sets of questions were tested: real-world medication consultation questions, collected retrospectively from February 1, 2023, to February 28, 2023, and common questions about interactions between traditional Chinese and Western medicines (drug-herb interactions). We used the conventional double-review mechanism: the appropriateness of each response from ChatGPT was assessed by 2 experienced pharmacists, and in the event of a discrepancy between their assessments, a third pharmacist made the final decision.
Of 293 real-world medication consultation questions, a random selection of 80 was used to evaluate ChatGPT's performance. ChatGPT exhibited a higher appropriateness rate when responding to public medication consultation questions than when responding to questions asked by health care providers in a hospital setting (31/51, 61% vs 20/51, 39%; P=.01).
The findings from this study suggest that ChatGPT could potentially be used for answering basic medication consultation questions. Our analysis of the erroneous information allowed us to identify potential medical risks associated with certain questions; this problem deserves our close attention.