Department of Medicine, Medstar Union Memorial Hospital.
Department of Medicine, Medstar Franklin Square Medical Center.
Eur J Gastroenterol Hepatol. 2024 Sep 1;36(9):1109-1112. doi: 10.1097/MEG.0000000000002815. Epub 2024 Jul 8.
The USA has the highest age-standardized prevalence of inflammatory bowel disease (IBD). Both genetic and environmental factors have been implicated in IBD flares and multiple strategies are centered around avoiding dietary triggers to maintain remission. Chat-based artificial intelligence (CB-AI) has shown great potential in enhancing patient education in medicine. We evaluate the role of CB-AI in patient education on dietary management of IBD.
Six questions evaluating important concepts about the dietary management of IBD which then were posed to three CB-AI models - ChatGPT, BingChat, and YouChat three different times. All responses were graded for appropriateness and reliability by two physicians using dietary information from the Crohn's and Colitis Foundation. The responses were graded as reliably appropriate, reliably inappropriate, and unreliable. The expert assessment of the reviewing physicians was validated by the joint probability of agreement for two raters.
ChatGPT provided reliably appropriate responses to questions on dietary management of IBD more often than BingChat and YouChat. There were two questions that more than one CB-AI provided unreliable responses to. Each CB-AI provided examples within their responses, but the examples were not always appropriate. Whether the response was appropriate or not, CB-AIs mentioned consulting with an expert in the field. The inter-rater reliability was 88.9%.
CB-AIs have the potential to improve patient education and outcomes but studies evaluating their appropriateness for various health conditions are sparse. Our study showed that CB-AIs have the ability to provide appropriate answers to most questions regarding the dietary management of IBD.
美国拥有最高的炎症性肠病(IBD)年龄标准化患病率。遗传和环境因素都与 IBD 发作有关,多种策略都集中在避免饮食诱因以维持缓解。基于聊天的人工智能(CB-AI)在增强医学患者教育方面显示出巨大潜力。我们评估了 CB-AI 在 IBD 饮食管理患者教育中的作用。
提出了六个评估关于 IBD 饮食管理重要概念的问题,然后分三次向三个 CB-AI 模型——ChatGPT、BingChat 和 YouChat 提出。两位医生根据克罗恩病和结肠炎基金会的饮食信息,对所有回答的适宜性和可靠性进行了评分。回答被评为可靠合适、可靠不合适和不可靠。通过两位评分者的一致性联合概率验证了审查医生的专家评估。
ChatGPT 比 BingChat 和 YouChat 更频繁地提供关于 IBD 饮食管理的可靠合适的回答。有两个问题,不止一个 CB-AI 提供了不可靠的回答。每个 CB-AI 在其回答中都提供了示例,但这些示例并不总是合适的。无论回答是否合适,CB-AI 都提到咨询该领域的专家。评分者间的可靠性为 88.9%。
CB-AI 有可能改善患者教育和结果,但评估其对各种健康状况的适宜性的研究很少。我们的研究表明,CB-AI 有能力为大多数关于 IBD 饮食管理的问题提供合适的答案。