Suppr超能文献

通过生成式预训练变换器-4检验与炎症性肠病相关营养问题回答的准确性和可重复性。

Examining the Accuracy and Reproducibility of Responses to Nutrition Questions Related to Inflammatory Bowel Disease by Generative Pre-trained Transformer-4.

作者信息

Samaan Jamil S, Issokson Kelly, Feldman Erin, Fasulo Christina, Rajeev Nithya, Ng Wee Han, Hollander Barbara, Yeo Yee Hui, Vasiliauskas Eric

机构信息

Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA.

Keck School of Medicine of USC, Los Angeles, CA, USA.

出版信息

Crohns Colitis 360. 2025 Feb 19;7(1):otae077. doi: 10.1093/crocol/otae077. eCollection 2025 Jan.

Abstract

BACKGROUND

Generative pre-trained transformer-4 (GPT-4) is a large language model (LLM) trained on a vast corpus of data, including the medical literature. Nutrition plays an important role in managing inflammatory bowel disease (IBD), with an unmet need for nutrition-related patient education resources. This study examines the accuracy, comprehensiveness, and reproducibility of responses by GPT-4 to patient nutrition questions related to IBD.

METHODS

Questions were obtained from adult IBD clinic visits, Facebook, and Reddit. Two IBD-focused registered dieticians independently graded the accuracy and reproducibility of GPT-4's responses while a third senior IBD-focused registered dietitian arbitrated. Each question was inputted twice into the model.

RESULTS

88 questions were selected. The model correctly responded to 73/88 questions (83.0%), with 61 (69.0%) graded as comprehensive. 15/88 (17%) responses were graded as mixed with correct and incorrect/outdated data. The model comprehensively responded to 10 (62.5%) questions related to "Nutrition and diet needs for surgery," 12 (92.3%) "Tube feeding and parenteral nutrition," 11 (64.7%) "General diet questions," 10 (50%) "Diet for reducing symptoms/inflammation," and 18 (81.8%) "Micronutrients/supplementation needs." The model provided reproducible responses to 81/88 (92.0%) questions.

CONCLUSIONS

GPT-4 comprehensively answered most questions, demonstrating the promising potential of LLMs as supplementary tools for IBD patients seeking nutrition-related information. However, 17% of responses contained incorrect information, highlighting the need for continuous refinement prior to incorporation into clinical practice. Future studies should emphasize leveraging LLMs to enhance patient outcomes and promoting patient and healthcare professional proficiency in using LLMs to maximize their efficacy.

摘要

背景

生成式预训练变换器4(GPT-4)是一种基于大量数据(包括医学文献)训练的大语言模型(LLM)。营养在炎症性肠病(IBD)的管理中起着重要作用,但对营养相关患者教育资源的需求尚未得到满足。本研究考察了GPT-4对与IBD相关的患者营养问题的回答的准确性、全面性和可重复性。

方法

问题来自成人IBD门诊、脸书和红迪网。两名专注于IBD的注册营养师独立对GPT-4的回答的准确性和可重复性进行评分,第三名资深的专注于IBD的注册营养师进行仲裁。每个问题都输入模型两次。

结果

共选择了88个问题。该模型正确回答了73/88个问题(83.0%),其中61个(69.0%)被评为全面。15/88(17%)的回答被评为正确与不正确/过时数据混合。该模型全面回答了10个(62.5%)与“手术的营养和饮食需求”相关的问题、12个(92.3%)“管饲和肠外营养”问题、11个(64.7%)“一般饮食问题”、10个(50%)“减轻症状/炎症的饮食”以及18个(81.8%)“微量营养素/补充需求”问题。该模型对81/88(92.0%)的问题提供了可重复回答。

结论

GPT-4全面回答了大多数问题,表明大语言模型作为寻求营养相关信息的IBD患者的辅助工具具有广阔的潜力。然而,17%的回答包含错误信息,突出了在纳入临床实践之前持续改进的必要性。未来的研究应强调利用大语言模型改善患者预后,并提高患者和医疗保健专业人员使用大语言模型以最大化其疗效的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efde/11897593/eeabd22bfc3c/otae077_fig2.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验