Accuracy of Information given by ChatGPT for Patients with Inflammatory Bowel Disease in Relation to ECCO Guidelines.

Affiliations

Department of Medicine, Division of Gastroenterology, Mater Dei Hospital, Msida, Malta.

Department of Gastroenterology, Barts Health NHS Trust, London, UK.

Publication information

J Crohns Colitis. 2024 Aug 14;18(8):1215-1221. doi: 10.1093/ecco-jcc/jjae040.

DOI: 10.1093/ecco-jcc/jjae040
PMID: 38520394
Abstract

BACKGROUND

As acceptance of artificial intelligence [AI] platforms increases, more patients will consider these tools as sources of information. The ChatGPT architecture uses a neural network to process natural language, generating responses based on the context of the input text. The accuracy and completeness of ChatGPT3.5 in the context of inflammatory bowel disease [IBD] remain unclear.

METHODS

In this prospective study, 38 questions worded by IBD patients were entered into ChatGPT3.5. The following topics were covered: [1] Crohn's disease [CD], ulcerative colitis [UC], and malignancy; [2] maternal medicine; [3] infection and vaccination; and [4] complementary medicine. Fourteen expert gastroenterologists assessed the responses given by ChatGPT against the relevant ECCO guidelines for accuracy [scored from 1-completely incorrect to 5-completely correct] and completeness [3-point Likert scale; range 1-incomplete to 3-complete].
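The abstract does not say how the raters' scores were tabulated, so the following is only a minimal sketch of how per-question ratings on these scales could be summarized with the median, IQR, mean, and SD. The `ratings` dictionary, the question ids, and the `summarize` helper are hypothetical, and the numbers are invented for illustration; they are not the study's data.

```python
from statistics import mean, median, quantiles, stdev

# Hypothetical accuracy ratings (1-5), one list of rater scores per question.
# Invented values for illustration only; NOT data from the study.
ratings = {
    "q1": [5, 4, 4, 5, 3],
    "q2": [2, 3, 4, 3, 2],
    "q3": [4, 4, 5, 4, 4],
    "q4": [3, 5, 4, 4, 2],
}


def summarize(scores):
    """Return median, interquartile range, mean, and sample SD of one question's scores."""
    # The "inclusive" method treats the scores as the full set of observations;
    # other quartile conventions can give slightly different IQRs.
    q1, _, q3 = quantiles(scores, n=4, method="inclusive")
    return {
        "median": median(scores),
        "iqr": q3 - q1,
        "mean": mean(scores),
        "sd": stdev(scores),
    }


summaries = {question: summarize(scores) for question, scores in ratings.items()}

# Share of questions whose median accuracy score is at least 4.
share_high = sum(s["median"] >= 4 for s in summaries.values()) / len(summaries)

for question, s in summaries.items():
    print(question, s)
print("share with median >= 4:", share_high)
```

Ordinal ratings like these are usually reported with the median and IQR; the mean and SD are shown here only because the abstract reports them as well.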

RESULTS

In terms of accuracy, most replies [84.2%] had a median score of ≥4 [interquartile range (IQR): 2], and the mean score was 3.87 [SD: ±0.6]. For completeness, 34.2% of the replies had a median score of 3 and 55.3% had a median score of between 2 and <3; overall, the mean completeness rating was 2.24 [SD: ±0.4, median: 2, IQR: 1]. Although groups 3 and 4 had higher means for both accuracy and completeness, there was no significant scoring variation between the four question groups [Kruskal-Wallis test p > 0.05]. However, statistical analysis across the individual questions revealed a significant difference for both accuracy [p < 0.001] and completeness [p < 0.001]. The questions rated highest for both accuracy and completeness were related to smoking, while the lowest-rated questions were related to screening for malignancy and to vaccinations, especially in the context of immunosuppression and family planning.
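For readers unfamiliar with the Kruskal-Wallis test named above, here is a minimal standard-library sketch of its H statistic with the usual tie correction. The abstract does not describe the software or the exact procedure the authors used, so the `kruskal_wallis_h` function and the `group_scores` values are hypothetical illustrations, not a reproduction of the study's analysis.

```python
from collections import Counter


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic (tie-corrected) for a list of samples."""
    pooled = sorted(x for g in groups for x in g)
    n = len(pooled)

    # Assign each distinct value the average of the rank positions it occupies.
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j

    # H = 12 / (N (N + 1)) * sum(R_g^2 / n_g) - 3 (N + 1)
    rank_term = sum(sum(rank_of[x] for x in g) ** 2 / len(g) for g in groups)
    h = 12 / (n * (n + 1)) * rank_term - 3 * (n + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (N^3 - N).
    correction = 1 - sum(t**3 - t for t in Counter(pooled).values()) / (n**3 - n)
    if correction == 0:
        raise ValueError("all observations are identical; H is undefined")
    return h / correction


# Hypothetical per-question median scores in four topic groups (invented values).
group_scores = [
    [4, 4, 3.5, 5, 4],
    [3, 4, 4, 3.5],
    [4, 5, 4.5, 4],
    [5, 4, 4.5],
]
print("H =", kruskal_wallis_h(group_scores))
```

With four groups, H is referred to a chi-square distribution with 3 degrees of freedom, for which the 0.05 critical value is 7.815; an H below that value corresponds to p > 0.05. This approximation is less reliable when groups are very small.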

CONCLUSION

This is the first study to demonstrate the capability of an AI-based system to provide accurate and comprehensive answers to real-world patient queries in IBD. AI systems may serve as a useful adjunct for patients, in addition to standard of care in clinics and validated patient information resources. However, responses in specialist areas may deviate from evidence-based guidance, and the replies need to give firmer advice.
