• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过多模态学习提升胃肠病学:大语言模型聊天机器人在消化内镜检查中的作用

Enhancing gastroenterology with multimodal learning: the role of large language model chatbots in digestive endoscopy.

作者信息

Qin Yuanyuan, Chang Jianming, Li Li, Wu Mianhua

机构信息

First Clinical Medical College, Nanjing University of Chinese Medicine, Nanjing, China.

Jiangsu Collaborative Innovation Center of Traditional Chinese Medicine Prevention and Treatment of Tumor, Nanjing University of Chinese Medicine, Nanjing, China.

出版信息

Front Med (Lausanne). 2025 May 21;12:1583514. doi: 10.3389/fmed.2025.1583514. eCollection 2025.

DOI:10.3389/fmed.2025.1583514
PMID:40470039
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12133735/
Abstract

INTRODUCTION

Advancements in artificial intelligence (AI) and large language models (LLMs) have the potential to revolutionize digestive endoscopy by enhancing diagnostic accuracy, improving procedural efficiency, and supporting clinical decision-making. Traditional AI-assisted endoscopic systems often rely on single-modal image analysis, which lacks contextual understanding and adaptability to complex gastrointestinal (GI) conditions. Moreover, existing methods struggle with domain shifts, data heterogeneity, and interpretability, limiting their clinical applicability.

METHODS

To address these challenges, we propose a multimodal learning framework that integrates LLM-powered chatbots with endoscopic imaging and patient-specific medical data. Our approach employs self-supervised learning to extract clinically relevant patterns from heterogeneous sources, enabling real-time guidance and AI-assisted report generation. We introduce a domain-adaptive learning strategy to enhance model generalization across diverse patient populations and imaging conditions.

RESULTS AND DISCUSSION

Experimental results on multiple GI datasets demonstrate that our method significantly improves lesion detection, reduces diagnostic variability, and enhances physician-AI collaboration. This study highlights the potential of multimodal LLM-based systems in advancing gastroenterology by providing interpretable, context-aware, and adaptable AI support in digestive endoscopy.

摘要

引言

人工智能(AI)和大语言模型(LLMs)的进步有可能通过提高诊断准确性、改善操作效率和支持临床决策,给消化内镜检查带来变革。传统的人工智能辅助内镜系统通常依赖单模态图像分析,缺乏对上下文的理解以及对复杂胃肠道(GI)病症的适应性。此外,现有方法在领域转移、数据异质性和可解释性方面存在困难,限制了它们的临床适用性。

方法

为应对这些挑战,我们提出了一个多模态学习框架,该框架将基于大语言模型的聊天机器人与内镜成像和患者特定的医疗数据相结合。我们的方法采用自监督学习从异质来源中提取临床相关模式,实现实时指导和人工智能辅助报告生成。我们引入了一种领域自适应学习策略,以增强模型在不同患者群体和成像条件下的泛化能力。

结果与讨论

在多个胃肠道数据集上的实验结果表明,我们的方法显著提高了病变检测能力,减少了诊断变异性,并增强了医生与人工智能的协作。这项研究强调了基于多模态大语言模型的系统在消化内镜检查中提供可解释、上下文感知和适应性人工智能支持,从而推动胃肠病学发展方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/df0d381cf15b/fmed-12-1583514-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/4bda096ac75d/fmed-12-1583514-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/e46984d8cf87/fmed-12-1583514-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/e836104b1680/fmed-12-1583514-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/16189d5c9e77/fmed-12-1583514-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/f1fba0930a5a/fmed-12-1583514-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/91eb9632e546/fmed-12-1583514-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/1efe397b9914/fmed-12-1583514-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/859e6c16bd46/fmed-12-1583514-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/df0d381cf15b/fmed-12-1583514-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/4bda096ac75d/fmed-12-1583514-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/e46984d8cf87/fmed-12-1583514-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/e836104b1680/fmed-12-1583514-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/16189d5c9e77/fmed-12-1583514-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/f1fba0930a5a/fmed-12-1583514-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/91eb9632e546/fmed-12-1583514-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/1efe397b9914/fmed-12-1583514-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/859e6c16bd46/fmed-12-1583514-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174b/12133735/df0d381cf15b/fmed-12-1583514-g0009.jpg

相似文献

1
Enhancing gastroenterology with multimodal learning: the role of large language model chatbots in digestive endoscopy.通过多模态学习提升胃肠病学:大语言模型聊天机器人在消化内镜检查中的作用
Front Med (Lausanne). 2025 May 21;12:1583514. doi: 10.3389/fmed.2025.1583514. eCollection 2025.
2
Consensus statements on the current landscape of artificial intelligence applications in endoscopy, addressing roadblocks, and advancing artificial intelligence in gastroenterology.关于人工智能在内窥镜检查中的当前应用情况、解决障碍以及推动胃肠病学领域人工智能发展的共识声明。
Gastrointest Endosc. 2025 Jan;101(1):2-9.e1. doi: 10.1016/j.gie.2023.12.003. Epub 2024 Apr 17.
3
Application of artificial intelligence in the diagnosis of malignant digestive tract tumors: focusing on opportunities and challenges in endoscopy and pathology.人工智能在恶性消化道肿瘤诊断中的应用:聚焦于内镜检查与病理学中的机遇与挑战
J Transl Med. 2025 Apr 9;23(1):412. doi: 10.1186/s12967-025-06428-z.
4
Applied machine learning in intelligent systems: knowledge graph-enhanced ophthalmic contrastive learning with "clinical profile" prompts.智能系统中的应用机器学习:通过“临床概况”提示进行知识图谱增强的眼科对比学习
Front Artif Intell. 2025 Mar 12;8:1527010. doi: 10.3389/frai.2025.1527010. eCollection 2025.
5
Large Language Models in Gastroenterology: Systematic Review.胃肠病学中的大语言模型:系统评价
J Med Internet Res. 2024 Dec 20;26:e66648. doi: 10.2196/66648.
6
Utilizing large language models for gastroenterology research: a conceptual framework.利用大语言模型进行胃肠病学研究:一个概念框架。
Therap Adv Gastroenterol. 2025 Apr 1;18:17562848251328577. doi: 10.1177/17562848251328577. eCollection 2025.
7
Generative Large Language Model-Powered Conversational AI App for Personalized Risk Assessment: Case Study in COVID-19.用于个性化风险评估的生成式大语言模型驱动的对话式人工智能应用程序:COVID-19案例研究
JMIR AI. 2025 Mar 27;4:e67363. doi: 10.2196/67363.
8
Proceedings from the First Global Artificial Intelligence in Gastroenterology and Endoscopy Summit.第一届全球人工智能消化内镜峰会会议录。
Gastrointest Endosc. 2020 Oct;92(4):938-945.e1. doi: 10.1016/j.gie.2020.04.044. Epub 2020 Apr 25.
9
Evolving role of artificial intelligence in gastrointestinal endoscopy.人工智能在胃肠内镜中的不断发展的作用。
World J Gastroenterol. 2020 Dec 14;26(46):7287-7298. doi: 10.3748/wjg.v26.i46.7287.
10
Comparative bibliometric analysis of artificial intelligence-assisted polyp diagnosis and AI-assisted digestive endoscopy: trends and growth in AI gastroenterology (2003-2023).人工智能辅助息肉诊断与人工智能辅助消化内镜检查的比较文献计量分析:人工智能胃肠病学的趋势与发展(2003 - 2023年)
Front Med (Lausanne). 2024 Sep 18;11:1438979. doi: 10.3389/fmed.2024.1438979. eCollection 2024.

引用本文的文献

1
Large language models and the future of gastroenterology: dissecting the biopolitics of data in a global health ecosystem.大语言模型与胃肠病学的未来:剖析全球卫生生态系统中数据的生物政治学
Front Med (Lausanne). 2025 Aug 20;12:1668365. doi: 10.3389/fmed.2025.1668365. eCollection 2025.

本文引用的文献

1
Water-assisted colonoscopy in inflammatory bowel diseases: From technical implications to diagnostic and therapeutic potentials.炎症性肠病中的水辅助结肠镜检查:从技术影响到诊断和治疗潜力
World J Gastrointest Endosc. 2024 Dec 16;16(12):647-660. doi: 10.4253/wjge.v16.i12.647.
2
Conversational LLM Chatbot ChatGPT-4 for Colonoscopy Boston Bowel Preparation Scoring: An Artificial Intelligence-to-Head Concordance Analysis.用于结肠镜检查波士顿肠道准备评分的对话式大语言模型聊天机器人ChatGPT-4:人工智能与人类的一致性分析。
Diagnostics (Basel). 2024 Nov 13;14(22):2537. doi: 10.3390/diagnostics14222537.
3
Self-Supervised Multimodal Learning: A Survey.
自监督多模态学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2025 Jul;47(7):5299-5318. doi: 10.1109/TPAMI.2024.3429301.
4
Multimodal learning with graphs.基于图的多模态学习。
Nat Mach Intell. 2023 Apr;5(4):340-350. doi: 10.1038/s42256-023-00624-6. Epub 2023 Apr 3.
5
Association between triglyceride-glucose index and in-hospital mortality in critically ill patients with sepsis: analysis of the MIMIC-IV database.脓毒症危重症患者甘油三酯-葡萄糖指数与院内死亡率的关系:对 MIMIC-IV 数据库的分析。
Cardiovasc Diabetol. 2023 Nov 8;22(1):307. doi: 10.1186/s12933-023-02041-w.
6
Dictionary learning for integrative, multimodal and scalable single-cell analysis.基于字典学习的综合、多模态和可扩展的单细胞分析。
Nat Biotechnol. 2024 Feb;42(2):293-304. doi: 10.1038/s41587-023-01767-y. Epub 2023 May 25.
7
Multimodal Learning With Transformers: A Survey.基于Transformer的多模态学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12113-12132. doi: 10.1109/TPAMI.2023.3275156. Epub 2023 Sep 5.
8
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features.通过脑-视觉-语言特征的多模态学习解码视觉神经表示。
IEEE Trans Pattern Anal Mach Intell. 2023 Sep;45(9):10760-10777. doi: 10.1109/TPAMI.2023.3263181. Epub 2023 Aug 7.
9
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation.GCNet:用于对话中不完全多模态学习的图补全网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8419-8432. doi: 10.1109/TPAMI.2023.3234553. Epub 2023 Jun 5.
10
An Extensive Data Processing Pipeline for MIMIC-IV.用于MIMIC-IV的广泛数据处理管道。
Proc Mach Learn Res. 2022 Nov;193:311-325.