Gong Eun Jeong, Bang Chang Seok, Lee Jae Jun, Park Jonghyung, Kim Eunsil, Kim Subeen, Kimm Minjae, Choi Seoung-Ho
Department of Internal Medicine, Hallym University College of Medicine, Chuncheon, Republic of Korea.
Institute for Liver and Digestive Diseases, Hallym University, Chuncheon, Republic of Korea.
J Med Internet Res. 2024 Dec 20;26:e66648. doi: 10.2196/66648.
As health care continues to evolve with technological advancements, the integration of artificial intelligence into clinical practices has shown promising potential to enhance patient care and operational efficiency. Among the forefront of these innovations are large language models (LLMs), a subset of artificial intelligence designed to understand, generate, and interact with human language at an unprecedented scale.
This systematic review describes the role of LLMs in improving diagnostic accuracy, automating documentation, and advancing specialist education and patient engagement within the field of gastroenterology and gastrointestinal endoscopy.
Core databases including MEDLINE through PubMed, Embase, and Cochrane Central registry were searched using keywords related to LLMs (from inception to April 2024). Studies were included if they satisfied the following criteria: (1) any type of studies that investigated the potential role of LLMs in the field of gastrointestinal endoscopy or gastroenterology, (2) studies published in English, and (3) studies in full-text format. The exclusion criteria were as follows: (1) studies that did not report the potential role of LLMs in the field of gastrointestinal endoscopy or gastroenterology, (2) case reports and review papers, (3) ineligible research objects (eg, animals or basic research), and (4) insufficient data regarding the potential role of LLMs. Risk of Bias in Non-Randomized Studies-of Interventions was used to evaluate the quality of the identified studies.
Overall, 21 studies on the potential role of LLMs in gastrointestinal disorders were included in the systematic review, and narrative synthesis was done because of heterogeneity in the specified aims and methodology in each included study. The overall risk of bias was low in 5 studies and moderate in 16 studies. The ability of LLMs to spread general medical information, offer advice for consultations, generate procedure reports automatically, or draw conclusions about the presumptive diagnosis of complex medical illnesses was demonstrated by the systematic review. Despite promising benefits, such as increased efficiency and improved patient outcomes, challenges related to data privacy, accuracy, and interdisciplinary collaboration remain.
We highlight the importance of navigating these challenges to fully leverage LLMs in transforming gastrointestinal endoscopy practices.
PROSPERO 581772; https://www.crd.york.ac.uk/prospero/.
随着医疗保健随着技术进步不断发展,将人工智能整合到临床实践中已显示出在改善患者护理和运营效率方面的巨大潜力。这些创新的前沿领域包括大语言模型(LLMs),它是人工智能的一个子集,旨在以前所未有的规模理解、生成和与人类语言进行交互。
本系统评价描述了大语言模型在提高诊断准确性、实现文档自动化以及推进胃肠病学和胃肠内镜领域的专科教育与患者参与度方面的作用。
使用与大语言模型相关的关键词(从起始到2024年4月)检索包括通过PubMed的MEDLINE、Embase和Cochrane中央登记册在内的核心数据库。如果研究满足以下标准则纳入:(1)任何调查大语言模型在胃肠内镜或胃肠病学领域潜在作用的研究类型,(2)以英文发表的研究,(3)全文格式的研究。排除标准如下:(1)未报告大语言模型在胃肠内镜或胃肠病学领域潜在作用的研究,(2)病例报告和综述论文,(3)不符合条件的研究对象(如动物或基础研究),以及(4)关于大语言模型潜在作用的数据不足。采用干预性非随机研究的偏倚风险评估所纳入研究的质量。
总体而言,本系统评价纳入了21项关于大语言模型在胃肠疾病中潜在作用的研究,由于每项纳入研究的特定目的和方法存在异质性,因此进行了叙述性综合分析。5项研究的总体偏倚风险较低,16项研究的总体偏倚风险中等。该系统评价证明了大语言模型在传播一般医学信息、提供会诊建议、自动生成操作报告或对复杂疾病的推定诊断得出结论方面的能力。尽管有提高效率和改善患者结局等有前景的益处,但与数据隐私、准确性和跨学科合作相关的挑战仍然存在。
我们强调应对这些挑战对于在胃肠内镜实践中充分利用大语言模型的重要性。
PROSPERO 581772;https://www.crd.york.ac.uk/prospero/