Li Caixia, Zhao Yina, Bai Yang, Zhao Baoquan, Tola Yetunde Oluwafunmilayo, Chan Carmen Wh, Zhang Meifen, Fu Xia
The Department of Nursing, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen, China.
The School of Nursing, Sun Yat-sen University, Guangzhou, China.
J Med Internet Res. 2025 Apr 16;27:e70535. doi: 10.2196/70535.
Chronic diseases are a major global health burden, accounting for nearly three-quarters of the deaths worldwide. Large language models (LLMs) are advanced artificial intelligence systems with transformative potential to optimize chronic disease management; however, robust evidence is lacking.
This review aims to synthesize evidence on the feasibility, opportunities, and challenges of LLMs across the disease management spectrum, from prevention to screening, diagnosis, treatment, and long-term care.
Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analysis) guidelines, 11 databases (Cochrane Central Register of Controlled Trials, CINAHL, Embase, IEEE Xplore, MEDLINE via Ovid, ProQuest Health & Medicine Collection, ScienceDirect, Scopus, Web of Science Core Collection, China National Knowledge Internet, and SinoMed) were searched on April 17, 2024. Intervention and simulation studies that examined LLMs in the management of chronic diseases were included. The methodological quality of the included studies was evaluated using a rating rubric designed for simulation-based research and the risk of bias in nonrandomized studies of interventions tool for quasi-experimental studies. Narrative analysis with descriptive figures was used to synthesize the study findings. Random-effects meta-analyses were conducted to assess the pooled effect estimates of the feasibility of LLMs in chronic disease management.
A total of 20 studies examined general-purpose (n=17) and retrieval-augmented generation-enhanced LLMs (n=3) for the management of chronic diseases, including cancer, cardiovascular diseases, and metabolic disorders. LLMs demonstrated feasibility across the chronic disease management spectrum by generating relevant, comprehensible, and accurate health recommendations (pooled accurate rate 71%, 95% CI 0.59-0.83; I=88.32%) with retrieval-augmented generation-enhanced LLMs having higher accuracy rates compared to general-purpose LLMs (odds ratio 2.89, 95% CI 1.83-4.58; I=54.45%). LLMs facilitated equitable information access; increased patient awareness regarding ailments, preventive measures, and treatment options; and promoted self-management behaviors in lifestyle modification and symptom coping. Additionally, LLMs facilitate compassionate emotional support, social connections, and health care resources to improve the health outcomes of chronic diseases. However, LLMs face challenges in addressing privacy, language, and cultural issues; undertaking advanced tasks, including diagnosis, medication, and comorbidity management; and generating personalized regimens with real-time adjustments and multiple modalities.
LLMs have demonstrated the potential to transform chronic disease management at the individual, social, and health care levels; however, their direct application in clinical settings is still in its infancy. A multifaceted approach that incorporates robust data security, domain-specific model fine-tuning, multimodal data integration, and wearables is crucial for the evolution of LLMs into invaluable adjuncts for health care professionals to transform chronic disease management.
PROSPERO CRD42024545412; https://www.crd.york.ac.uk/PROSPERO/view/CRD42024545412.
慢性病是全球主要的健康负担,占全球死亡人数的近四分之三。大语言模型(LLMs)是具有变革潜力的先进人工智能系统,可优化慢性病管理;然而,目前仍缺乏有力证据。
本综述旨在综合关于大语言模型在从预防到筛查、诊断、治疗和长期护理的疾病管理全流程中的可行性、机遇和挑战的证据。
遵循PRISMA(系统评价和Meta分析的首选报告项目)指南,于2024年4月17日检索了11个数据库(Cochrane对照试验中央注册库、CINAHL、Embase、IEEE Xplore、通过Ovid检索的MEDLINE、ProQuest健康与医学数据库、ScienceDirect、Scopus、科学引文索引核心合集、中国知网和中国生物医学文献数据库)。纳入了在慢性病管理中研究大语言模型的干预和模拟研究。使用为基于模拟的研究设计的评分标准以及准实验研究的非随机干预研究中的偏倚风险工具评估纳入研究的方法学质量。采用带有描述性图表的叙述性分析来综合研究结果。进行随机效应Meta分析以评估大语言模型在慢性病管理中的可行性的合并效应估计值。
共有20项研究考察了通用大语言模型(n = 17)和检索增强生成强化大语言模型(n = 3)在慢性病管理中的应用,这些慢性病包括癌症、心血管疾病和代谢紊乱。大语言模型通过生成相关、易懂且准确的健康建议,在慢性病管理全流程中表现出可行性(合并准确率71%,95%置信区间0.59 - 0.83;I² = 88.32%),检索增强生成强化大语言模型的准确率高于通用大语言模型(优势比2.89,95%置信区间1.83 - 4.58;I² = 54.45%)。大语言模型促进了信息的公平获取;提高了患者对疾病、预防措施和治疗选择的认识;并促进了生活方式改变和症状应对方面的自我管理行为。此外,大语言模型有助于提供富有同情心的情感支持、社交联系和医疗保健资源,以改善慢性病的健康结局。然而,大语言模型在解决隐私、语言和文化问题;执行包括诊断、用药和合并症管理在内的高级任务;以及生成具有实时调整和多种模式的个性化治疗方案方面面临挑战。
大语言模型已展现出在个体、社会和医疗保健层面改变慢性病管理的潜力;然而,它们在临床环境中的直接应用仍处于起步阶段。一种包含强大数据安全、特定领域模型微调、多模态数据整合和可穿戴设备的多方面方法对于大语言模型演变为医疗保健专业人员的宝贵辅助工具以改变慢性病管理至关重要。
PROSPERO CRD42024545412;https://www.crd.york.ac.uk/PROSPERO/view/CRD42024545412 。