Zhao Yi-Fei, Bove Allyn, Thompson David, Hill James, Xu Yi, Ren Yufan, Hassman Andrea, Zhou Leming, Wang Yanshan
Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA.
Stern School of Business, New York University, New York, NY.
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:644-653. eCollection 2025.
Low back pain (LBP) is a leading cause of disability globally. Following the onset of LBP and subsequent treatment, adequate patient education is crucial for improving functionality and long-term outcomes. Despite advancements in patient education strategies, significant gaps persist in delivering personalized, evidence-based information to patients with LBP. Recent advancements in large language models (LLMs) and generative artificial intelligence (GenAI) have demonstrated the potential to enhance patient education. However, their application and efficacy in delivering educational content to patients with LBP remain underexplored and warrant further investigation. In this study, we introduce a novel approach utilizing LLMs with Retrieval-Augmented Generation (RAG) and few-shot learning to generate tailored educational materials for patients with LBP. Physical therapists manually evaluated our model responses for redundancy, accuracy, and completeness using a Likert scale. In addition, the readability of the generated education materials is assessed using the Flesch Reading Ease score. The findings demonstrate that RAG-based LLMs outperform traditional LLMs, providing more accurate, complete, and readable patient education materials with less redundancy. Having said that, our analysis reveals that the generated materials are not yet ready for use in clinical practice. This study underscores the potential of AI-driven models utilizing RAG to improve patient education for LBP; however, significant challenges remain in ensuring the clinical relevance and granularity of content generated by these models.
腰痛(LBP)是全球致残的主要原因。在腰痛发作及后续治疗后,充分的患者教育对于改善功能和长期预后至关重要。尽管患者教育策略有所进步,但在为腰痛患者提供个性化的循证信息方面仍存在重大差距。大语言模型(LLMs)和生成式人工智能(GenAI)的最新进展已显示出增强患者教育的潜力。然而,它们在为腰痛患者提供教育内容方面的应用和效果仍未得到充分探索,值得进一步研究。在本研究中,我们引入了一种新颖的方法,利用带有检索增强生成(RAG)和少样本学习的大语言模型为腰痛患者生成量身定制的教育材料。物理治疗师使用李克特量表对我们模型的回答进行冗余性、准确性和完整性的人工评估。此外,使用弗莱什易读性分数评估所生成教育材料的可读性。研究结果表明,基于RAG的大语言模型优于传统大语言模型,能提供更准确、完整且可读性更强的患者教育材料,冗余度更低。话虽如此,我们的分析表明所生成的材料尚未准备好用于临床实践。本研究强调了利用RAG的人工智能驱动模型在改善腰痛患者教育方面的潜力;然而,在确保这些模型生成内容的临床相关性和详细程度方面仍存在重大挑战。