• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于艾滋病护理中心血管健康教育的CARDIO大语言模型的跨学科开发与优化:教程

Interdisciplinary Development and Fine-Tuning of CARDIO a LLM for Cardiovascular Health Education in HIV Care: A Tutorial.

作者信息

Rullo Ryan, Maatouk Ali, Huang Tinglin, Chen Jialin, Qiu Weikang, O'Connor Giselle, Womack Julie, Sadak Tatiana, Rodriguez Christine, de Jesus Espinosa Tania, Carneiro Pedro, Marshall Ami, Ying Rex, Ramos S Raquel

机构信息

School of Nursing, Yale University, 400 West Campus Drive, Orange, US.

Department of Computer Science, School of Engineering & Applied Science, Yale University, New Haven, US.

出版信息

J Med Internet Res. 2025 Aug 11. doi: 10.2196/77053.

DOI:10.2196/77053
PMID:40794856
Abstract

The integration of Artificial Intelligence in healthcare presents as a significant opportunity to revolutionize patient care. In the United States, an estimated 129 million people have at least one chronic illness, with 42% having two or more. Despite being largely preventable, the prevalence of chronic illness is expected to rise and impose significant economic burdens and financial toxicity on healthcare consumers. We leveraged an interdisciplinary team encompassing nursing, public health, and computer science to optimize health through prevention education for cardiovascular and metabolic comorbidities in persons living with HIV. In this tutorial, we describe the iterative, data-based development and evaluation of an intersectionality-informed large language model designed to support patient teaching in this population. First, we curated data by scraping publicly available, authoritative, evidence-based sources to capture a comprehensive dataset, supplemented by publicly available HIV forum content. Second, we benchmarked candidate large language models and generated a fine-tuning dataset using GPT-4 through multi-turn question-answer conversations, employing standardized metrics to assess baseline model performance. Third, we iteratively refined the selected model via Low-Rank Adaptation and reinforcement learning, integrating quantitative metrics with qualitative expert evaluations. Pre-existing LLM models demonstrated poor n-gram agreement, dissonance from model answers (Accuracy 4.16, Readability 4.63, Professionalism 4.58), and difficult readability (Kincaid 8.54, Jargon 4.44). After prompt adjustments and fine-tuning, preliminary results demonstrate the potential of a customized LLaMA-based LLM to provide personalized, culturally salient patient education. We present a data-based, step-by-step tutorial for interdisciplinary development of CARDIO, a specialized LLM, for cardiovascular health education in HIV care. Through comprehensive data curation and scraping, systematic benchmarking, and a dual-stage fine-tuning pipeline, CARDIO's performance improved markedly (Accuracy 5.0, Readability 4.98, Professionalism 4.98, Kincaid 7.17, Jargon 2.92). Although patient pilot testing remains forthcoming, our results demonstrate that targeted data curation, rigorous benchmarking, and iterative fine-tuning have provided a robust evaluation of the model's potential. By building an LLM tailored to cardiovascular health promotion and patient education, this work lays the foundation for innovative AI-driven strategies to manage comorbid conditions in people living with HIV.

摘要

人工智能在医疗保健领域的整合为彻底改变患者护理提供了重大机遇。在美国,估计有1.29亿人至少患有一种慢性病,其中42%的人患有两种或更多种慢性病。尽管慢性病在很大程度上是可以预防的,但预计其患病率仍将上升,并给医疗保健消费者带来巨大的经济负担和财务毒性。我们利用了一个跨学科团队,包括护理、公共卫生和计算机科学,通过对艾滋病毒感染者的心血管和代谢合并症进行预防教育来优化健康状况。在本教程中,我们描述了一个基于交叉性的大语言模型的迭代式、基于数据的开发和评估,该模型旨在支持对这一人群的患者教育。首先,我们通过抓取公开可用的、权威的、基于证据的来源来策划数据,以获取一个全面的数据集,并辅以公开可用的艾滋病毒论坛内容。其次,我们对候选大语言模型进行基准测试,并通过多轮问答对话使用GPT-4生成一个微调数据集,采用标准化指标来评估基线模型性能。第三,我们通过低秩适应和强化学习迭代地改进所选模型,将定量指标与定性专家评估相结合。现有的大语言模型表现出较差的n元语法一致性、与模型答案不一致(准确率4.16、可读性4.63、专业性4.58)以及可读性差(金凯德可读性指数8.54、行话4.44)。经过提示调整和微调后,初步结果表明基于定制的基于LLaMA的大语言模型有潜力提供个性化的、具有文化特色的患者教育。我们为专门用于艾滋病毒护理中心血管健康教育的CARDIO大语言模型的跨学科开发提供了一个基于数据的分步教程。通过全面的数据策划和抓取、系统的基准测试以及双阶段微调管道,CARDIO模型的性能有了显著提高(准确率5.0、可读性4.98、专业性4.98、金凯德可读性指数7.17、行话2.92)。尽管患者试点测试仍有待进行,但我们的结果表明,有针对性的数据策划、严格的基准测试和迭代微调为评估模型的潜力提供了有力支持。通过构建一个针对心血管健康促进和患者教育的大语言模型,这项工作为创新的人工智能驱动策略奠定了基础,以管理艾滋病毒感染者的合并症。

相似文献

1
Interdisciplinary Development and Fine-Tuning of CARDIO a LLM for Cardiovascular Health Education in HIV Care: A Tutorial.用于艾滋病护理中心血管健康教育的CARDIO大语言模型的跨学科开发与优化:教程
J Med Internet Res. 2025 Aug 11. doi: 10.2196/77053.
2
Menstrual Health Education Using a Specialized Large Language Model in India: Development and Evaluation Study of MenstLLaMA.在印度使用专门的大语言模型进行月经健康教育:MenstLLaMA的开发与评估研究
J Med Internet Res. 2025 Jul 16;27:e71977. doi: 10.2196/71977.
3
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
4
Artificial intelligence-simplified information to advance reproductive genetic literacy and health equity.人工智能简化信息以促进生殖遗传知识普及和健康公平。
Hum Reprod. 2025 Jul 22. doi: 10.1093/humrep/deaf135.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.
7
Evaluating and Enhancing Japanese Large Language Models for Genetic Counseling Support: Comparative Study of Domain Adaptation and the Development of an Expert-Evaluated Dataset.评估和增强用于遗传咨询支持的日本大语言模型:领域适应的比较研究与专家评估数据集的开发
JMIR Med Inform. 2025 Jan 16;13:e65047. doi: 10.2196/65047.
8
Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction.用于人格特质预测的大语言模型嵌入的心理测量评估
J Med Internet Res. 2025 Jul 8;27:e75347. doi: 10.2196/75347.
9
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.
10
Fine-tuning medical language models for enhanced long-contextual understanding and domain expertise.微调医学语言模型以增强长上下文理解和领域专业知识。
Quant Imaging Med Surg. 2025 Jun 6;15(6):5450-5462. doi: 10.21037/qims-2024-2655. Epub 2025 Jun 3.