ChatDoctor：一种基于医学领域知识对大型语言模型Meta-AI（LLaMA）进行微调的医学聊天模型。

ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge.

作者信息

Li Yunxiang, Li Zihan, Zhang Kai, Dan Ruilong, Jiang Steve, Zhang You

机构信息

Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, USA.

Department of Computer Science, University of Illinois at Urbana-Champaign, Illinois, USA.

出版信息

Cureus. 2023 Jun 24;15(6):e40895. doi: 10.7759/cureus.40895. eCollection 2023 Jun.

DOI:10.7759/cureus.40895

PMID:37492832

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10364849/

Abstract

Objective The primary aim of this research was to address the limitations observed in the medical knowledge of prevalent large language models (LLMs) such as ChatGPT, by creating a specialized language model with enhanced accuracy in medical advice. Methods We achieved this by adapting and refining the large language model meta-AI (LLaMA) using a large dataset of 100,000 patient-doctor dialogues sourced from a widely used online medical consultation platform. These conversations were cleaned and anonymized to respect privacy concerns. In addition to the model refinement, we incorporated a self-directed information retrieval mechanism, allowing the model to access and utilize real-time information from online sources like Wikipedia and data from curated offline medical databases. Results The fine-tuning of the model with real-world patient-doctor interactions significantly improved the model's ability to understand patient needs and provide informed advice. By equipping the model with self-directed information retrieval from reliable online and offline sources, we observed substantial improvements in the accuracy of its responses. Conclusion Our proposed ChatDoctor, represents a significant advancement in medical LLMs, demonstrating a significant improvement in understanding patient inquiries and providing accurate advice. Given the high stakes and low error tolerance in the medical field, such enhancements in providing accurate and reliable information are not only beneficial but essential.

摘要

目的本研究的主要目的是通过创建一个在医疗建议方面具有更高准确性的专业语言模型，来解决诸如ChatGPT等流行大语言模型（LLMs）在医学知识方面所观察到的局限性。方法我们通过使用从一个广泛使用的在线医疗咨询平台获取的100,000个医患对话的大型数据集，对大语言模型meta-AI（LLaMA）进行调整和优化来实现这一目标。这些对话经过清理和匿名化处理，以尊重隐私问题。除了模型优化外，我们还纳入了一种自主信息检索机制，使模型能够访问和利用来自维基百科等在线来源的实时信息以及来自精心整理的离线医学数据库的数据。结果通过真实世界的医患互动对模型进行微调，显著提高了模型理解患者需求并提供明智建议的能力。通过为模型配备从可靠的在线和离线来源进行自主信息检索的功能，我们观察到其回答的准确性有了大幅提高。结论我们提出的ChatDoctor代表了医学大语言模型的重大进步，在理解患者询问和提供准确建议方面有显著改进。鉴于医学领域的高风险和低容错率，在提供准确可靠信息方面的这种改进不仅有益而且至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/10364849/255cfc8b9502/cureus-0015-00000040895-i01.jpg

相似文献

ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge.

Cureus. 2023 Jun 24;15(6):e40895. doi: 10.7759/cureus.40895. eCollection 2023 Jun.

EYE-Llama, an in-domain large language model for ophthalmology.

bioRxiv. 2024 Apr 29:2024.04.26.591355. doi: 10.1101/2024.04.26.591355.

Large Language Models for Therapy Recommendations Across 3 Clinical Specialties: Comparative Study.

J Med Internet Res. 2023 Oct 30;25:e49324. doi: 10.2196/49324.

A Reliable and Accessible Caregiving Language Model (CaLM) to Support Tools for Caregivers: Development and Evaluation Study.

JMIR Form Res. 2024 Jul 31;8:e54633. doi: 10.2196/54633.

Evaluation of Large language model performance on the Multi-Specialty Recruitment Assessment (MSRA) exam.

Comput Biol Med. 2024 Jan;168:107794. doi: 10.1016/j.compbiomed.2023.107794. Epub 2023 Nov 30.

Me-LLaMA: Foundation Large Language Models for Medical Applications.

Res Sq. 2024 May 22:rs.3.rs-4240043. doi: 10.21203/rs.3.rs-4240043/v1.

Distilling large language models for matching patients to clinical trials.

J Am Med Inform Assoc. 2024 Sep 1;31(9):1953-1963. doi: 10.1093/jamia/ocae073.

PMC-LLaMA: toward building open-source language models for medicine.

J Am Med Inform Assoc. 2024 Sep 1;31(9):1833-1843. doi: 10.1093/jamia/ocae045.

Evaluating Large Language Models in Extracting Cognitive Exam Dates and Scores.

medRxiv. 2024 Feb 13:2023.07.10.23292373. doi: 10.1101/2023.07.10.23292373.

BioInstruct: instruction tuning of large language models for biomedical natural language processing.

J Am Med Inform Assoc. 2024 Sep 1;31(9):1821-1832. doi: 10.1093/jamia/ocae122.

引用本文的文献

A Pipeline for Automating Emergency Medicine Documentation Using LLMs with Retrieval-Augmented Text Generation.

Appl Artif Intell. 2025 Jun 18;39(1):2519169. doi: 10.1080/08839514.2025.2519169. eCollection 2025.

Evaluation of large language models as a diagnostic tool for medical learners and clinicians using advanced prompting techniques.

PLoS One. 2025 Aug 1;20(8):e0325803. doi: 10.1371/journal.pone.0325803. eCollection 2025.

EYE-Llama, an in-domain large language model for ophthalmology.

iScience. 2025 Jun 23;28(7):112984. doi: 10.1016/j.isci.2025.112984. eCollection 2025 Jul 18.

Accuracy of ChatGPT-3.5, ChatGPT-4o, Copilot, Gemini, Claude, and Perplexity in advising on lumbosacral radicular pain against clinical practice guidelines: cross-sectional study.

Front Digit Health. 2025 Jun 27;7:1574287. doi: 10.3389/fdgth.2025.1574287. eCollection 2025.

Conversational health agents: a personalized large language model-powered agent framework.

JAMIA Open. 2025 Jul 6;8(4):ooaf067. doi: 10.1093/jamiaopen/ooaf067. eCollection 2025 Aug.

Large models in medical imaging: Advances and prospects.

Chin Med J (Engl). 2025 Jul 20;138(14):1647-1664. doi: 10.1097/CM9.0000000000003699. Epub 2025 Jun 20.

Large Language Model Architectures in Health Care: Scoping Review of Research Perspectives.

J Med Internet Res. 2025 Jun 19;27:e70315. doi: 10.2196/70315.

Knowledge Graph-Enhanced Deep Learning Model (H-SYSTEM) for Hypertensive Intracerebral Hemorrhage: Model Development and Validation.

J Med Internet Res. 2025 Jun 12;27:e66055. doi: 10.2196/66055.

BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning.

AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:149-158. eCollection 2025.

Retrieval augmented generation for large language models in healthcare: A systematic review.

PLOS Digit Health. 2025 Jun 11;4(6):e0000877. doi: 10.1371/journal.pdig.0000877. eCollection 2025 Jun.

本文引用的文献

Artificial intelligence hallucinations.

Crit Care. 2023 May 10;27(1):180. doi: 10.1186/s13054-023-04473-y.

Artificial hallucination: GPT on LSD?

Crit Care. 2023 Apr 18;27(1):148. doi: 10.1186/s13054-023-04425-6.

ChatGPT: Is this version good for healthcare and research?

Diabetes Metab Syndr. 2023 Apr;17(4):102744. doi: 10.1016/j.dsx.2023.102744. Epub 2023 Mar 15.

Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine.

N Engl J Med. 2023 Mar 30;388(13):1233-1239. doi: 10.1056/NEJMsr2214184.

How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.

JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.

Mpox in Children and Adolescents: Epidemiology, Clinical Features, Diagnosis, and Management.

Pediatrics. 2023 Feb 1;151(2). doi: 10.1542/peds.2022-060179.

Monkeypox.

N Engl J Med. 2022 Nov 10;387(19):1783-1793. doi: 10.1056/NEJMra2208860. Epub 2022 Oct 26.

Limits of trust in medical AI.

J Med Ethics. 2020 Jul;46(7):478-481. doi: 10.1136/medethics-2019-105935. Epub 2020 Mar 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

ChatDoctor：一种基于医学领域知识对大型语言模型Meta-AI（LLaMA）进行微调的医学聊天模型。

ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献