医聊 ChatZH：一个用于中医咨询的调优大语言模型。

MedChatZH: A tuning LLM for traditional Chinese medicine consultations.

机构信息

Department of Computer Science and Technology, East China University of Science and Technology, Shanghai, 200237, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200240, China; Chongqing Artificial Intelligence Research Institute of Shanghai Jiao Tong University, 200240, China.

Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.

出版信息

Comput Biol Med. 2024 Apr;172:108290. doi: 10.1016/j.compbiomed.2024.108290. Epub 2024 Mar 13.

DOI:10.1016/j.compbiomed.2024.108290

PMID:38503097

Abstract

Generative Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, including Question-Answering (QA) and dialogue systems. However, most models are trained on English data and lack strong generalization in providing answers in Chinese. This limitation is especially evident in specialized domains like traditional Chinese medical QA, where performance suffers due to the absence of fine-tuning and high-quality datasets. To address this, we introduce MedChatZH, a dialogue model optimized for Chinese medical QA based on transformer decoder with LLaMA architecture. Continued pre-training on a curated corpus of Chinese medical books is followed by fine-tuning with a carefully selected medical instruction dataset, resulting in MedChatZH outperforming several Chinese dialogue baselines on a real-world medical dialogue dataset. Our model, code, and dataset are publicly available on GitHub (https://github.com/tyang816/MedChatZH) to encourage further research in traditional Chinese medicine and LLMs.

摘要

生成式大型语言模型 (LLM) 在各种自然语言处理任务中取得了重大成功，包括问答 (QA) 和对话系统。然而，大多数模型都是基于英语数据进行训练的，在提供中文答案方面缺乏强大的泛化能力。这种限制在传统中医 QA 等专业领域尤为明显，由于缺乏微调以及高质量数据集，其性能受到影响。为了解决这个问题，我们引入了 MedChatZH，这是一个基于 transformer 解码器和 LLaMA 架构的针对中文医学 QA 优化的对话模型。在经过精心挑选的医学指令数据集上进行微调之前，它还需要先在一个经过整理的中文医学书籍语料库上进行持续的预训练，这使得 MedChatZH 在真实世界的医学对话数据集上的表现优于几个中文对话基准。我们的模型、代码和数据集都可以在 GitHub 上获得 (https://github.com/tyang816/MedChatZH)，以鼓励对中医和 LLM 进行进一步研究。