文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

迈向医学多语言语言模型的构建。

Towards building multilingual language model for medicine.

机构信息

Shanghai Jiao Tong University, Shanghai, China.

Shanghai AI Laboratory, Shanghai, China.

出版信息

Nat Commun. 2024 Sep 27;15(1):8384. doi: 10.1038/s41467-024-52417-z.


DOI:10.1038/s41467-024-52417-z
PMID:39333468
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11436924/
Abstract

The development of open-source, multilingual medical language models can benefit a wide, linguistically diverse audience from different regions. To promote this domain, we present contributions from the following: First, we construct a multilingual medical corpus, containing approximately 25.5B tokens encompassing 6 main languages, termed as MMedC, enabling auto-regressive domain adaptation for general LLMs; Second, to monitor the development of multilingual medical LLMs, we propose a multilingual medical multi-choice question-answering benchmark with rationale, termed as MMedBench; Third, we have assessed a number of open-source large language models (LLMs) on our benchmark, along with those further auto-regressive trained on MMedC. Our final model, MMed-Llama 3, with only 8B parameters, achieves superior performance compared to all other open-source models on both MMedBench and English benchmarks, even rivaling GPT-4. In conclusion, in this work, We present a large-scale corpus, a benchmark and a series of models to support the development of multilingual medical LLMs.

摘要

开源、多语言医学语言模型的发展可以使来自不同地区的广泛的、语言多样化的受众受益。为了促进这一领域的发展,我们提出了以下贡献:首先,我们构建了一个多语言医学语料库,包含大约 255 亿个包含 6 种主要语言的令牌,称为 MMedC,能够实现通用大语言模型的自回归领域自适应;其次,为了监测多语言医学大语言模型的发展,我们提出了一个带有推理的多语言医学多项选择问答基准,称为 MMedBench;第三,我们在基准上评估了一些开源的大语言模型(LLMs),以及那些在 MMedC 上进一步自回归训练的模型。我们的最终模型 MMed-Llama 3 只有 80 亿个参数,在 MMedBench 和英语基准上的表现都优于所有其他开源模型,甚至可以与 GPT-4 相媲美。总之,在这项工作中,我们提出了一个大规模语料库、一个基准和一系列模型,以支持多语言医学大语言模型的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/66f8acd700d2/41467_2024_52417_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/4d6289a09496/41467_2024_52417_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/55ec2b909df6/41467_2024_52417_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/e157d43ba4b3/41467_2024_52417_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/f90dd0e989ac/41467_2024_52417_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/66f8acd700d2/41467_2024_52417_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/4d6289a09496/41467_2024_52417_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/55ec2b909df6/41467_2024_52417_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/e157d43ba4b3/41467_2024_52417_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/f90dd0e989ac/41467_2024_52417_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5375/11436924/66f8acd700d2/41467_2024_52417_Fig5_HTML.jpg

相似文献

[1]
Towards building multilingual language model for medicine.

Nat Commun. 2024-9-27

[2]
MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering.

Artif Intell Med. 2024-9

[3]
PMC-LLaMA: toward building open-source language models for medicine.

J Am Med Inform Assoc. 2024-9-1

[4]
PH-LLM: Public Health Large Language Models for Infoveillance.

medRxiv. 2025-2-10

[5]
Benchmarking Hook and Bait Urdu news dataset for domain-agnostic and multilingual fake news detection using large language models.

Sci Rep. 2025-5-3

[6]
Benchmarking large language models for biomedical natural language processing applications and recommendations.

Nat Commun. 2025-4-6

[7]
Speech translation for multilingual medical education leveraged by large language models.

Artif Intell Med. 2025-8

[8]
Multilingual feasibility of GPT-4o for automated Voice-to-Text CT and MRI report transcription.

Eur J Radiol. 2025-1

[9]
A dataset and benchmark for hospital course summarization with adapted large language models.

J Am Med Inform Assoc. 2025-3-1

[10]
GPT is an effective tool for multilingual psychological text analysis.

Proc Natl Acad Sci U S A. 2024-8-20

引用本文的文献

[1]
From large language models to multimodal AI: a scoping review on the potential of generative AI in medicine.

Biomed Eng Lett. 2025-8-22

[2]
Two stage large language model approach enhancing entity classification and relationship mapping in radiology reports.

Sci Rep. 2025-8-27

[3]
Evaluating the role of large language models in traditional Chinese medicine diagnosis and treatment recommendations.

NPJ Digit Med. 2025-7-21

[4]
Performance of large language models in the differential diagnosis of benign and malignant biliary stricture.

Front Oncol. 2025-7-3

[5]
Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China's Rare Disease Catalog: Comparative Study.

J Med Internet Res. 2025-6-18

[6]
Enhancing the Accuracy of Human Phenotype Ontology Identification: Comparative Evaluation of Multimodal Large Language Models.

J Med Internet Res. 2025-6-2

[7]
Application of AI Chatbot in Responding to Asynchronous Text-Based Messages From Patients With Cancer: Comparative Study.

J Med Internet Res. 2025-5-21

[8]
Natural Language Processing for Digital Health in the Era of Large Language Models.

Yearb Med Inform. 2024-8

[9]
Benchmarking of Large Language Models for the Dental Admission Test.

Health Data Sci. 2025-4-1

[10]
A two-step concept-based approach for enhanced interpretability and trust in skin lesion diagnosis.

Comput Struct Biotechnol J. 2025-2-20

本文引用的文献

[1]
PMC-LLaMA: toward building open-source language models for medicine.

J Am Med Inform Assoc. 2024-9-1

[2]
Almanac - Retrieval-Augmented Language Models for Clinical Medicine.

NEJM AI. 2024-2

[3]
Large language models propagate race-based medicine.

NPJ Digit Med. 2023-10-20

[4]
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge.

Cureus. 2023-6-24

[5]
Large language models encode clinical knowledge.

Nature. 2023-8

[6]
Foundation models for generalist medical artificial intelligence.

Nature. 2023-4

[7]
Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models.

PLOS Digit Health. 2023-2-9

[8]
Explainable artificial intelligence for mental health through transparency and interpretability for understandability.

NPJ Digit Med. 2023-1-18

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索