文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

评估大语言模型在中医诊断和治疗建议中的作用。

Evaluating the role of large language models in traditional Chinese medicine diagnosis and treatment recommendations.

作者信息

Liu Yu, Yuan Yishan, Yan Keming, Li Yuanyuan, Sacca Valeria, Hodges Sierra, Cannistra Mattia, Jeong Pauline, Wu Jiani, Kong Jian

机构信息

Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA, USA.

Beijing University of Chinese Medicine, Beijing, China.

出版信息

NPJ Digit Med. 2025 Jul 21;8(1):466. doi: 10.1038/s41746-025-01845-2.


DOI:10.1038/s41746-025-01845-2
PMID:40691277
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12279949/
Abstract

Digital health technologies hold significant potential for reducing global healthcare disparities. Large language models (LLMs) offer new opportunities to enhance access to culturally specific healthcare, including traditional Chinese medicine (TCM). This study evaluated the diagnostic and treatment performance of seven publicly available LLMs using a real-world acupuncture case, comparing their outputs with three professional acupuncturists across five domains: Western diagnosis, TCM diagnosis, acupoint selection, needling technique, and herbal medicine. Twenty-eight expert evaluators from China, South Korea, and the United States assessed the responses using a multilingual survey. LLMs performed comparably to acupuncturists in Western diagnosis and showed variable performance in TCM-specific tasks. GPT-4o, Qwen 2.5 Max, and Doubao 1.5 Pro demonstrated the highest alignment with expert evaluations, particularly in TCM diagnosis and acupoint selection. These findings highlight the potential of general-purpose LLMs to support culturally grounded medical decision-making and reduce access barriers in TCM care systems.

摘要

数字健康技术在减少全球医疗保健差距方面具有巨大潜力。大语言模型(LLMs)为增加获得包括中医(TCM)在内的特定文化背景医疗保健服务提供了新机会。本研究使用一个真实世界的针灸病例评估了七个公开可用的大语言模型的诊断和治疗性能,并将它们的输出与三位专业针灸师在五个领域进行比较:西医诊断、中医诊断、穴位选择、针刺技术和草药。来自中国、韩国和美国的28名专家评估员使用多语言调查问卷对回答进行了评估。大语言模型在西医诊断方面的表现与针灸师相当,而在中医特定任务中表现各异。GPT-4o、文心一言2.5 Max和豆包1.5 Pro与专家评估的一致性最高,尤其是在中医诊断和穴位选择方面。这些发现凸显了通用大语言模型在支持基于文化的医疗决策和减少中医护理系统中的获取障碍方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb1/12279949/bce152b7e9c9/41746_2025_1845_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb1/12279949/6e4887ebdc7f/41746_2025_1845_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb1/12279949/bce152b7e9c9/41746_2025_1845_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb1/12279949/6e4887ebdc7f/41746_2025_1845_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb1/12279949/bce152b7e9c9/41746_2025_1845_Fig2_HTML.jpg

相似文献

[1]
Evaluating the role of large language models in traditional Chinese medicine diagnosis and treatment recommendations.

NPJ Digit Med. 2025-7-21

[2]
Evaluating and Improving Syndrome Differentiation Thinking Ability in Large Language Models: Method Development Study.

JMIR Med Inform. 2025-6-20

[3]
Application of Large Language Models in Stroke Rehabilitation Health Education: 2-Phase Study.

J Med Internet Res. 2025-7-22

[4]
Clinical Management of Wasp Stings Using Large Language Models: Cross-Sectional Evaluation Study.

J Med Internet Res. 2025-6-4

[5]
Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study.

J Med Internet Res. 2025-5-20

[6]
Clinical feasibility of AI Doctors: Evaluating the replacement potential of large language models in outpatient settings for central nervous system tumors.

Int J Med Inform. 2025-6-12

[7]
Lingdan: enhancing encoding of traditional Chinese medicine knowledge for clinical reasoning tasks with large language models.

J Am Med Inform Assoc. 2024-9-1

[8]
Relation extraction using large language models: a case study on acupuncture point locations.

J Am Med Inform Assoc. 2024-11-1

[9]
Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China's Rare Disease Catalog: Comparative Study.

J Med Internet Res. 2025-6-18

[10]
Application of Large Language Models in Traditional Chinese Medicine: A State-of-the-Art Review.

Am J Chin Med. 2025

本文引用的文献

[1]
Leveraging LLMs and wearables to provide personalized recommendations for enhancing student well-being and academic performance through a proof of concept.

Sci Rep. 2025-2-7

[2]
Standardized patient profile review using large language models for case adjudication in observational research.

NPJ Digit Med. 2025-1-9

[3]
Large Language Models in Traditional Chinese Medicine: A Scoping Review.

J Evid Based Med. 2025-3

[4]
ChatGPT-4 Omni Performance in USMLE Disciplines and Clinical Skills: Comparative Analysis.

JMIR Med Educ. 2024-11-6

[5]
Acupuncture vs Sham Acupuncture for Chronic Sciatica From Herniated Disk: A Randomized Clinical Trial.

JAMA Intern Med. 2024-12-1

[6]
Deciphering the mechanistic impact of acupuncture on the neurovascular unit in acute ischemic stroke: Insights from basic research in a narrative review.

Ageing Res Rev. 2024-11

[7]
Towards building multilingual language model for medicine.

Nat Commun. 2024-9-27

[8]
Enhancing Clinical Accuracy of Medical Chatbots with Large Language Models.

IEEE J Biomed Health Inform. 2024-9-27

[9]
Large Language Model Prompting Techniques for Advancement in Clinical Medicine.

J Clin Med. 2024-8-28

[10]
Long-Term Effects of Individualized Acupuncture for Chronic Neck Pain : A Randomized Controlled Trial.

Ann Intern Med. 2024-10

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索