Application of large language models in disease diagnosis and treatment.

Authors

Yang Xintian, Li Tongxin, Su Qin, Liu Yaling, Kang Chenxi, Lyu Yong, Zhao Lina, Nie Yongzhan, Pan Yanglin

Affiliations

State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, Shaanxi 710032, China.

Department of Radiotherapy, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi 710032, China.

Publication information

Chin Med J (Engl). 2025 Jan 20;138(2):130-142. doi: 10.1097/CM9.0000000000003456. Epub 2024 Dec 26.

DOI: 10.1097/CM9.0000000000003456
PMID: 39722188
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11745858/
Abstract

Large language models (LLMs) such as ChatGPT, Claude, Llama, and Qwen are emerging as transformative technologies for the diagnosis and treatment of various diseases. With their exceptional long-context reasoning capabilities, LLMs are proficient in clinically relevant tasks, particularly in medical text analysis and interactive dialogue. They can enhance diagnostic accuracy by processing vast amounts of patient data and medical literature and have demonstrated their utility in diagnosing common diseases and facilitating the identification of rare diseases by recognizing subtle patterns in symptoms and test results. Building on their image-recognition abilities, multimodal LLMs (MLLMs) show promising potential for diagnosis based on radiography, chest computed tomography (CT), electrocardiography (ECG), and common pathological images. These models can also assist in treatment planning by suggesting evidence-based interventions and improving clinical decision support systems through integrated analysis of patient records. Despite these promising developments, significant challenges persist regarding the use of LLMs in medicine, including concerns regarding algorithmic bias, the potential for hallucinations, and the need for rigorous clinical validation. Ethical considerations also underscore the importance of maintaining the function of supervision in clinical practice. This paper highlights the rapid advancements in research on the diagnostic and therapeutic applications of LLMs across different medical disciplines and emphasizes the importance of policymaking, ethical supervision, and multidisciplinary collaboration in promoting more effective and safer clinical applications of LLMs. Future directions include the integration of proprietary clinical knowledge, the investigation of open-source and customized models, and the evaluation of real-time effects in clinical diagnosis and treatment practices.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cac7/11745858/6b0fe1b3515d/cm9-138-130-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cac7/11745858/407293ba59b1/cm9-138-130-g002.jpg

Similar articles

1
Application of large language models in disease diagnosis and treatment.
Chin Med J (Engl). 2025 Jan 20;138(2):130-142. doi: 10.1097/CM9.0000000000003456. Epub 2024 Dec 26.
2
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5th edition.
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
3
Enhancing the Accuracy of Human Phenotype Ontology Identification: Comparative Evaluation of Multimodal Large Language Models.
J Med Internet Res. 2025 Jun 2;27:e73233. doi: 10.2196/73233.
4
Large Language Models in Medicine: Applications, Challenges, and Future Directions.
Int J Med Sci. 2025 May 31;22(11):2792-2801. doi: 10.7150/ijms.111780. eCollection 2025.
5
Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.
J Med Internet Res. 2025 Jun 9;27:e72062. doi: 10.2196/72062.
6
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
7
Multimodal LLMs for retinal disease diagnosis via OCT: few-shot versus single-shot learning.
Ther Adv Ophthalmol. 2025 May 20;17:25158414251340569. doi: 10.1177/25158414251340569. eCollection 2025 Jan-Dec.
8
The Role of Large Language Models (LLMs) in Breast Imaging Today and in the Near Future.
J Magn Reson Imaging. 2025 May 4. doi: 10.1002/jmri.29807.
9
Large language models and psychiatry.
Int J Law Psychiatry. 2025 Jul-Aug;101:102086. doi: 10.1016/j.ijlp.2025.102086. Epub 2025 Feb 27.
10
Can large language models be new supportive tools in coronary computed tomography angiography reporting?
Clin Imaging. 2024 Oct;114:110271. doi: 10.1016/j.clinimag.2024.110271. Epub 2024 Aug 31.

Cited by

1
Development and Validation of a Pathomics-Based Prognostic Model for Patients with Lung Adenocarcinoma Undergoing First-Line EGFR-TKI Therapy.
Ann Surg Oncol. 2025 Sep 5. doi: 10.1245/s10434-025-17656-4.
2
Advancements and challenges of artificial intelligence in dermatology: a review of applications and perspectives in China.
Front Digit Health. 2025 Aug 13;7:1544520. doi: 10.3389/fdgth.2025.1544520. eCollection 2025.
3
Large language models in clinical nutrition: an overview of its applications, capabilities, limitations, and potential future prospects.
Front Nutr. 2025 Aug 7;12:1635682. doi: 10.3389/fnut.2025.1635682. eCollection 2025.
4
Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer.
BMC Oral Health. 2025 Aug 23;25(1):1358. doi: 10.1186/s12903-025-06726-4.
5
Large Language Models in Healthcare and Medical Applications: A Review.
Bioengineering (Basel). 2025 Jun 10;12(6):631. doi: 10.3390/bioengineering12060631.
6
Evaluation of large language models in generating pulmonary nodule follow-up recommendations.
Eur J Radiol Open. 2025 Apr 30;14:100655. doi: 10.1016/j.ejro.2025.100655. eCollection 2025 Jun.

References

1
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations.
Proc Conf Assoc Comput Linguist Meet. 2023 Jul;2023:9871-9889. doi: 10.18653/v1/2023.acl-long.549.
2
Large Language Model Influence on Diagnostic Reasoning: A Randomized Clinical Trial.
JAMA Netw Open. 2024 Oct 1;7(10):e2440969. doi: 10.1001/jamanetworkopen.2024.40969.
3
Large language models' responses to liver cancer surveillance, diagnosis, and management questions: accuracy, reliability, readability.
Abdom Radiol (NY). 2024 Dec;49(12):4286-4294. doi: 10.1007/s00261-024-04501-7. Epub 2024 Aug 1.
4
Assessing novelty, feasibility and value of creative ideas with an unsupervised approach using GPT-4.
Br J Psychol. 2024 Jul 22. doi: 10.1111/bjop.12720.
5
Integrated image-based deep learning and language models for primary diabetes care.
Nat Med. 2024 Oct;30(10):2886-2896. doi: 10.1038/s41591-024-03139-8. Epub 2024 Jul 19.
6
Evaluating the strengths and limitations of multimodal ChatGPT-4 in detecting glaucoma using fundus images.
Front Ophthalmol (Lausanne). 2024 Jun 7;4:1387190. doi: 10.3389/fopht.2024.1387190. eCollection 2024.
7
Benchmarking PathCLIP for Pathology Image Analysis.
J Imaging Inform Med. 2025 Feb;38(1):422-438. doi: 10.1007/s10278-024-01128-4. Epub 2024 Jul 9.
8
Evaluation of ChatGPT's Potential in Tailoring Gynecological Cancer Therapies.
In Vivo. 2024 Jul-Aug;38(4):1649-1659. doi: 10.21873/invivo.13614.
9
Decoding the Clavien-Dindo Classification: Artificial Intelligence (AI) as a Novel Tool to Grade Postoperative Complications.
Ann Surg. 2024 Jun 17. doi: 10.1097/SLA.0000000000006399.
10
Evaluating ChatGPT-4V in chest CT diagnostics: a critical image interpretation assessment.
Jpn J Radiol. 2024 Oct;42(10):1168-1177. doi: 10.1007/s11604-024-01606-3. Epub 2024 Jun 13.