Utility of word embeddings from large language models in medical diagnosis.

Authors

Yazdani Shahram, Henry Ronald Claude, Byrne Avery, Henry Isaac Claude

Affiliations

Department of Pediatrics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, United States.

Department of Civil Engineering, University of Southern California, Los Angeles, CA 90089, United States.

Publication information

J Am Med Inform Assoc. 2025 Mar 1;32(3):526-534. doi: 10.1093/jamia/ocae314.

DOI: 10.1093/jamia/ocae314
PMID: 39786898
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11833464/

Abstract

OBJECTIVE

This study evaluates the utility of word embeddings generated by large language models (LLMs) for medical diagnosis. It compares the semantic proximity of symptoms to two reference points: the embedding of the disease name itself ("eponymic condition") and the mean of all symptom embeddings associated with that disease ("ensemble mean").
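
One way to write down the two reference points, as a sketch only: the notation below is not taken from the paper. It assumes e(x) denotes the embedding vector of a text x, that disease d has n_d listed symptoms s_{d,1}, ..., s_{d,n_d}, and that "dist" is whichever distance metric is chosen.

```latex
% Ensemble mean: coordinate-wise average of the symptom embeddings of disease d
\mu_d = \frac{1}{n_d} \sum_{i=1}^{n_d} e(s_{d,i})

% The two proximities being compared for a symptom s and a disease d
\operatorname{dist}\bigl(e(s),\, e(d)\bigr)
\quad \text{(eponymic condition)}
\qquad
\operatorname{dist}\bigl(e(s),\, \mu_d\bigr)
\quad \text{(ensemble mean)}
```

Whether a symptom's own embedding is held out of the mean when that symptom is evaluated is not stated in the abstract, so the formula leaves that choice open.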

MATERIALS AND METHODS

Symptom data for 5 diagnostically challenging pediatric diseases (CHARGE syndrome, Cowden disease, POEMS syndrome, rheumatic fever, and tuberous sclerosis) were collected from PubMed. Using the Ada-002 embedding model, disease names and symptoms were translated into vector representations in a high-dimensional space. Euclidean and Chebyshev distance metrics were then used to classify symptoms based on their proximity to both the eponymic condition and the ensemble mean of the condition's symptoms.
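
The classification step described above can be illustrated with a minimal sketch. It is not the authors' code: the nearest-reference rule is an assumed reading of "classify based on proximity", the helper names (`euclidean`, `chebyshev`, `ensemble_mean`, `classify`) are hypothetical, and the 3-dimensional vectors are arbitrary toy values standing in for real Ada-002 embeddings, so the outcome says nothing about the reported results.

```python
import math


def euclidean(u, v):
    """L2 distance: square root of the summed squared coordinate differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def chebyshev(u, v):
    """L-infinity distance: the largest absolute coordinate difference."""
    return max(abs(a - b) for a, b in zip(u, v))


def ensemble_mean(vectors):
    """Coordinate-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classify(symptom_vec, references, distance):
    """Return the disease whose reference vector is nearest to the symptom.

    Assumption: a symptom is assigned to the closest reference point.
    """
    return min(references, key=lambda d: distance(symptom_vec, references[d]))


# Toy symptom embeddings per disease (hypothetical values, not model output).
symptom_vecs = {
    "disease_a": [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]],
    "disease_b": [[0.0, 1.0, 0.0], [0.0, 0.8, 0.2]],
}

# Toy embeddings of the disease names themselves (the "eponymic condition").
name_vecs = {
    "disease_a": [0.2, 0.6, 0.2],
    "disease_b": [0.6, 0.3, 0.1],
}

# One ensemble-mean reference vector per disease.
ensemble_refs = {d: ensemble_mean(vs) for d, vs in symptom_vecs.items()}

query = [0.9, 0.1, 0.0]  # toy embedding of a symptom of disease_a
by_ensemble = classify(query, ensemble_refs, euclidean)
by_name = classify(query, name_vecs, chebyshev)
print(by_ensemble, by_name)
```

With these toy values the ensemble-mean reference assigns the query to `disease_a`, while the name-based reference assigns it to `disease_b`. The vectors were chosen only to show that the two reference points can disagree.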

RESULTS

The ensemble mean approach showed significantly higher classification accuracy, correctly classifying from 80% (Cowden disease) to 100% (tuberous sclerosis) of the sample disease symptoms using the Euclidean distance metric. In contrast, the eponymic condition approach, with either the Euclidean or the Chebyshev distance metric, generally classified symptoms poorly: results were erratic (0%-100% accuracy), with most falling between 0% and 3%.

DISCUSSION

The ensemble mean captures a disease's collective symptom profile, providing a more nuanced representation than the disease name alone. However, some misclassifications were due to superficial semantic similarities, highlighting the need for LLMs trained on medical corpora.

CONCLUSION

The ensemble mean of symptom embeddings improves classification accuracy over the eponymic condition approach. Future efforts should focus on medical-specific training of LLMs to enhance their diagnostic accuracy and clinical utility.
