Mapping Drug Terms via Integration of a Retrieval-Augmented Generation Algorithm with a Large Language Model.

Authors

Kimura Eizen, Kawakami Yukinobu, Inoue Shingo, Okajima Ai

Affiliations

Department of Medical Informatics, Medical School of Ehime University, Toon, Ehime, Japan.

Yuimedi Inc., Tokyo, Japan.

Publication information

Healthc Inform Res. 2024 Oct;30(4):355-363. doi: 10.4258/hir.2024.30.4.355. Epub 2024 Oct 31.

DOI:10.4258/hir.2024.30.4.355

PMID:39551922

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11570653/

Abstract

OBJECTIVES

This study evaluated the efficacy of integrating a retrieval-augmented generation (RAG) model and a large language model (LLM) to improve the accuracy of drug name mapping across international vocabularies.

METHODS

Drug ingredient names were translated into English using the Japanese Accepted Names for Pharmaceuticals. Drug concepts were extracted from the OHDSI standard vocabulary, and the accuracy of mappings between the translated terms and RxNorm was assessed by vector similarity, using BioBERT-generated embedding vectors as the baseline. Subsequently, we developed LLMs with RAG that selected the final candidates from among the baseline candidates. We assessed the efficacy of the LLM with RAG in candidate selection by comparing it with conventional methods based on vector similarity.
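The vector-similarity baseline described above can be illustrated with a minimal sketch. It assumes each term has already been turned into an embedding vector; the three-dimensional vectors, the concept names, and the helper names `cosine_similarity` and `top_k_candidates` are invented for illustration and are not taken from the study, which used BioBERT embeddings over RxNorm concepts.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k_candidates(query_vec, concept_vecs, k=3):
    """Rank concepts by similarity to the query and keep the best k."""
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in concept_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy vectors standing in for real embeddings (values are made up).
concepts = {
    "acetaminophen": [0.9, 0.1, 0.0],
    "acetazolamide": [0.7, 0.6, 0.1],
    "ibuprofen": [0.1, 0.2, 0.9],
}
query = [0.88, 0.15, 0.02]  # hypothetical embedding of a translated term

candidates = top_k_candidates(query, concepts, k=2)
for name, score in candidates:
    print(f"{name}: {score:.3f}")
```

In a pipeline of the kind the abstract describes, a shortlist like `candidates` would be what the LLM is asked to choose from, rather than accepting the top-ranked item directly.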

RESULTS

The evaluation metrics demonstrated the superior performance of the combined LLM + RAG over traditional vector similarity methods. Notably, the hit rates of the Mixtral 8x7b and GPT-3.5 models exceeded 90%, significantly outperforming the baseline rate of 64% across stratified groups of oral (PO) drugs, injections, and all interventions. Furthermore, the r-precision metric, which measures the alignment between model judgment and human evaluation, improved notably with the LLMs, ranging from 41% to 50% against a baseline of 23%.
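For readers unfamiliar with the two metrics, the following sketch uses their common information-retrieval definitions: hit rate as the share of queries with at least one correct concept among the top k candidates, and r-precision as the precision at rank R, where R is the number of concepts a human judged correct. This is an assumption, since the abstract does not spell out the exact formulas, and the data and function names are invented.

```python
def hit_rate(ranked_lists, relevant_sets, k):
    """Fraction of queries with at least one relevant item in the top k."""
    hits = sum(
        1
        for ranked, relevant in zip(ranked_lists, relevant_sets)
        if any(item in relevant for item in ranked[:k])
    )
    return hits / len(ranked_lists)


def r_precision(ranked, relevant):
    """Precision at rank R, where R is the number of relevant items."""
    r = len(relevant)
    if r == 0:
        return 0.0
    return sum(1 for item in ranked[:r] if item in relevant) / r


# Two toy queries: model rankings and human-judged correct concepts.
ranked_lists = [["a", "b", "c"], ["d", "e", "f"]]
relevant_sets = [{"a", "c"}, {"f"}]

overall_hit_rate = hit_rate(ranked_lists, relevant_sets, k=2)
mean_r_precision = sum(
    r_precision(ranked, relevant)
    for ranked, relevant in zip(ranked_lists, relevant_sets)
) / len(ranked_lists)

print(overall_hit_rate, mean_r_precision)
```

Hit rate rewards any correct candidate in the shortlist, while r-precision also penalizes incorrect items ranked above correct ones, which is one reason the two figures can differ widely for the same system.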

CONCLUSIONS

Integrating RAG with an LLM outperformed conventional string comparison and embedding vector similarity techniques, offering a more refined approach to global drug information mapping.
