一种简化的检索器，用于提高大语言模型进行表型标准化的准确性。

A simplified retriever to improve accuracy of phenotype normalizations by large language models.

作者信息

Hier Daniel B, Do Thanh Son, Obafemi-Ajayi Tayo

机构信息

Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States.

Department of Computer Science, Missouri State University, Springfield, MO, United States.

出版信息

Front Digit Health. 2025 Mar 4;7:1495040. doi: 10.3389/fdgth.2025.1495040. eCollection 2025.

DOI:10.3389/fdgth.2025.1495040

PMID:40103736

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11913805/

Abstract

Large language models have shown improved accuracy in phenotype term normalization tasks when augmented with retrievers that suggest candidate normalizations based on term definitions. In this work, we introduce a simplified retriever that enhances large language model accuracy by searching the Human Phenotype Ontology (HPO) for candidate matches using contextual word embeddings from BioBERT without the need for explicit term definitions. Testing this method on terms derived from the clinical synopses of Online Mendelian Inheritance in Man (OMIM), we demonstrate that the normalization accuracy of GPT-4o increases from a baseline of 62% without augmentation to 85% with retriever augmentation. This approach is potentially generalizable to other biomedical term normalization tasks and offers an efficient alternative to more complex retrieval methods.

摘要

当使用基于术语定义建议候选标准化的检索器进行增强时，大型语言模型在表型术语标准化任务中显示出更高的准确性。在这项工作中，我们引入了一种简化的检索器，它通过使用来自BioBERT的上下文词嵌入在人类表型本体（HPO）中搜索候选匹配项来提高大型语言模型的准确性，而无需明确的术语定义。在源自《人类孟德尔遗传在线》（OMIM）临床摘要的术语上测试此方法，我们证明GPT-4o的标准化准确率从无增强时的62%基线提高到有检索器增强时的85%。这种方法有可能推广到其他生物医学术语标准化任务，并为更复杂的检索方法提供了一种有效的替代方案。