HiPrompt：通过面向层次结构的提示实现少样本生物医学知识融合。

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.

作者信息

Lu Jiaying, Shen Jiaming, Xiong Bo, Ma Wenjing, Staab Steffen, Yang Carl

机构信息

Emory University, USA.

Google Research, USA.

出版信息

Int ACM SIGIR Conf Res Dev Inf Retr. 2023 Jul;2023:2052-2056. doi: 10.1145/3539618.3591997. Epub 2023 Jul 18.

DOI:10.1145/3539618.3591997

PMID:38352127

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10863609/

Abstract

Medical decision-making processes can be enhanced by comprehensive biomedical knowledge bases, which require fusing knowledge graphs constructed from different sources via a uniform index system. The index system often organizes biomedical terms in a hierarchy to provide the aligned entities with fine-grained granularity. To address the challenge of scarce supervision in the biomedical knowledge fusion (BKF) task, researchers have proposed various unsupervised methods. However, these methods heavily rely on ad-hoc lexical and structural matching algorithms, which fail to capture the rich semantics conveyed by biomedical entities and terms. Recently, neural embedding models have proved effective in semantic-rich tasks, but they rely on sufficient labeled data to be adequately trained. To bridge the gap between the scarce-labeled BKF and neural embedding models, we propose HiPrompt, a supervision-efficient knowledge fusion framework that elicits the few-shot reasoning ability of large language models through hierarchy-oriented prompts. Empirical results on the collected KG-Hi-BKF benchmark datasets demonstrate the effectiveness of HiPrompt.

摘要

全面的生物医学知识库可以增强医学决策过程，这需要通过统一的索引系统融合从不同来源构建的知识图谱。索引系统通常会以层次结构组织生物医学术语，以便为对齐的实体提供细粒度的粒度。为了应对生物医学知识融合（BKF）任务中监督稀缺的挑战，研究人员提出了各种无监督方法。然而，这些方法严重依赖临时的词汇和结构匹配算法，无法捕捉生物医学实体和术语所传达的丰富语义。最近，神经嵌入模型已被证明在语义丰富的任务中有效，但它们依赖于足够的标记数据才能得到充分训练。为了弥合标记稀缺的BKF与神经嵌入模型之间的差距，我们提出了HiPrompt，这是一个监督高效的知识融合框架，通过面向层次结构的提示来激发大语言模型的少样本推理能力。在收集的KG-Hi-BKF基准数据集上的实证结果证明了HiPrompt的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d37c/10863609/346305cf44ce/nihms-1964215-f0001.jpg

相似文献

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.HiPrompt：通过面向层次结构的提示实现少样本生物医学知识融合。

Int ACM SIGIR Conf Res Dev Inf Retr. 2023 Jul;2023:2052-2056. doi: 10.1145/3539618.3591997. Epub 2023 Jul 18.

An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估：算法开发与验证研究

JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.

Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.利用生物医学知识图谱中的语义模式预测治疗和因果关系。

J Biomed Inform. 2018 Jun;82:189-199. doi: 10.1016/j.jbi.2018.05.003. Epub 2018 May 12.

Leveraging Symbolic Knowledge Bases for Commonsense Natural Language Inference Using Pattern Theory.利用符号知识库和模式理论进行常识自然语言推理。

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13185-13202. doi: 10.1109/TPAMI.2023.3287837. Epub 2023 Oct 3.

Neural sentence embedding models for semantic similarity estimation in the biomedical domain.生物医学领域中语义相似度估计的神经句子嵌入模型。

BMC Bioinformatics. 2019 Apr 11;20(1):178. doi: 10.1186/s12859-019-2789-2.

Enriching contextualized language model from knowledge graph for biomedical information extraction.从知识图谱中丰富上下文相关的语言模型以进行生物医学信息抽取。

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa110.

From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts.从零到英雄：利用变压器在零样本和少样本上下文中进行生物医学命名实体识别。

Artif Intell Med. 2024 Oct;156:102970. doi: 10.1016/j.artmed.2024.102970. Epub 2024 Aug 24.

Learning to explain is a good biomedical few-shot learner.学会解释是一个很好的生物医学小样本学习者。

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae589.

Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning.通过知识引导实例生成和提示对比学习的少样本生物医学命名实体识别。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad496.

Prompt Tuning in Biomedical Relation Extraction.生物医学关系抽取中的提示调优

J Healthc Inform Res. 2024 Feb 29;8(2):206-224. doi: 10.1007/s41666-024-00162-9. eCollection 2024 Jun.

引用本文的文献

PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking.PromptLink：利用大语言模型进行跨源生物医学概念链接。

Int ACM SIGIR Conf Res Dev Inf Retr. 2024 Jul;2024:2589-2593. doi: 10.1145/3626772.3657904. Epub 2024 Jul 11.

本文引用的文献

Weakly Supervised Concept Map Generation through Task-Guided Graph Translation.通过任务引导的图翻译进行弱监督概念图生成

IEEE Trans Knowl Data Eng. 2023 Oct;35(10):10871-10883. doi: 10.1109/tkde.2023.3252588. Epub 2023 Mar 6.

Large language models encode clinical knowledge.大语言模型编码临床知识。

Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.

Biomedical discovery through the integrative biomedical knowledge hub (iBKH).通过综合生物医学知识中心（iBKH）进行生物医学发现。

iScience. 2023 Mar 21;26(4):106460. doi: 10.1016/j.isci.2023.106460. eCollection 2023 Apr 21.

Cellcano: supervised cell type identification for single cell ATAC-seq data.Cellcano：单细胞 ATAC-seq 数据的有监督细胞类型识别。

Nat Commun. 2023 Apr 3;14(1):1864. doi: 10.1038/s41467-023-37439-3.

Building a knowledge graph to enable precision medicine.构建知识图谱以实现精准医学。

Sci Data. 2023 Feb 2;10(1):67. doi: 10.1038/s41597-023-01960-3.

Cell Taxonomy: a curated repository of cell types with multifaceted characterization.细胞分类学：一个经过精心整理的细胞类型存储库，具有多方面的特征描述。

Nucleic Acids Res. 2023 Jan 6;51(D1):D853-D860. doi: 10.1093/nar/gkac816.

Multimodal reasoning based on knowledge graph embedding for specific diseases.基于知识图嵌入的特定疾病的多模态推理。

Bioinformatics. 2022 Apr 12;38(8):2235-2245. doi: 10.1093/bioinformatics/btac085.

A knowledge graph to interpret clinical proteomics data.一个解释临床蛋白质组学数据的知识图谱。

Nat Biotechnol. 2022 May;40(5):692-702. doi: 10.1038/s41587-021-01145-6. Epub 2022 Jan 31.

The Human Disease Ontology 2022 update.人类疾病本体 2022 更新版。

Nucleic Acids Res. 2022 Jan 7;50(D1):D1255-D1261. doi: 10.1093/nar/gkab1063.

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications.基于云的生物医学出版物中用户定义短语类别关联的短语挖掘与分析

J Vis Exp. 2019 Feb 23(144). doi: 10.3791/59108.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

HiPrompt：通过面向层次结构的提示实现少样本生物医学知识融合。

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献