• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用具有提示工程的大语言模型构建心力衰竭知识图谱。

Knowledge graph construction for heart failure using large language models with prompt engineering.

作者信息

Xu Tianhan, Gu Yixun, Xue Mantian, Gu Renjie, Li Bin, Gu Xiang

机构信息

School of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China.

School of Information Engineering, Yangzhou Polytechnic Institute, Yangzhou, Jiangsu, China.

出版信息

Front Comput Neurosci. 2024 Jul 2;18:1389475. doi: 10.3389/fncom.2024.1389475. eCollection 2024.

DOI:10.3389/fncom.2024.1389475
PMID:39015745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11250484/
Abstract

INTRODUCTION

Constructing an accurate and comprehensive knowledge graph of specific diseases is critical for practical clinical disease diagnosis and treatment, reasoning and decision support, rehabilitation, and health management. For knowledge graph construction tasks (such as named entity recognition, relation extraction), classical BERT-based methods require a large amount of training data to ensure model performance. However, real-world medical annotation data, especially disease-specific annotation samples, are very limited. In addition, existing models do not perform well in recognizing out-of-distribution entities and relations that are not seen in the training phase.

METHOD

In this study, we present a novel and practical pipeline for constructing a heart failure knowledge graph using large language models and medical expert refinement. We apply prompt engineering to the three phases of schema design: schema design, information extraction, and knowledge completion. The best performance is achieved by designing task-specific prompt templates combined with the TwoStepChat approach.

RESULTS

Experiments on two datasets show that the TwoStepChat method outperforms the Vanillia prompt and outperforms the fine-tuned BERT-based baselines. Moreover, our method saves 65% of the time compared to manual annotation and is better suited to extract the out-of-distribution information in the real world.

摘要

引言

构建特定疾病准确且全面的知识图谱对于临床疾病的实际诊断与治疗、推理与决策支持、康复以及健康管理至关重要。对于知识图谱构建任务(如命名实体识别、关系抽取),基于经典BERT的方法需要大量训练数据来确保模型性能。然而,现实世界中的医学标注数据,尤其是特定疾病的标注样本非常有限。此外,现有模型在识别训练阶段未出现的分布外实体和关系方面表现不佳。

方法

在本研究中,我们提出了一种新颖且实用的流程,利用大语言模型和医学专家优化来构建心力衰竭知识图谱。我们将提示工程应用于模式设计、信息提取和知识完善这三个阶段。通过结合TwoStepChat方法设计特定任务的提示模板,可实现最佳性能。

结果

在两个数据集上的实验表明,TwoStepChat方法优于香草提示,并且优于基于微调BERT的基线方法。此外,与人工标注相比,我们的方法节省了65%的时间,并且更适合提取现实世界中的分布外信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/4789ae64fdc4/fncom-18-1389475-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/3cb8a40e0689/fncom-18-1389475-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/e948e73cf4ba/fncom-18-1389475-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/ec8ffc453d64/fncom-18-1389475-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/42c3dc94d911/fncom-18-1389475-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/db30222c0e62/fncom-18-1389475-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/acbdd4b69969/fncom-18-1389475-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/ffd0ad7ed7f2/fncom-18-1389475-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/a34c1844cda6/fncom-18-1389475-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/873439431186/fncom-18-1389475-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/4789ae64fdc4/fncom-18-1389475-g0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/3cb8a40e0689/fncom-18-1389475-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/e948e73cf4ba/fncom-18-1389475-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/ec8ffc453d64/fncom-18-1389475-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/42c3dc94d911/fncom-18-1389475-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/db30222c0e62/fncom-18-1389475-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/acbdd4b69969/fncom-18-1389475-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/ffd0ad7ed7f2/fncom-18-1389475-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/a34c1844cda6/fncom-18-1389475-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/873439431186/fncom-18-1389475-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab6f/11250484/4789ae64fdc4/fncom-18-1389475-g0010.jpg

相似文献

1
Knowledge graph construction for heart failure using large language models with prompt engineering.使用具有提示工程的大语言模型构建心力衰竭知识图谱。
Front Comput Neurosci. 2024 Jul 2;18:1389475. doi: 10.3389/fncom.2024.1389475. eCollection 2024.
2
Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction.从中国电子病历中自动提取知识并构建类风湿性关节炎知识图谱。
Quant Imaging Med Surg. 2023 Jun 1;13(6):3873-3890. doi: 10.21037/qims-22-1158. Epub 2023 May 8.
3
A BERT-Span model for Chinese named entity recognition in rehabilitation medicine.一种用于康复医学中文命名实体识别的BERT跨度模型。
PeerJ Comput Sci. 2023 Aug 21;9:e1535. doi: 10.7717/peerj-cs.1535. eCollection 2023.
4
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion.基于路径的知识推理与文本语义信息融合的医疗知识图谱补全方法
BMC Med Inform Decis Mak. 2021 Nov 29;21(Suppl 9):335. doi: 10.1186/s12911-021-01622-7.
5
ShallowBKGC: a BERT-enhanced shallow neural network model for knowledge graph completion.ShallowBKGC:一种用于知识图谱补全的BERT增强型浅层神经网络模型。
PeerJ Comput Sci. 2024 May 15;10:e2058. doi: 10.7717/peerj-cs.2058. eCollection 2024.
6
Drug knowledge discovery via multi-task learning and pre-trained models.通过多任务学习和预训练模型进行药物知识发现。
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):251. doi: 10.1186/s12911-021-01614-7.
7
Knowledge Graph Completion for the Chinese Text of Cultural Relics Based on Bidirectional Encoder Representations from Transformers with Entity-Type Information.基于带有实体类型信息的变换器双向编码器表征的文物中文文本知识图谱补全
Entropy (Basel). 2020 Oct 16;22(10):1168. doi: 10.3390/e22101168.
8
Deep learning with language models improves named entity recognition for PharmaCoNER.基于语言模型的深度学习可提高 PharmaCoNER 的命名实体识别能力。
BMC Bioinformatics. 2021 Dec 17;22(Suppl 1):602. doi: 10.1186/s12859-021-04260-y.
9
Model-based clinical note entity recognition for rheumatoid arthritis using bidirectional encoder representation from transformers.基于模型的类风湿性关节炎临床笔记实体识别:使用来自变换器的双向编码器表征
Quant Imaging Med Surg. 2022 Jan;12(1):184-195. doi: 10.21037/qims-21-90.
10
Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study.用于命名实体识别任务的大语言模型微调的样本量考量:方法学研究
JMIR AI. 2024 May 16;3:e52095. doi: 10.2196/52095.

引用本文的文献

1
Knowledge graph construction for intelligent cockpits based on large language models.基于大语言模型的智能座舱知识图谱构建
Sci Rep. 2025 Mar 4;15(1):7635. doi: 10.1038/s41598-025-92002-y.

本文引用的文献

1
The shaky foundations of large language models and foundation models for electronic health records.用于电子健康记录的大语言模型和基础模型的不稳定基础。
NPJ Digit Med. 2023 Jul 29;6(1):135. doi: 10.1038/s41746-023-00879-8.
2
Knowledge graph aids comprehensive explanation of drug and chemical toxicity.知识图谱辅助药物和化学毒性的全面阐释。
CPT Pharmacometrics Syst Pharmacol. 2023 Aug;12(8):1072-1079. doi: 10.1002/psp4.12975. Epub 2023 Jul 20.
3
Large language models in medicine.医学中的大型语言模型。
Nat Med. 2023 Aug;29(8):1930-1940. doi: 10.1038/s41591-023-02448-8. Epub 2023 Jul 17.
4
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
5
Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities.医疗保健知识图谱构建:对最新技术水平、开放问题及机遇的系统综述。
J Big Data. 2023;10(1):81. doi: 10.1186/s40537-023-00774-9. Epub 2023 May 28.
6
Building a knowledge graph to enable precision medicine.构建知识图谱以实现精准医学。
Sci Data. 2023 Feb 2;10(1):67. doi: 10.1038/s41597-023-01960-3.
7
A comprehensive review on knowledge graphs for complex diseases.关于复杂疾病知识图谱的全面综述。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac543.
8
Application of Medical Knowledge Graphs in Cardiology and Cardiovascular Medicine: A Brief Literature Review.医学知识图谱在心脏病学和心血管医学中的应用:文献综述简述。
Adv Ther. 2022 Sep;39(9):4052-4060. doi: 10.1007/s12325-022-02254-7. Epub 2022 Jul 30.
9
BioRED: a rich biomedical relation extraction dataset.BioRED:一个丰富的生物医学关系抽取数据集。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac282.
10
Hybridized sine cosine algorithm with convolutional neural networks dropout regularization application.混合正弦余弦算法与卷积神经网络辍学正则化应用。
Sci Rep. 2022 Apr 15;12(1):6302. doi: 10.1038/s41598-022-09744-2.