• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物知识图谱绘制工具:对从生物医学文献中自动构建知识图谱的初步评估。

BioKGrapher: Initial evaluation of automated knowledge graph construction from biomedical literature.

作者信息

Schäfer Henning, Idrissi-Yaghir Ahmad, Arzideh Kamyar, Damm Hendrik, Pakull Tabea M G, Schmidt Cynthia S, Bahn Mikel, Lodde Georg, Livingstone Elisabeth, Schadendorf Dirk, Nensa Felix, Horn Peter A, Friedrich Christoph M

机构信息

Institute for Transfusion Medicine, University Hospital Essen, Hufelandstraße 55, Essen, 45147, Germany.

Department of Computer Science, University of Applied Sciences and Arts Dortmund (FHDO), Emil-Figge Str. 42, Dortmund, 44227, Germany.

出版信息

Comput Struct Biotechnol J. 2024 Oct 17;24:639-660. doi: 10.1016/j.csbj.2024.10.017. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.10.017
PMID:39502384
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11536026/
Abstract

The growth of biomedical literature presents challenges in extracting and structuring knowledge. Knowledge Graphs (KGs) offer a solution by representing relationships between biomedical entities. However, manual construction of KGs is labor-intensive and time-consuming, highlighting the need for automated methods. This work introduces BioKGrapher, a tool for automatic KG construction using large-scale publication data, with a focus on biomedical concepts related to specific medical conditions. BioKGrapher allows researchers to construct KGs from PubMed IDs. The BioKGrapher pipeline begins with Named Entity Recognition and Linking (NER+NEL) to extract and normalize biomedical concepts from PubMed, mapping them to the Unified Medical Language System (UMLS). Extracted concepts are weighted and re-ranked using Kullback-Leibler divergence and local frequency balancing. These concepts are then integrated into hierarchical KGs, with relationships formed using terminologies like SNOMED CT and NCIt. Downstream applications include multi-label document classification using Adapter-infused Transformer models. BioKGrapher effectively aligns generated concepts with clinical practice guidelines from the German Guideline Program in Oncology (GGPO), achieving -Scores of up to 0.6. In multi-label classification, Adapter-infused models using a BioKGrapher cancer-specific KG improved micro -Scores by up to 0.89 percentage points over a non-specific KG and 2.16 points over base models across three BERT variants. The drug-disease extraction case study identified indications for Nivolumab and Rituximab. BioKGrapher is a tool for automatic KG construction, aligning with the GGPO and enhancing downstream task performance. It offers a scalable solution for managing biomedical knowledge, with potential applications in literature recommendation, decision support, and drug repurposing.

摘要

生物医学文献的增长给知识提取和结构化带来了挑战。知识图谱(KGs)通过表示生物医学实体之间的关系提供了一种解决方案。然而,手动构建知识图谱既费力又耗时,这凸显了自动化方法的必要性。这项工作介绍了BioKGrapher,这是一种利用大规模出版数据自动构建知识图谱的工具,重点关注与特定医疗状况相关的生物医学概念。BioKGrapher允许研究人员从PubMed ID构建知识图谱。BioKGrapher流程始于命名实体识别与链接(NER+NEL),从PubMed中提取并规范化生物医学概念,将它们映射到统一医学语言系统(UMLS)。使用Kullback-Leibler散度和局部频率平衡对提取的概念进行加权和重新排序。然后将这些概念整合到层次化知识图谱中,使用SNOMED CT和NCIt等术语形成关系。下游应用包括使用注入适配器的Transformer模型进行多标签文档分类。BioKGrapher有效地将生成的概念与德国肿瘤学指南计划(GGPO)的临床实践指南对齐,实现了高达0.6的-Scores。在多标签分类中,使用BioKGrapher癌症特异性知识图谱的注入适配器模型在三个BERT变体上,相比于非特异性知识图谱,微-Scores提高了高达0.89个百分点,相比于基础模型提高了2.16个百分点。药物-疾病提取案例研究确定了纳武单抗和利妥昔单抗的适应症。BioKGrapher是一种自动构建知识图谱的工具,与GGPO对齐并提高了下游任务性能。它为管理生物医学知识提供了一种可扩展的解决方案,在文献推荐、决策支持和药物重新利用方面具有潜在应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/59677fc2795e/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/09394f26d3de/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/2d556c55908c/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/875a71da31f9/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/bc219f683a50/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/e14186b3dd00/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/bf4a80abf4b0/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/9f89891c65a0/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/59677fc2795e/gr008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/09394f26d3de/gr001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/2d556c55908c/gr002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/875a71da31f9/gr003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/bc219f683a50/gr004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/e14186b3dd00/gr005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/bf4a80abf4b0/gr006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/9f89891c65a0/gr007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58ae/11536026/59677fc2795e/gr008.jpg

相似文献

1
BioKGrapher: Initial evaluation of automated knowledge graph construction from biomedical literature.生物知识图谱绘制工具:对从生物医学文献中自动构建知识图谱的初步评估。
Comput Struct Biotechnol J. 2024 Oct 17;24:639-660. doi: 10.1016/j.csbj.2024.10.017. eCollection 2024 Dec.
2
KGen: a knowledge graph generator from biomedical scientific literature.KGen:一种从生物医学科学文献中生成知识图谱的工具。
BMC Med Inform Decis Mak. 2020 Dec 14;20(Suppl 4):314. doi: 10.1186/s12911-020-01341-5.
3
Deciphering the Diversity of Mental Models in Neurodevelopmental Disorders: Knowledge Graph Representation of Public Data Using Natural Language Processing.解读神经发育障碍中心理模型的多样性:使用自然语言处理对公共数据进行知识图谱表示。
J Med Internet Res. 2022 Aug 5;24(8):e39888. doi: 10.2196/39888.
4
Automatic knowledge extraction from Chinese electronic medical records and rheumatoid arthritis knowledge graph construction.从中国电子病历中自动提取知识并构建类风湿性关节炎知识图谱。
Quant Imaging Med Surg. 2023 Jun 1;13(6):3873-3890. doi: 10.21037/qims-22-1158. Epub 2023 May 8.
5
Alzheimer's Disease Knowledge Graph Enhances Knowledge Discovery and Disease Prediction.阿尔茨海默病知识图谱增强知识发现与疾病预测。
bioRxiv. 2024 Jul 5:2024.07.03.601339. doi: 10.1101/2024.07.03.601339.
6
BioBLP: a modular framework for learning on multimodal biomedical knowledge graphs.BioBLP:一种用于多模态生物医学知识图谱学习的模块化框架。
J Biomed Semantics. 2023 Dec 8;14(1):20. doi: 10.1186/s13326-023-00301-y.
7
Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.评估医疗保健中的实体识别:实体模型定量研究。
JMIR Med Inform. 2024 Oct 17;12:e59782. doi: 10.2196/59782.
8
Drug knowledge discovery via multi-task learning and pre-trained models.通过多任务学习和预训练模型进行药物知识发现。
BMC Med Inform Decis Mak. 2021 Nov 16;21(Suppl 9):251. doi: 10.1186/s12911-021-01614-7.
9
MPTN: A message-passing transformer network for drug repurposing from knowledge graph.MPTN:一种基于知识图的药物重定位消息传递转换器网络。
Comput Biol Med. 2024 Jan;168:107800. doi: 10.1016/j.compbiomed.2023.107800. Epub 2023 Dec 1.
10
Biolink Model: A universal schema for knowledge graphs in clinical, biomedical, and translational science.生物链接模型:临床、生物医学和转化科学中知识图谱的通用模式。
Clin Transl Sci. 2022 Aug;15(8):1848-1855. doi: 10.1111/cts.13302. Epub 2022 Jun 6.

引用本文的文献

1
BASIL DB: bioactive semantic integration and linking database.BASIL数据库:生物活性语义整合与链接数据库。
J Biomed Semantics. 2025 Aug 13;16(1):14. doi: 10.1186/s13326-025-00336-3.

本文引用的文献

1
Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities.医疗保健知识图谱构建:对最新技术水平、开放问题及机遇的系统综述。
J Big Data. 2023;10(1):81. doi: 10.1186/s40537-023-00774-9. Epub 2023 May 28.
2
Building a knowledge graph to enable precision medicine.构建知识图谱以实现精准医学。
Sci Data. 2023 Feb 2;10(1):67. doi: 10.1038/s41597-023-01960-3.
3
KG-Predict: A knowledge graph computational framework for drug repurposing.KG-Predict:一种用于药物重定位的知识图谱计算框架。
J Biomed Inform. 2022 Aug;132:104133. doi: 10.1016/j.jbi.2022.104133. Epub 2022 Jul 12.
4
Nivolumab versus sorafenib in advanced hepatocellular carcinoma (CheckMate 459): a randomised, multicentre, open-label, phase 3 trial.纳武利尤单抗对比索拉非尼用于治疗晚期肝细胞癌(CheckMate 459):一项随机、多中心、开放标签、III 期临床试验。
Lancet Oncol. 2022 Jan;23(1):77-90. doi: 10.1016/S1470-2045(21)00604-5. Epub 2021 Dec 13.
5
Multi-domain clinical natural language processing with MedCAT: The Medical Concept Annotation Toolkit.多领域临床自然语言处理与 MedCAT:医学概念标注工具包。
Artif Intell Med. 2021 Jul;117:102083. doi: 10.1016/j.artmed.2021.102083. Epub 2021 May 1.
6
First-line nivolumab plus chemotherapy versus chemotherapy alone for advanced gastric, gastro-oesophageal junction, and oesophageal adenocarcinoma (CheckMate 649): a randomised, open-label, phase 3 trial.一线纳武利尤单抗联合化疗与单纯化疗治疗晚期胃癌、胃食管交界癌和食管腺癌(CheckMate 649):一项随机、开放标签的3期试验。
Lancet. 2021 Jul 3;398(10294):27-40. doi: 10.1016/S0140-6736(21)00797-2. Epub 2021 Jun 5.
7
Novel adjuvant options for cutaneous melanoma.皮肤黑色素瘤的新型辅助治疗选择。
Ann Oncol. 2021 Jul;32(7):854-865. doi: 10.1016/j.annonc.2021.03.198. Epub 2021 Mar 24.
8
KGen: a knowledge graph generator from biomedical scientific literature.KGen:一种从生物医学科学文献中生成知识图谱的工具。
BMC Med Inform Decis Mak. 2020 Dec 14;20(Suppl 4):314. doi: 10.1186/s12911-020-01341-5.
9
Modelling kidney disease using ontology: insights from the Kidney Precision Medicine Project.利用本体模型化肾脏疾病:来自肾脏精准医学计划的见解。
Nat Rev Nephrol. 2020 Nov;16(11):686-696. doi: 10.1038/s41581-020-00335-w. Epub 2020 Sep 16.
10
Safety and Efficacy of Nivolumab in Patients With Advanced Clear Cell Renal Cell Carcinoma: Results From the Phase IIIb/IV CheckMate 374 Study.纳武利尤单抗治疗晚期透明细胞肾细胞癌患者的安全性和疗效:来自 IIIb/IV 期 CheckMate 374 研究的结果。
Clin Genitourin Cancer. 2020 Dec;18(6):469-476.e4. doi: 10.1016/j.clgc.2020.06.002. Epub 2020 Jun 14.