• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

知识图嵌入模型的集合提高了药物发现的预测能力。

Ensembles of knowledge graph embedding models improve predictions for drug discovery.

机构信息

Enveda Biosciences, Boulder, CO, USA.

出版信息

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac481.

DOI:10.1093/bib/bbac481
PMID:36384050
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9677479/
Abstract

Recent advances in Knowledge Graphs (KGs) and Knowledge Graph Embedding Models (KGEMs) have led to their adoption in a broad range of fields and applications. The current publishing system in machine learning requires newly introduced KGEMs to achieve state-of-the-art performance, surpassing at least one benchmark in order to be published. Despite this, dozens of novel architectures are published every year, making it challenging for users, even within the field, to deduce the most suitable configuration for a given application. A typical biomedical application of KGEMs is drug-disease prediction in the context of drug discovery, in which a KGEM is trained to predict triples linking drugs and diseases. These predictions can be later tested in clinical trials following extensive experimental validation. However, given the infeasibility of evaluating each of these predictions and that only a minimal number of candidates can be experimentally tested, models that yield higher precision on the top prioritized triples are preferred. In this paper, we apply the concept of ensemble learning on KGEMs for drug discovery to assess whether combining the predictions of several models can lead to an overall improvement in predictive performance. First, we trained and benchmarked 10 KGEMs to predict drug-disease triples on two independent biomedical KGs designed for drug discovery. Following, we applied different ensemble methods that aggregate the predictions of these models by leveraging the distribution or the position of the predicted triple scores. We then demonstrate how the ensemble models can achieve better results than the original KGEMs by benchmarking the precision (i.e., number of true positives prioritized) of their top predictions. Lastly, we released the source code presented in this work at https://github.com/enveda/kgem-ensembles-in-drug-discovery.

摘要

近年来,知识图谱(Knowledge Graphs,KGs)和知识图谱嵌入模型(Knowledge Graph Embedding Models,KGEMs)的发展使得它们在广泛的领域和应用中得到了采用。目前机器学习的出版系统要求新引入的 KGEM 要达到最先进的性能,即至少要在一个基准上超越,才能被发表。尽管如此,每年仍有数十个新的架构被发表,这使得即使是在该领域内的用户也难以推断出给定应用程序最合适的配置。KGEM 在药物发现背景下进行药物-疾病预测是其在生物医学领域的一个典型应用,在该应用中,KGEM 被训练来预测将药物和疾病联系起来的三元组。这些预测可以在经过广泛的实验验证后,在临床试验中进行测试。然而,由于评估这些预测中的每一个都是不可行的,并且只有少数候选药物可以进行实验测试,因此更倾向于使用那些在优先级最高的三元组上获得更高精度的模型。在本文中,我们将集成学习的概念应用于药物发现的 KGEM 中,以评估是否可以通过结合多个模型的预测来提高预测性能。首先,我们在两个专为药物发现设计的独立生物医学 KGs 上训练和基准测试了 10 个 KGEM,以预测药物-疾病三元组。然后,我们应用了不同的集成方法,通过利用预测三元组分数的分布或位置来汇总这些模型的预测。然后,我们通过基准测试其最高预测的精度(即优先的真阳性数量)来展示集成模型如何获得比原始 KGEM 更好的结果。最后,我们在 https://github.com/enveda/kgem-ensembles-in-drug-discovery 上发布了本文中提出的源代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/b5a2589565b8/bbac481f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/20051da8c066/bbac481f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/99aad438892a/bbac481f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/5d4349822da7/bbac481f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/79c95c074f1a/bbac481f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/343e5dfa231a/bbac481f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/b5a2589565b8/bbac481f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/20051da8c066/bbac481f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/99aad438892a/bbac481f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/5d4349822da7/bbac481f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/79c95c074f1a/bbac481f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/343e5dfa231a/bbac481f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9221/9677479/b5a2589565b8/bbac481f6.jpg

相似文献

1
Ensembles of knowledge graph embedding models improve predictions for drug discovery.知识图嵌入模型的集合提高了药物发现的预测能力。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac481.
2
STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs.STonKGs:基于生物医学文本和知识图谱训练的复杂型 Transformer。
Bioinformatics. 2022 Mar 4;38(6):1648-1656. doi: 10.1093/bioinformatics/btac001.
3
Knowledge Graphs and Their Applications in Drug Discovery.知识图谱及其在药物发现中的应用。
Methods Mol Biol. 2024;2716:203-221. doi: 10.1007/978-1-0716-3449-3_9.
4
Implications of topological imbalance for representation learning on biomedical knowledge graphs.拓扑不平衡对生物医学知识图谱表示学习的影响。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac279.
5
Graph embedding-based link prediction for literature-based discovery in Alzheimer's Disease.基于图嵌入的阿尔茨海默病文献发现链路预测。
J Biomed Inform. 2023 Sep;145:104464. doi: 10.1016/j.jbi.2023.104464. Epub 2023 Aug 2.
6
Adverse Drug Event Prediction Using Noisy Literature-Derived Knowledge Graphs: Algorithm Development and Validation.使用有噪声的文献衍生知识图谱进行药物不良事件预测:算法开发与验证
JMIR Med Inform. 2021 Oct 25;9(10):e32730. doi: 10.2196/32730.
7
Comparing methods for drug-gene interaction prediction on the biomedical literature knowledge graph: performance versus explainability.比较生物医学文献知识图谱上药物-基因相互作用预测方法:性能与可解释性。
BMC Bioinformatics. 2023 Jun 30;24(1):272. doi: 10.1186/s12859-023-05373-2.
8
MPTN: A message-passing transformer network for drug repurposing from knowledge graph.MPTN:一种基于知识图的药物重定位消息传递转换器网络。
Comput Biol Med. 2024 Jan;168:107800. doi: 10.1016/j.compbiomed.2023.107800. Epub 2023 Dec 1.
9
A review of biomedical datasets relating to drug discovery: a knowledge graph perspective.生物医学数据集在药物发现中的应用综述:知识图谱视角。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac404.
10
Ensembling graph attention networks for human microbe-drug association prediction.基于图注意力网络的微生物-药物关联预测方法
Bioinformatics. 2020 Dec 30;36(Suppl_2):i779-i786. doi: 10.1093/bioinformatics/btaa891.

引用本文的文献

1
Bind: large-scale biological interaction network discovery through knowledge graph-driven machine learning.Bind:通过知识图谱驱动的机器学习发现大规模生物相互作用网络
J Transl Med. 2025 Jul 31;23(1):856. doi: 10.1186/s12967-025-06789-5.
2
Understanding ecological systems using knowledge graphs: an application to highly pathogenic avian influenza.利用知识图谱理解生态系统:高致病性禽流感的一个应用
Bioinform Adv. 2025 Feb 5;5(1):vbaf016. doi: 10.1093/bioadv/vbaf016. eCollection 2025.
3
Hybrid Explainable Artificial Intelligence Models for Targeted Metabolomics Analysis of Diabetic Retinopathy.

本文引用的文献

1
A review of biomedical datasets relating to drug discovery: a knowledge graph perspective.生物医学数据集在药物发现中的应用综述:知识图谱视角。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac404.
2
Implications of topological imbalance for representation learning on biomedical knowledge graphs.拓扑不平衡对生物医学知识图谱表示学习的影响。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac279.
3
Task-driven knowledge graph filtering improves prioritizing drugs for repurposing.任务驱动的知识图过滤可改善药物再利用的优先级排序。
用于糖尿病视网膜病变靶向代谢组学分析的混合可解释人工智能模型
Diagnostics (Basel). 2024 Jun 27;14(13):1364. doi: 10.3390/diagnostics14131364.
4
KGE-UNIT: toward the unification of molecular interactions prediction based on knowledge graph and multi-task learning on drug discovery.KGE-UNIT:基于知识图和药物发现的多任务学习的分子相互作用预测的统一方法。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae043.
BMC Bioinformatics. 2022 Mar 4;23(1):84. doi: 10.1186/s12859-022-04608-y.
4
A knowledge graph to interpret clinical proteomics data.一个解释临床蛋白质组学数据的知识图谱。
Nat Biotechnol. 2022 May;40(5):692-702. doi: 10.1038/s41587-021-01145-6. Epub 2022 Jan 31.
5
Bringing Light Into the Dark: A Large-Scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework.照亮黑暗:统一框架下知识图谱嵌入模型的大规模评估
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):8825-8845. doi: 10.1109/TPAMI.2021.3124805. Epub 2022 Nov 7.
6
Application of network link prediction in drug discovery.网络链接预测在药物发现中的应用。
BMC Bioinformatics. 2021 Apr 12;22(1):187. doi: 10.1186/s12859-021-04082-y.
7
Benchmark and Best Practices for Biomedical Knowledge Graph Embeddings.生物医学知识图谱嵌入的基准和最佳实践
Proc Conf Assoc Comput Linguist Meet. 2020 Jul;2020:167-176. doi: 10.18653/v1/2020.bionlp-1.18.
8
PharmKG: a dedicated knowledge graph benchmark for bomedical data mining.PharmKG:用于生物医学数据挖掘的专用知识图谱基准
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa344.
9
Drug2ways: Reasoning over causal paths in biological networks for drug discovery.Drug2ways:在生物网络中进行药物发现的因果路径推理。
PLoS Comput Biol. 2020 Dec 2;16(12):e1008464. doi: 10.1371/journal.pcbi.1008464. eCollection 2020 Dec.
10
Preclinical validation of therapeutic targets predicted by tensor factorization on heterogeneous graphs.基于异质图张量分解预测的治疗靶点的临床前验证。
Sci Rep. 2020 Oct 26;10(1):18250. doi: 10.1038/s41598-020-74922-z.