• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用知识图谱嵌入技术预测信息有限疾病的基因-疾病关联。

Predicting gene disease associations with knowledge graph embeddings for diseases with curtailed information.

作者信息

Gualdi Francesco, Oliva Baldomero, Piñero Janet

机构信息

Integrative Biomedical Informatics, Research Programme on Biomedical Informatics (IBI-GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, C/Dr Aiguader 88, E-08003 Barcelona, Spain.

Structural Bioinformatics Lab, Research Programme on Biomedical Informatics (SBI-GRIB), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, C/Dr Aiguader 88, E-08003 Barcelona, Spain.

出版信息

NAR Genom Bioinform. 2024 May 14;6(2):lqae049. doi: 10.1093/nargab/lqae049. eCollection 2024 Jun.

DOI:10.1093/nargab/lqae049
PMID:38745993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11091931/
Abstract

Knowledge graph embeddings (KGE) are a powerful technique used in the biomedical domain to represent biological knowledge in a low dimensional space. However, a deep understanding of these methods is still missing, and, in particular, regarding their applications to prioritize genes associated with complex diseases with reduced genetic information. In this contribution, we built a knowledge graph (KG) by integrating heterogeneous biomedical data and generated KGE by implementing state-of-the-art methods, and two novel algorithms: Dlemb and BioKG2vec. Extensive testing of the embeddings with unsupervised clustering and supervised methods showed that KGE can be successfully implemented to predict genes associated with diseases and that our novel approaches outperform most existing algorithms in both scenarios. Our findings underscore the significance of data quality, preprocessing, and integration in achieving accurate predictions. Additionally, we applied KGE to predict genes linked to Intervertebral Disc Degeneration (IDD) and illustrated that functions pertinent to the disease are enriched within the prioritized gene set.

摘要

知识图谱嵌入(KGE)是生物医学领域中用于在低维空间表示生物知识的一种强大技术。然而,对这些方法仍缺乏深入理解,尤其是在利用减少的遗传信息对与复杂疾病相关的基因进行优先级排序的应用方面。在本研究中,我们通过整合异构生物医学数据构建了一个知识图谱(KG),并通过实施最先进的方法以及两种新算法:Dlemb和BioKG2vec生成了KGE。使用无监督聚类和监督方法对嵌入进行的广泛测试表明,KGE可以成功用于预测与疾病相关的基因,并且我们的新方法在这两种情况下均优于大多数现有算法。我们的研究结果强调了数据质量、预处理和整合在实现准确预测中的重要性。此外,我们应用KGE预测与椎间盘退变(IDD)相关的基因,并表明在优先级基因集中富集了与该疾病相关的功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/6e09524c34b3/lqae049fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/940bd3cb543a/lqae049figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/5bccc383160d/lqae049fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/ed321c867ccd/lqae049fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/443b18ab2d40/lqae049fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/6e09524c34b3/lqae049fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/940bd3cb543a/lqae049figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/5bccc383160d/lqae049fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/ed321c867ccd/lqae049fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/443b18ab2d40/lqae049fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a3c8/11091931/6e09524c34b3/lqae049fig4.jpg

相似文献

1
Predicting gene disease associations with knowledge graph embeddings for diseases with curtailed information.利用知识图谱嵌入技术预测信息有限疾病的基因-疾病关联。
NAR Genom Bioinform. 2024 May 14;6(2):lqae049. doi: 10.1093/nargab/lqae049. eCollection 2024 Jun.
2
Community knowledge graph abstraction for enhanced link prediction: A study on PubMed knowledge graph.面向增强链接预测的社区知识图抽象:基于 PubMed 知识图的研究。
J Biomed Inform. 2024 Oct;158:104725. doi: 10.1016/j.jbi.2024.104725. Epub 2024 Sep 10.
3
Multi-domain knowledge graph embeddings for gene-disease association prediction.多领域知识图谱嵌入在基因-疾病关联预测中的应用。
J Biomed Semantics. 2023 Aug 14;14(1):11. doi: 10.1186/s13326-023-00291-x.
4
HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.HPO2Vec+:利用异构知识资源丰富人类表型本体的节点嵌入。
J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27.
5
Duality-Induced Regularizer for Semantic Matching Knowledge Graph Embeddings.用于语义匹配知识图谱嵌入的对偶诱导正则化器
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1652-1667. doi: 10.1109/TPAMI.2022.3161804. Epub 2023 Jan 6.
6
GFCNet: Utilizing graph feature collection networks for coronavirus knowledge graph embeddings.GFCNet:利用图特征收集网络进行冠状病毒知识图谱嵌入
Inf Sci (N Y). 2022 Aug;608:1557-1571. doi: 10.1016/j.ins.2022.07.031. Epub 2022 Jul 14.
7
Knowledge Graph Embeddings for ICU readmission prediction.知识图嵌入在 ICU 再入院预测中的应用。
BMC Med Inform Decis Mak. 2023 Jan 19;23(1):12. doi: 10.1186/s12911-022-02070-7.
8
A lightweight CNN-based knowledge graph embedding model with channel attention for link prediction.基于轻量级 CNN 的带通道注意力的知识图嵌入模型,用于链路预测。
Math Biosci Eng. 2023 Mar 21;20(6):9607-9624. doi: 10.3934/mbe.2023421.
9
Implications of topological imbalance for representation learning on biomedical knowledge graphs.拓扑不平衡对生物医学知识图谱表示学习的影响。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac279.
10
Drug-Drug Interaction Predictions via Knowledge Graph and Text Embedding: Instrument Validation Study.通过知识图谱和文本嵌入进行药物-药物相互作用预测:工具验证研究
JMIR Med Inform. 2021 Jun 24;9(6):e28277. doi: 10.2196/28277.

引用本文的文献

1
proteomic analysis of the role of IL-4 and IL-10 in IVD degeneration: Protein-protein interaction networks for candidate prioritisation.白细胞介素-4和白细胞介素-10在椎间盘退变中作用的蛋白质组学分析:用于候选优先级排序的蛋白质-蛋白质相互作用网络
Comput Struct Biotechnol J. 2025 Apr 14;27:1600-1613. doi: 10.1016/j.csbj.2025.04.015. eCollection 2025.
2
KG2ML: Integrating Knowledge Graphs and Positive Unlabeled Learning for Identifying Disease-Associated Genes.KG2ML:整合知识图谱与正例无标注学习以识别疾病相关基因
medRxiv. 2025 Mar 17:2025.03.17.25323906. doi: 10.1101/2025.03.17.25323906.
3
Cross-modal embedding integrator for disease-gene/protein association prediction using a multi-head attention mechanism.

本文引用的文献

1
Multi-domain knowledge graph embeddings for gene-disease association prediction.多领域知识图谱嵌入在基因-疾病关联预测中的应用。
J Biomed Semantics. 2023 Aug 14;14(1):11. doi: 10.1186/s13326-023-00291-x.
2
Knowledge Graphs: Opportunities and Challenges.知识图谱:机遇与挑战。
Artif Intell Rev. 2023 Apr 3:1-32. doi: 10.1007/s10462-023-10465-9.
3
A knowledge graph-based disease-gene prediction system using multi-relational graph convolution networks.基于知识图的多关系图卷积网络疾病-基因预测系统。
基于多头注意力机制的疾病-基因/蛋白质关联预测的跨模态嵌入集成器。
Pharmacol Res Perspect. 2024 Dec;12(6):e70034. doi: 10.1002/prp2.70034.
AMIA Annu Symp Proc. 2023 Apr 29;2022:468-476. eCollection 2022.
4
Integrating and formatting biomedical data as pre-calculated knowledge graph embeddings in the Bioteque.在 Bioteque 中整合和格式化生物医学数据作为预先计算的知识图嵌入。
Nat Commun. 2022 Sep 9;13(1):5304. doi: 10.1038/s41467-022-33026-0.
5
Immuno-Modulatory Effects of Intervertebral Disc Cells.椎间盘细胞的免疫调节作用
Front Cell Dev Biol. 2022 Jun 29;10:924692. doi: 10.3389/fcell.2022.924692. eCollection 2022.
6
Identifying disease-gene associations using a convolutional neural network-based model by embedding a biological knowledge graph with entity descriptions.基于卷积神经网络的模型,通过嵌入具有实体描述的生物知识图来识别疾病基因关联。
PLoS One. 2021 Oct 15;16(10):e0258626. doi: 10.1371/journal.pone.0258626. eCollection 2021.
7
Discovery and implications of polygenicity of common diseases.常见疾病多基因遗传的发现及其意义。
Science. 2021 Sep 24;373(6562):1468-1473. doi: 10.1126/science.abi8206. Epub 2021 Sep 23.
8
Identification of disease treatment mechanisms through the multiscale interactome.通过多尺度互作网络鉴定疾病治疗机制
Nat Commun. 2021 Mar 19;12(1):1796. doi: 10.1038/s41467-021-21770-8.
9
Graph embeddings on gene ontology annotations for protein-protein interaction prediction.基于基因本体论注释的图嵌入在蛋白质相互作用预测中的应用。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):560. doi: 10.1186/s12859-020-03816-8.
10
Semantic similarity and machine learning with ontologies.语义相似性和本体论的机器学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.