• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TarKG:一个全面的生物医学知识图谱,用于目标发现。

TarKG: a comprehensive biomedical knowledge graph for target discovery.

机构信息

Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Department of Medicinal Chemistry, West China School of Pharmacy, Sichuan University, Chengdu 610041, China.

Division of Data Intelligence, Department of Computer Science, Shantou University, Shantou 515063, China.

出版信息

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae598.

DOI:10.1093/bioinformatics/btae598
PMID:39392404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11513019/
Abstract

MOTIVATION

Target discovery is a crucial step in drug development, as it directly affects the success rate of clinical trials. Knowledge graphs (KGs) offer unique advantages in processing complex biological data and inferring new relationships. Existing biomedical KGs primarily focus on tasks such as drug repositioning and drug-target interactions, leaving a gap in the construction of KGs tailored for target discovery.

RESULTS

We established a comprehensive biomedical KG focusing on target discovery, termed TarKG, by integrating seven existing biomedical KGs, nine public databases, and traditional Chinese medicine knowledge databases. TarKG consists of 1 143 313 entities and 32 806 467 relations across 15 entity categories and 171 relation types, all centered around 3 core entity types: Disease, Gene, and Compound. TarKG provides specialized knowledges for the core entities including chemical structures, protein sequences, or text descriptions. By using different KG embedding algorithms, we assessed the knowledge completion capabilities of TarKG, particularly for disease-target link prediction. In case studies, we further examined TarKG's ability to predict potential protein targets for Alzheimer's disease (AD) and to identify diseases potentially associated with the metallo-deubiquitinase CSN5, using literature analysis for validation. Furthermore, we provided a user-friendly web server (https://tarkg.ddtmlab.org) that enables users to perform knowledge retrieval and relation inference using TarKG.

AVAILABILITY AND IMPLEMENTATION

TarKG is accessible at https://tarkg.ddtmlab.org.

摘要

动机

靶点发现是药物开发的关键步骤,因为它直接影响临床试验的成功率。知识图谱 (KG) 在处理复杂的生物数据和推断新的关系方面具有独特的优势。现有的生物医学知识图谱主要集中在药物重定位和药物-靶点相互作用等任务上,而在构建针对靶点发现的知识图谱方面存在空白。

结果

我们通过整合七个现有的生物医学知识图谱、九个公共数据库和中药知识数据库,建立了一个全面的生物医学知识图谱,专门用于靶点发现,称为 TarKG。TarKG 由 1143313 个实体和 32806467 个关系组成,涵盖 15 个实体类别和 171 个关系类型,所有这些都以 3 个核心实体类型为中心:疾病、基因和化合物。TarKG 为核心实体提供了专门的知识,包括化学结构、蛋白质序列或文本描述。通过使用不同的 KG 嵌入算法,我们评估了 TarKG 的知识完成能力,特别是针对疾病-靶点链接预测。在案例研究中,我们进一步检验了 TarKG 预测阿尔茨海默病(AD)潜在蛋白靶点的能力,并通过文献分析进行验证,以识别可能与金属去泛素酶 CSN5 相关的疾病。此外,我们提供了一个用户友好的网络服务器 (https://tarkg.ddtmlab.org),允许用户使用 TarKG 进行知识检索和关系推断。

可用性和实现

TarKG 可在 https://tarkg.ddtmlab.org 访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/723e382c35de/btae598f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/efdf7d64dac6/btae598f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/4ebd268a81fd/btae598f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/0ec2fc07b265/btae598f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/723e382c35de/btae598f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/efdf7d64dac6/btae598f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/4ebd268a81fd/btae598f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/0ec2fc07b265/btae598f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fcad/11513019/723e382c35de/btae598f4.jpg

相似文献

1
TarKG: a comprehensive biomedical knowledge graph for target discovery.TarKG:一个全面的生物医学知识图谱,用于目标发现。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae598.
2
Alzheimer's Disease Knowledge Graph Enhances Knowledge Discovery and Disease Prediction.阿尔茨海默病知识图谱增强知识发现与疾病预测。
bioRxiv. 2024 Jul 5:2024.07.03.601339. doi: 10.1101/2024.07.03.601339.
3
Knowledge graphs and their applications in drug discovery.知识图谱及其在药物发现中的应用。
Expert Opin Drug Discov. 2021 Sep;16(9):1057-1069. doi: 10.1080/17460441.2021.1910673. Epub 2021 Apr 12.
4
Graph embedding-based link prediction for literature-based discovery in Alzheimer's Disease.基于图嵌入的阿尔茨海默病文献发现链路预测。
J Biomed Inform. 2023 Sep;145:104464. doi: 10.1016/j.jbi.2023.104464. Epub 2023 Aug 2.
5
Accurate prediction of drug-target interactions in Chinese and western medicine by the CWI-DTI model.基于 CWI-DTI 模型的中西医药物靶点相互作用的准确预测。
Sci Rep. 2024 Oct 23;14(1):25054. doi: 10.1038/s41598-024-76367-0.
6
iDrug: Integration of drug repositioning and drug-target prediction via cross-network embedding.iDrug:通过跨网络嵌入实现药物重定位和药物靶点预测的整合。
PLoS Comput Biol. 2020 Jul 15;16(7):e1008040. doi: 10.1371/journal.pcbi.1008040. eCollection 2020 Jul.
7
Computational Drug Repositioning: A Lateral Approach to Traditional Drug Discovery?计算药物重新定位:传统药物发现的一种横向方法?
Curr Top Med Chem. 2016;16(19):2069-77. doi: 10.2174/1568026616666160216153249.
8
DrugRepoBank: a comprehensive database and discovery platform for accelerating drug repositioning.DrugRepoBank:一个全面的数据库和发现平台,用于加速药物重新定位。
Database (Oxford). 2024 Jul 11;2024. doi: 10.1093/database/baae051.
9
BioBLP: a modular framework for learning on multimodal biomedical knowledge graphs.BioBLP:一种用于多模态生物医学知识图谱学习的模块化框架。
J Biomed Semantics. 2023 Dec 8;14(1):20. doi: 10.1186/s13326-023-00301-y.
10
Expediting knowledge acquisition by a web framework for Knowledge Graph Exploration and Visualization (KGEV): case studies on COVID-19 and Human Phenotype Ontology.利用知识图谱探索和可视化(KGEV)的网络框架加速知识获取:COVID-19 和人类表型本体论的案例研究。
BMC Med Inform Decis Mak. 2022 Jun 2;22(Suppl 2):147. doi: 10.1186/s12911-022-01848-z.

引用本文的文献

1
Harnessing artificial intelligence for brain disease: advances in diagnosis, drug discovery, and closed-loop therapeutics.利用人工智能应对脑部疾病:诊断、药物研发及闭环治疗方面的进展
Front Neurol. 2025 Jul 28;16:1615523. doi: 10.3389/fneur.2025.1615523. eCollection 2025.

本文引用的文献

1
Knowledge Graph Convolutional Network with Heuristic Search for Drug Repositioning.基于启发式搜索的知识图卷积网络在药物重定位中的应用。
J Chem Inf Model. 2024 Jun 24;64(12):4928-4937. doi: 10.1021/acs.jcim.4c00737. Epub 2024 Jun 5.
2
PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge.PubTator 3.0:一款人工智能驱动的文献资源,用于解锁生物医学知识。
Nucleic Acids Res. 2024 Jul 5;52(W1):W540-W546. doi: 10.1093/nar/gkae235.
3
2023 FDA approvals: unprecedented volume at moderate value.2023年美国食品药品监督管理局的批准情况:数量空前但价值一般。
Nat Rev Drug Discov. 2024 Feb;23(2):98. doi: 10.1038/d41573-024-00011-9.
4
ccTCM: A quantitative component and compound platform for promoting the research of traditional Chinese medicine.中医成分与化合物定量平台(ccTCM):推动中医药研究的定量成分与化合物平台
Comput Struct Biotechnol J. 2023 Nov 20;21:5807-5817. doi: 10.1016/j.csbj.2023.11.030. eCollection 2023.
5
The DO-KB Knowledgebase: a 20-year journey developing the disease open science ecosystem.DO-KB 知识库:开发疾病开放科学生态系统的 20 年历程。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1305-D1314. doi: 10.1093/nar/gkad1051.
6
DrugBank 6.0: the DrugBank Knowledgebase for 2024.DrugBank 6.0:2024 年版 DrugBank 知识库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1265-D1275. doi: 10.1093/nar/gkad976.
7
The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.2023 年的 ChEMBL 数据库:一个涵盖多种生物活性数据类型和时间段的药物发现平台。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1180-D1192. doi: 10.1093/nar/gkad1004.
8
BATMAN-TCM 2.0: an enhanced integrative database for known and predicted interactions between traditional Chinese medicine ingredients and target proteins.BATMAN-TCM 2.0:一个增强型的综合性数据库,用于已知和预测的中药成分与靶蛋白之间的相互作用。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1110-D1120. doi: 10.1093/nar/gkad926.
9
Expanding PROTACtable genome universe of E3 ligases.拓展 E3 连接酶的 PROTAC 可及基因组范围。
Nat Commun. 2023 Oct 16;14(1):6509. doi: 10.1038/s41467-023-42233-2.
10
DCABM-TCM: A Database of Constituents Absorbed into the Blood and Metabolites of Traditional Chinese Medicine.DCABM-TCM:一个中药成分进入血液和代谢产物的数据库。
J Chem Inf Model. 2023 Aug 14;63(15):4948-4959. doi: 10.1021/acs.jcim.3c00365. Epub 2023 Jul 24.