• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用模式约束知识图嵌入技术预测研究不足的暗激酶的蛋白质和途径关联。

Predicting protein and pathway associations for understudied dark kinases using pattern-constrained knowledge graph embedding.

机构信息

Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, United States of America.

Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America.

出版信息

PeerJ. 2023 Oct 18;11:e15815. doi: 10.7717/peerj.15815. eCollection 2023.

DOI:10.7717/peerj.15815
PMID:37868056
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590106/
Abstract

The 534 protein kinases encoded in the human genome constitute a large druggable class of proteins that include both well-studied and understudied "dark" members. Accurate prediction of dark kinase functions is a major bioinformatics challenge. Here, we employ a graph mining approach that uses the evolutionary and functional context encoded in knowledge graphs (KGs) to predict protein and pathway associations for understudied kinases. We propose a new scalable graph embedding approach, RegPattern2Vec, which employs regular pattern constrained random walks to sample diverse aspects of node context within a KG flexibly. RegPattern2Vec learns functional representations of kinases, interacting partners, post-translational modifications, pathways, cellular localization, and chemical interactions from a kinase-centric KG that integrates and conceptualizes data from curated heterogeneous data resources. By contextualizing information relevant to prediction, RegPattern2Vec improves accuracy and efficiency in comparison to other random walk-based graph embedding approaches. We show that the predictions produced by our model overlap with pathway enrichment data produced using experimentally validated Protein-Protein Interaction (PPI) data from both publicly available databases and experimental datasets not used in training. Our model also has the advantage of using the collected random walks as biological context to interpret the predicted protein-pathway associations. We provide high-confidence pathway predictions for 34 dark kinases and present three case studies in which analysis of meta-paths associated with the prediction enables biological interpretation. Overall, RegPattern2Vec efficiently samples multiple node types for link prediction on biological knowledge graphs and the predicted associations between understudied kinases, pseudokinases, and known pathways serve as a conceptual starting point for hypothesis generation and testing.

摘要

人类基因组中编码的 534 种蛋白激酶构成了一个庞大的可成药蛋白类,其中包括研究充分和研究不足的“暗”激酶成员。准确预测暗激酶的功能是一个主要的生物信息学挑战。在这里,我们采用了一种图挖掘方法,该方法使用知识图(KG)中编码的进化和功能上下文来预测研究不足的激酶的蛋白质和途径关联。我们提出了一种新的可扩展图嵌入方法 RegPattern2Vec,它使用正则模式约束的随机游走灵活地在 KG 中采样节点上下文的多个方面。RegPattern2Vec 从以激酶为中心的 KG 中学习激酶、相互作用伙伴、翻译后修饰、途径、细胞定位和化学相互作用的功能表示,该 KG 集成和概念化了来自精心策划的异构数据资源的数据。通过将与预测相关的信息置于上下文中,RegPattern2Vec 与其他基于随机游走的图嵌入方法相比,提高了准确性和效率。我们表明,我们的模型产生的预测与使用来自公共可用数据库和未用于训练的实验数据集的实验验证的蛋白质-蛋白质相互作用(PPI)数据生成的途径富集数据重叠。我们的模型还具有使用收集的随机游走作为生物背景来解释预测的蛋白质-途径关联的优势。我们为 34 种暗激酶提供了高可信度的途径预测,并提供了三个案例研究,其中与预测相关的元路径的分析使生物学解释成为可能。总体而言,RegPattern2Vec 可以有效地对生物知识图上的多个节点类型进行链接预测,并且对研究不足的激酶、假激酶和已知途径之间的预测关联可以作为生成和测试假说的概念起点。

相似文献

1
Predicting protein and pathway associations for understudied dark kinases using pattern-constrained knowledge graph embedding.利用模式约束知识图嵌入技术预测研究不足的暗激酶的蛋白质和途径关联。
PeerJ. 2023 Oct 18;11:e15815. doi: 10.7717/peerj.15815. eCollection 2023.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Short-Term Memory Impairment短期记忆障碍
4
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
5
Advancing edge-based clustering and graph embedding for biological network analysis: a case study in RASopathies.用于生物网络分析的基于前沿的聚类和图嵌入:以RASopathies为例的研究
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf320.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
10
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.

引用本文的文献

1
Identifying and evaluating understudied protein kinases using biological and chemical criteria.使用生物学和化学标准鉴定和评估研究不足的蛋白激酶。
RSC Med Chem. 2025 Jun 5. doi: 10.1039/d5md00306g.
2
Heterogeneous network approaches to protein pathway prediction.用于蛋白质通路预测的异构网络方法。
Comput Struct Biotechnol J. 2024 Jun 27;23:2727-2739. doi: 10.1016/j.csbj.2024.06.022. eCollection 2024 Dec.
3
Informatic challenges and advances in illuminating the druggable proteome.阐明可成药蛋白质组中的信息学挑战与进展。

本文引用的文献

1
Dark kinase annotation, mining, and visualization using the Protein Kinase Ontology.利用蛋白激酶本体进行暗激酶注释、挖掘和可视化。
PeerJ. 2023 Dec 5;11:e16087. doi: 10.7717/peerj.16087. eCollection 2023.
2
Automated assembly of molecular mechanisms at scale from text mining and curated databases.从文本挖掘和经过整理的数据库中大规模自动组装分子机制。
Mol Syst Biol. 2023 May 9;19(5):e11325. doi: 10.15252/msb.202211325. Epub 2023 Mar 20.
3
An atlas of substrate specificities for the human serine/threonine kinome.人类丝氨酸/苏氨酸激酶组的底物特异性图谱
Drug Discov Today. 2024 Mar;29(3):103894. doi: 10.1016/j.drudis.2024.103894. Epub 2024 Jan 22.
Nature. 2023 Jan;613(7945):759-766. doi: 10.1038/s41586-022-05575-3. Epub 2023 Jan 11.
4
Pharos 2023: an integrated resource for the understudied human proteome.Pharos 2023:一个针对人类蛋白质组中未被充分研究的部分的综合资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1405-D1416. doi: 10.1093/nar/gkac1033.
5
Evolutionary and cellular analysis of the 'dark' pseudokinase PSKH2.“暗”假激酶 PSKH2 的进化和细胞分析。
Biochem J. 2023 Jan 31;480(2):141-160. doi: 10.1042/BCJ20220474.
6
A R mediated modulation in IP levels altering the [Ca] through cAMP-dependent PKA signalling pathway.A R 介导的 IP 水平调节,通过 cAMP 依赖的 PKA 信号通路改变 [Ca]。
Biochim Biophys Acta Gen Subj. 2022 Dec;1866(12):130242. doi: 10.1016/j.bbagen.2022.130242. Epub 2022 Sep 6.
7
A knowledge graph representation learning approach to predict novel kinase-substrate interactions.一种基于知识图表示学习的方法,用于预测新型激酶-底物相互作用。
Mol Omics. 2022 Oct 31;18(9):853-864. doi: 10.1039/d1mo00521a.
8
Implications of topological imbalance for representation learning on biomedical knowledge graphs.拓扑不平衡对生物医学知识图谱表示学习的影响。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac279.
9
NCMD: Node2vec-Based Neural Collaborative Filtering for Predicting MiRNA-Disease Association.NCMD:基于Node2vec的神经协同过滤用于预测微小RNA与疾病的关联
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1257-1268. doi: 10.1109/TCBB.2022.3191972. Epub 2023 Apr 3.
10
Supervised learning with word embeddings derived from PubMed captures latent knowledge about protein kinases and cancer.利用源自PubMed的词嵌入进行监督学习,可捕获有关蛋白激酶和癌症的潜在知识。
NAR Genom Bioinform. 2021 Dec 8;3(4):lqab113. doi: 10.1093/nargab/lqab113. eCollection 2021 Dec.