• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AptRank:一种用于生物关系图上蛋白质功能预测的自适应 PageRank 模型。

AptRank: an adaptive PageRank model for protein function prediction on   bi-relational graphs.

机构信息

Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.

Department of Mathematics, Purdue University, West Lafayette, IN, USA.

出版信息

Bioinformatics. 2017 Jun 15;33(12):1829-1836. doi: 10.1093/bioinformatics/btx029.

DOI:10.1093/bioinformatics/btx029
PMID:28200073
Abstract

MOTIVATION

Diffusion-based network models are widely used for protein function prediction using protein network data and have been shown to outperform neighborhood-based and module-based methods. Recent studies have shown that integrating the hierarchical structure of the Gene Ontology (GO) data dramatically improves prediction accuracy. However, previous methods usually either used the GO hierarchy to refine the prediction results of multiple classifiers, or flattened the hierarchy into a function-function similarity kernel. No study has taken the GO hierarchy into account together with the protein network as a two-layer network model.

RESULTS

We first construct a Bi-relational graph (Birg) model comprised of both protein-protein association and function-function hierarchical networks. We then propose two diffusion-based methods, BirgRank and AptRank, both of which use PageRank to diffuse information on this two-layer graph model. BirgRank is a direct application of traditional PageRank with fixed decay parameters. In contrast, AptRank utilizes an adaptive diffusion mechanism to improve the performance of BirgRank. We evaluate the ability of both methods to predict protein function on yeast, fly and human protein datasets, and compare with four previous methods: GeneMANIA, TMC, ProteinRank and clusDCA. We design four different validation strategies: missing function prediction, de novo function prediction, guided function prediction and newly discovered function prediction to comprehensively evaluate predictability of all six methods. We find that both BirgRank and AptRank outperform the previous methods, especially in missing function prediction when using only 10% of the data for training.

AVAILABILITY AND IMPLEMENTATION

The MATLAB code is available at https://github.rcac.purdue.edu/mgribsko/aptrank .

CONTACT

gribskov@purdue.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基于扩散的网络模型广泛用于使用蛋白质网络数据进行蛋白质功能预测,并且已经被证明优于基于邻域和基于模块的方法。最近的研究表明,整合基因本体论(GO)数据的层次结构可以显著提高预测准确性。然而,以前的方法通常要么使用 GO 层次结构来细化多个分类器的预测结果,要么将层次结构平展为功能-功能相似性核。没有研究将 GO 层次结构与蛋白质网络一起考虑作为两层网络模型。

结果

我们首先构建了一个由蛋白质-蛋白质相互作用和功能-功能层次网络组成的双关系图(Birg)模型。然后,我们提出了两种基于扩散的方法,BirgRank 和 AptRank,它们都使用 PageRank 在这个两层图模型上扩散信息。BirgRank 是传统 PageRank 的直接应用,具有固定的衰减参数。相比之下,AptRank 利用自适应扩散机制来提高 BirgRank 的性能。我们在酵母、果蝇和人类蛋白质数据集上评估了这两种方法预测蛋白质功能的能力,并与之前的四种方法进行了比较:GeneMANIA、TMC、ProteinRank 和 clusDCA。我们设计了四种不同的验证策略:缺失功能预测、从头功能预测、引导功能预测和新发现功能预测,以全面评估所有六种方法的可预测性。我们发现,BirgRank 和 AptRank 都优于之前的方法,特别是在仅使用 10%的数据进行训练时,在缺失功能预测方面表现出色。

可用性和实现

MATLAB 代码可在 https://github.rcac.purdue.edu/mgribsko/aptrank 获得。

联系人

gribskov@purdue.edu。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
AptRank: an adaptive PageRank model for protein function prediction on   bi-relational graphs.AptRank:一种用于生物关系图上蛋白质功能预测的自适应 PageRank 模型。
Bioinformatics. 2017 Jun 15;33(12):1829-1836. doi: 10.1093/bioinformatics/btx029.
2
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.
3
Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.Hum-mPLoc 3.0:通过对基因本体和功能域特征的隐藏相关性进行建模来增强人类蛋白质亚细胞定位预测
Bioinformatics. 2017 Mar 15;33(6):843-853. doi: 10.1093/bioinformatics/btw723.
4
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.DeepGO:使用深度本体感知分类器从序列和相互作用预测蛋白质功能。
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.
5
PrimAlign: PageRank-inspired Markovian alignment for large biological networks.PrimAlign:基于 PageRank 启发的马尔可夫对齐算法,用于大型生物网络。
Bioinformatics. 2018 Jul 1;34(13):i537-i546. doi: 10.1093/bioinformatics/bty288.
6
HEMDAG: a family of modular and scalable hierarchical ensemble methods to improve Gene Ontology term prediction.HEMDAG:一种模块化且可扩展的分层集成方法家族,用于改进基因本体术语预测。
Bioinformatics. 2021 Dec 7;37(23):4526-4533. doi: 10.1093/bioinformatics/btab485.
7
GLIDER: function prediction from GLIDE-based neighborhoods.GLIDER:基于 GLIDE 近邻的功能预测。
Bioinformatics. 2022 Jun 27;38(13):3395-3406. doi: 10.1093/bioinformatics/btac322.
8
NegGOA: negative GO annotations selection using ontology structure.NegGOA:基于本体结构的负 GO 注释选择。
Bioinformatics. 2016 Oct 1;32(19):2996-3004. doi: 10.1093/bioinformatics/btw366. Epub 2016 Jun 17.
9
Integrating multiple networks for protein function prediction.整合多个网络用于蛋白质功能预测。
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S3. doi: 10.1186/1752-0509-9-S1-S3. Epub 2015 Jan 21.
10
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations.Onto2Vec:基于向量的生物实体联合表示及其基于本体论的标注。
Bioinformatics. 2018 Jul 1;34(13):i52-i60. doi: 10.1093/bioinformatics/bty259.

引用本文的文献

1
A deep learning framework for predicting disease-gene associations with functional modules and graph augmentation.一种基于功能模块和图增强的深度学习框架,用于预测疾病-基因关联。
BMC Bioinformatics. 2024 Jun 14;25(1):214. doi: 10.1186/s12859-024-05841-3.
2
TO-UGDA: target-oriented unsupervised graph domain adaptation.TO-UGDA:面向目标的无监督图域适应
Sci Rep. 2024 Apr 22;14(1):9165. doi: 10.1038/s41598-024-59890-y.
3
PRRGO: A Tool for Visualizing and Mapping Globally Expressed Genes in Public Gene Expression Omnibus RNA-Sequencing Studies to PageRank-scored Gene Ontology Terms.
PRRGO:一种用于在公共基因表达综合RNA测序研究中,将全球表达基因可视化并映射到PageRank评分的基因本体术语的工具。
bioRxiv. 2024 Jan 24:2024.01.21.576540. doi: 10.1101/2024.01.21.576540.
4
Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-2.可解释的网络传播及其在扩展与 SARS-CoV-2 相互作用的人类蛋白质组中的应用。
Gigascience. 2021 Dec 29;10(12). doi: 10.1093/gigascience/giab082.
5
Multimodal Brain Network Jointly Construction and Fusion for Diagnosis of Epilepsy.用于癫痫诊断的多模态脑网络联合构建与融合
Front Neurosci. 2021 Sep 29;15:734711. doi: 10.3389/fnins.2021.734711. eCollection 2021.
6
Revealing latent traits in the social behavior of distance learning students.揭示远程学习学生社会行为中的潜在特征。
Educ Inf Technol (Dordr). 2022;27(3):3529-3565. doi: 10.1007/s10639-021-10742-6. Epub 2021 Sep 29.
7
The effect of statistical normalization on network propagation scores.统计归一化对网络传播评分的影响。
Bioinformatics. 2021 May 5;37(6):845-852. doi: 10.1093/bioinformatics/btaa896.
8
Predicting Protein Functions Based on Differential Co-expression and Neighborhood Analysis.基于差异共表达和邻域分析预测蛋白质功能。
J Comput Biol. 2021 Jan;28(1):1-18. doi: 10.1089/cmb.2019.0120. Epub 2020 Apr 17.
9
Reconstructing signaling pathways using regular language constrained paths.使用正则语言约束路径重建信号通路。
Bioinformatics. 2019 Jul 15;35(14):i624-i633. doi: 10.1093/bioinformatics/btz360.
10
Benchmarking network propagation methods for disease gene identification.用于疾病基因识别的网络传播方法的基准测试。
PLoS Comput Biol. 2019 Sep 3;15(9):e1007276. doi: 10.1371/journal.pcbi.1007276. eCollection 2019 Sep.