• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过在基因本体上进行向下随机游走预测蛋白质功能。

Predicting protein function via downward random walks on a gene ontology.

作者信息

Yu Guoxian, Zhu Hailong, Domeniconi Carlotta, Liu Jiming

机构信息

College of Computer and Information Sciences, Southwest University, Beibei, Chongqing, China.

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China.

出版信息

BMC Bioinformatics. 2015 Aug 27;16:271. doi: 10.1186/s12859-015-0713-y.

DOI:10.1186/s12859-015-0713-y
PMID:26310806
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4551531/
Abstract

BACKGROUND

High-throughput bio-techniques accumulate ever-increasing amount of genomic and proteomic data. These data are far from being functionally characterized, despite the advances in gene (or gene's product proteins) functional annotations. Due to experimental techniques and to the research bias in biology, the regularly updated functional annotation databases, i.e., the Gene Ontology (GO), are far from being complete. Given the importance of protein functions for biological studies and drug design, proteins should be more comprehensively and precisely annotated.

RESULTS

We proposed downward Random Walks (dRW) to predict missing (or new) functions of partially annotated proteins. Particularly, we apply downward random walks with restart on the GO directed acyclic graph, along with the available functions of a protein, to estimate the probability of missing functions. To further boost the prediction accuracy, we extend dRW to dRW-kNN. dRW-kNN computes the semantic similarity between proteins based on the functional annotations of proteins; it then predicts functions based on the functions estimated by dRW, together with the functions associated with the k nearest proteins. Our proposed models can predict two kinds of missing functions: (i) the ones that are missing for a protein but associated with other proteins of interest; (ii) the ones that are not available for any protein of interest, but exist in the GO hierarchy. Experimental results on the proteins of Yeast and Human show that dRW and dRW-kNN can replenish functions more accurately than other related approaches, especially for sparse functions associated with no more than 10 proteins.

CONCLUSION

The empirical study shows that the semantic similarity between GO terms and the ontology hierarchy play important roles in predicting protein function. The proposed dRW and dRW-kNN can serve as tools for replenishing functions of partially annotated proteins.

摘要

背景

高通量生物技术积累了数量不断增加的基因组和蛋白质组数据。尽管在基因(或基因的产物蛋白质)功能注释方面取得了进展,但这些数据的功能特征仍远未明确。由于实验技术和生物学研究的偏差,定期更新的功能注释数据库,即基因本体论(GO),远未完善。鉴于蛋白质功能在生物学研究和药物设计中的重要性,蛋白质应得到更全面、精确的注释。

结果

我们提出了向下随机游走(dRW)来预测部分注释蛋白质的缺失(或新的)功能。具体而言,我们在GO有向无环图上应用带重启的向下随机游走,并结合蛋白质的已知功能,来估计缺失功能的概率。为进一步提高预测准确性,我们将dRW扩展为dRW-kNN。dRW-kNN基于蛋白质的功能注释计算蛋白质之间的语义相似性;然后根据dRW估计的功能以及与k个最近蛋白质相关的功能来预测功能。我们提出的模型可以预测两种缺失功能:(i)蛋白质缺失但与其他感兴趣蛋白质相关的功能;(ii)任何感兴趣蛋白质都不具备但存在于GO层次结构中的功能。酵母和人类蛋白质的实验结果表明,dRW和dRW-kNN比其他相关方法能更准确地补充功能,特别是对于与不超过10种蛋白质相关的稀疏功能。

结论

实证研究表明,GO术语之间的语义相似性和本体层次结构在预测蛋白质功能中起着重要作用。所提出的dRW和dRW-kNN可作为补充部分注释蛋白质功能的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63bc/4551531/931bce1a9154/12859_2015_713_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63bc/4551531/e4c8436b01f3/12859_2015_713_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63bc/4551531/931bce1a9154/12859_2015_713_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63bc/4551531/e4c8436b01f3/12859_2015_713_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63bc/4551531/931bce1a9154/12859_2015_713_Fig2_HTML.jpg

相似文献

1
Predicting protein function via downward random walks on a gene ontology.通过在基因本体上进行向下随机游走预测蛋白质功能。
BMC Bioinformatics. 2015 Aug 27;16:271. doi: 10.1186/s12859-015-0713-y.
2
HashGO: hashing gene ontology for protein function prediction.HashGO:用于蛋白质功能预测的基因本体哈希法
Comput Biol Chem. 2017 Dec;71:264-273. doi: 10.1016/j.compbiolchem.2017.09.010. Epub 2017 Oct 4.
3
NMFGO: Gene Function Prediction via Nonnegative Matrix Factorization with Gene Ontology.NMFGO:基于基因本体论的非负矩阵分解进行基因功能预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):238-249. doi: 10.1109/TCBB.2018.2861379. Epub 2018 Jul 30.
4
NewGOA: Predicting New GO Annotations of Proteins by Bi-Random Walks on a Hybrid Graph.NewGOA:基于混合图双随机游走的蛋白质新 GO 注释预测。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1390-1402. doi: 10.1109/TCBB.2017.2715842. Epub 2017 Jun 15.
5
Information theory applied to the sparse gene ontology annotation network to predict novel gene function.信息论应用于稀疏基因本体注释网络以预测新的基因功能。
Bioinformatics. 2007 Jul 1;23(13):i529-38. doi: 10.1093/bioinformatics/btm195.
6
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
7
Predicting functions of maize proteins using graph convolutional network.利用图卷积网络预测玉米蛋白的功能。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):420. doi: 10.1186/s12859-020-03745-6.
8
Improving GO semantic similarity measures by exploring the ontology beneath the terms and modelling uncertainty.通过探索术语下的本体和建模不确定性来改进 GO 语义相似性度量。
Bioinformatics. 2012 May 15;28(10):1383-9. doi: 10.1093/bioinformatics/bts129. Epub 2012 Apr 19.
9
GO functional similarity clustering depends on similarity measure, clustering method, and annotation completeness.GO 功能相似性聚类取决于相似性度量、聚类方法和注释完整性。
BMC Bioinformatics. 2019 Mar 27;20(1):155. doi: 10.1186/s12859-019-2752-2.
10
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.

引用本文的文献

1
Single-Cell Resolution of Individual Variation in Hypothalamic Neurons Allows Targeted Manipulation Affecting Social Motivation.下丘脑神经元个体差异的单细胞分辨率能够实现影响社会动机的靶向操纵。
bioRxiv. 2025 Mar 11:2025.03.10.642464. doi: 10.1101/2025.03.10.642464.
2
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.基于基因本体建模的基因功能预测文献综述
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
3
Optimizing gene set annotations combining GO structure and gene expression data.

本文引用的文献

1
Genome-Wide Protein Function Prediction through Multi-Instance Multi-Label Learning.通过多示例多标签学习进行全基因组蛋白质功能预测
IEEE/ACM Trans Comput Biol Bioinform. 2014 Sep-Oct;11(5):891-902. doi: 10.1109/TCBB.2014.2323058.
2
Protein Function Prediction with Incomplete Annotations.利用不完整注释进行蛋白质功能预测。
IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):579-91. doi: 10.1109/TCBB.2013.142.
3
Hierarchical ensemble methods for protein function prediction.用于蛋白质功能预测的分层集成方法。
结合基因本体结构和基因表达数据优化基因集注释
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):133. doi: 10.1186/s12918-018-0659-6.
4
Gene ontology improves template selection in comparative protein docking.基因本体论改进了比较蛋白质对接中的模板选择。
Proteins. 2019 Mar;87(3):245-253. doi: 10.1002/prot.25645. Epub 2018 Dec 27.
5
Protein Function Prediction Using Deep Restricted Boltzmann Machines.使用深度受限玻尔兹曼机进行蛋白质功能预测。
Biomed Res Int. 2017;2017:1729301. doi: 10.1155/2017/1729301. Epub 2017 Jun 28.
6
NoGOA: predicting noisy GO annotations using evidences and sparse representation.NoGOA:利用证据和稀疏表示预测有噪声的基因本体注释
BMC Bioinformatics. 2017 Jul 21;18(1):350. doi: 10.1186/s12859-017-1764-z.
7
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
8
SGFSC: speeding the gene functional similarity calculation based on hash tables.SGFSC:基于哈希表加速基因功能相似性计算
BMC Bioinformatics. 2016 Nov 4;17(1):445. doi: 10.1186/s12859-016-1294-0.
ISRN Bioinform. 2014 May 4;2014:901419. doi: 10.1155/2014/901419. eCollection 2014.
4
Predicting protein functions using incomplete hierarchical labels.使用不完整的层次标签预测蛋白质功能。
BMC Bioinformatics. 2015 Jan 16;16:1. doi: 10.1186/s12859-014-0430-y.
5
The impact of incomplete knowledge on the evaluation of protein function prediction: a structured-output learning perspective.知识不完整对蛋白质功能预测评估的影响:结构化输出学习视角
Bioinformatics. 2014 Sep 1;30(17):i609-16. doi: 10.1093/bioinformatics/btu472.
6
The automated function prediction SIG looks back at 2013 and prepares for 2014.自动化功能预测 SIG 回顾 2013 年并为 2014 年做准备。
Bioinformatics. 2014 Jul 15;30(14):2091-2. doi: 10.1093/bioinformatics/btu117. Epub 2014 Mar 3.
7
CAFA and the open world of protein function predictions.计算机辅助功能注释(CAFA)与蛋白质功能预测的开放世界
Trends Genet. 2013 Nov;29(11):609-10. doi: 10.1016/j.tig.2013.09.005. Epub 2013 Oct 15.
8
Exploiting disjointness axioms to improve semantic similarity measures.利用不相交公理提高语义相似性度量。
Bioinformatics. 2013 Nov 1;29(21):2781-7. doi: 10.1093/bioinformatics/btt491. Epub 2013 Sep 3.
9
Information-theoretic evaluation of predicted ontological annotations.基于信息论的预测本体论注释评估。
Bioinformatics. 2013 Jul 1;29(13):i53-61. doi: 10.1093/bioinformatics/btt228.
10
Biases in the experimental annotations of protein function and their effect on our understanding of protein function space.蛋白质功能的实验注释中的偏差及其对我们理解蛋白质功能空间的影响。
PLoS Comput Biol. 2013;9(5):e1003063. doi: 10.1371/journal.pcbi.1003063. Epub 2013 May 30.