• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NegGOA:基于本体结构的负 GO 注释选择。

NegGOA: negative GO annotations selection using ontology structure.

机构信息

College of Computer and Information Science, Southwest University, Chongqing 400715, China.

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.

出版信息

Bioinformatics. 2016 Oct 1;32(19):2996-3004. doi: 10.1093/bioinformatics/btw366. Epub 2016 Jun 17.

DOI:10.1093/bioinformatics/btw366
PMID:27318205
Abstract

MOTIVATION

Predicting the biological functions of proteins is one of the key challenges in the post-genomic era. Computational models have demonstrated the utility of applying machine learning methods to predict protein function. Most prediction methods explicitly require a set of negative examples-proteins that are known not carrying out a particular function. However, Gene Ontology (GO) almost always only provides the knowledge that proteins carry out a particular function, and functional annotations of proteins are incomplete. GO structurally organizes more than tens of thousands GO terms and a protein is annotated with several (or dozens) of these terms. For these reasons, the negative examples of a protein can greatly help distinguishing true positive examples of the protein from such a large candidate GO space.

RESULTS

In this paper, we present a novel approach (called NegGOA) to select negative examples. Specifically, NegGOA takes advantage of the ontology structure, available annotations and potentiality of additional annotations of a protein to choose negative examples of the protein. We compare NegGOA with other negative examples selection algorithms and find that NegGOA produces much fewer false negatives than them. We incorporate the selected negative examples into an efficient function prediction model to predict the functions of proteins in Yeast, Human, Mouse and Fly. NegGOA also demonstrates improved accuracy than these comparing algorithms across various evaluation metrics. In addition, NegGOA is less suffered from incomplete annotations of proteins than these comparing methods.

AVAILABILITY AND IMPLEMENTATION

The Matlab and R codes are available at https://sites.google.com/site/guoxian85/neggoa

CONTACT

gxyu@swu.edu.cn

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

预测蛋白质的生物功能是后基因组时代的主要挑战之一。计算模型已经证明了应用机器学习方法来预测蛋白质功能的有效性。大多数预测方法明确需要一组负例——已知不执行特定功能的蛋白质。然而,GO(Gene Ontology)几乎总是只提供蛋白质执行特定功能的知识,并且蛋白质的功能注释是不完整的。GO 结构上组织了超过数万 GO 术语,并且一个蛋白质被注释了几个(或几十个)这些术语。由于这些原因,蛋白质的负例可以极大地帮助从如此大的候选 GO 空间中区分蛋白质的真正阳性例子。

结果

在本文中,我们提出了一种选择负例的新方法(称为 NegGOA)。具体来说,NegGOA 利用本体结构、蛋白质的可用注释和潜在的额外注释来选择蛋白质的负例。我们将 NegGOA 与其他负例选择算法进行比较,发现 NegGOA 产生的假阴性比它们少得多。我们将选择的负例纳入到一个有效的功能预测模型中,以预测酵母、人类、小鼠和果蝇中的蛋白质功能。NegGOA 在各种评估指标上的准确性也优于这些比较算法。此外,NegGOA 比这些比较方法受蛋白质注释不完整的影响更小。

可用性和实现

Matlab 和 R 代码可在 https://sites.google.com/site/guoxian85/neggoa 获得。

联系信息

gxyu@swu.edu.cn

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
NegGOA: negative GO annotations selection using ontology structure.NegGOA:基于本体结构的负 GO 注释选择。
Bioinformatics. 2016 Oct 1;32(19):2996-3004. doi: 10.1093/bioinformatics/btw366. Epub 2016 Jun 17.
2
Information theory applied to the sparse gene ontology annotation network to predict novel gene function.信息论应用于稀疏基因本体注释网络以预测新的基因功能。
Bioinformatics. 2007 Jul 1;23(13):i529-38. doi: 10.1093/bioinformatics/btm195.
3
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations.Onto2Vec:基于向量的生物实体联合表示及其基于本体论的标注。
Bioinformatics. 2018 Jul 1;34(13):i52-i60. doi: 10.1093/bioinformatics/bty259.
4
Evaluating the impact of topological protein features on the negative examples selection.评估拓扑蛋白特征对负例选择的影响。
BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):417. doi: 10.1186/s12859-018-2385-x.
5
NewGOA: Predicting New GO Annotations of Proteins by Bi-Random Walks on a Hybrid Graph.NewGOA:基于混合图双随机游走的蛋白质新 GO 注释预测。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1390-1402. doi: 10.1109/TCBB.2017.2715842. Epub 2017 Jun 15.
6
Transfer learning across ontologies for phenome-genome association prediction.跨本体的迁移学习用于表型-基因组关联预测。
Bioinformatics. 2017 Feb 15;33(4):529-536. doi: 10.1093/bioinformatics/btw649.
7
NoGOA: predicting noisy GO annotations using evidences and sparse representation.NoGOA:利用证据和稀疏表示预测有噪声的基因本体注释
BMC Bioinformatics. 2017 Jul 21;18(1):350. doi: 10.1186/s12859-017-1764-z.
8
Exploiting ontology graph for predicting sparsely annotated gene function.利用本体图预测注释稀疏的基因功能。
Bioinformatics. 2015 Jun 15;31(12):i357-64. doi: 10.1093/bioinformatics/btv260.
9
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks.通过结合基因本体注释和基因共功能网络来测量语义相似性。
BMC Bioinformatics. 2015 Feb 14;16:44. doi: 10.1186/s12859-015-0474-7.
10
Co-complex protein membership evaluation using Maximum Entropy on GO ontology and InterPro annotation.使用最大熵方法对 GO 本体论和 InterPro 注释进行共复合物蛋白成员评估。
Bioinformatics. 2018 Jun 1;34(11):1884-1892. doi: 10.1093/bioinformatics/btx803.

引用本文的文献

1
A multi-source molecular network representation model for protein-protein interactions prediction.一种用于蛋白质相互作用预测的多源分子网络表示模型。
Sci Rep. 2024 Mar 14;14(1):6184. doi: 10.1038/s41598-024-56286-w.
2
Computational Methods for Prediction of Human Protein-Phenotype Associations: A Review.人类蛋白质-表型关联预测的计算方法:综述
Phenomics. 2021 Aug 6;1(4):171-185. doi: 10.1007/s43657-021-00019-w. eCollection 2021 Aug.
3
Predicting functions of maize proteins using graph convolutional network.利用图卷积网络预测玉米蛋白的功能。
BMC Bioinformatics. 2020 Dec 16;21(Suppl 16):420. doi: 10.1186/s12859-020-03745-6.
4
Automatic Gene Function Prediction in the 2020's.21 世纪的自动基因功能预测。
Genes (Basel). 2020 Oct 27;11(11):1264. doi: 10.3390/genes11111264.
5
Benchmarking gene ontology function predictions using negative annotations.利用负注释进行基因本体论功能预测的基准测试。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i210-i218. doi: 10.1093/bioinformatics/btaa466.
6
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.基于基因本体建模的基因功能预测文献综述
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
7
Novel comparison of evaluation metrics for gene ontology classifiers reveals drastic performance differences.新型基因本体论分类器评估指标比较揭示出显著的性能差异。
PLoS Comput Biol. 2019 Nov 4;15(11):e1007419. doi: 10.1371/journal.pcbi.1007419. eCollection 2019 Nov.
8
Prediction of Human Phenotype Ontology terms by means of hierarchical ensemble methods.通过分层集成方法预测人类表型本体术语
BMC Bioinformatics. 2017 Oct 12;18(1):449. doi: 10.1186/s12859-017-1854-y.
9
NoGOA: predicting noisy GO annotations using evidences and sparse representation.NoGOA:利用证据和稀疏表示预测有噪声的基因本体注释
BMC Bioinformatics. 2017 Jul 21;18(1):350. doi: 10.1186/s12859-017-1764-z.
10
SGFSC: speeding the gene functional similarity calculation based on hash tables.SGFSC:基于哈希表加速基因功能相似性计算
BMC Bioinformatics. 2016 Nov 4;17(1):445. doi: 10.1186/s12859-016-1294-0.