计算推断的基因本体论注释的质量。

Quality of computationally inferred gene ontology annotations.

机构信息

Ruđer Bošković Institute, Division of Electronics, Zagreb, Croatia.

出版信息

PLoS Comput Biol. 2012 May;8(5):e1002533. doi: 10.1371/journal.pcbi.1002533. Epub 2012 May 31.

DOI:10.1371/journal.pcbi.1002533

PMID:22693439

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3364937/

Abstract

Gene Ontology (GO) has established itself as the undisputed standard for protein function annotation. Most annotations are inferred electronically, i.e. without individual curator supervision, but they are widely considered unreliable. At the same time, we crucially depend on those automated annotations, as most newly sequenced genomes are non-model organisms. Here, we introduce a methodology to systematically and quantitatively evaluate electronic annotations. By exploiting changes in successive releases of the UniProt Gene Ontology Annotation database, we assessed the quality of electronic annotations in terms of specificity, reliability, and coverage. Overall, we not only found that electronic annotations have significantly improved in recent years, but also that their reliability now rivals that of annotations inferred by curators when they use evidence other than experiments from primary literature. This work provides the means to identify the subset of electronic annotations that can be relied upon-an important outcome given that >98% of all annotations are inferred without direct curation.

摘要

基因本体论 (GO) 已成为蛋白质功能注释的无可争议的标准。大多数注释都是通过电子方式推断出来的，即没有单独的注释员监督，但它们被广泛认为是不可靠的。与此同时，我们又严重依赖这些自动注释，因为大多数新测序的基因组是非模式生物。在这里，我们引入了一种系统地和定量地评估电子注释的方法。通过利用 UniProt 基因本体论注释数据库的连续版本的变化，我们从特异性、可靠性和覆盖范围等方面评估了电子注释的质量。总的来说，我们不仅发现电子注释近年来有了显著的改进，而且它们的可靠性现在与注释员使用来自主要文献的实验以外的证据进行推断时的可靠性相当。这项工作为识别可以信赖的电子注释子集提供了手段，鉴于超过 98%的注释都是在没有直接注释的情况下推断出来的，这是一个重要的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0e9/3364937/201793fbfaac/pcbi.1002533.g001.jpg

相似文献

Quality of computationally inferred gene ontology annotations.计算推断的基因本体论注释的质量。

PLoS Comput Biol. 2012 May;8(5):e1002533. doi: 10.1371/journal.pcbi.1002533. Epub 2012 May 31.

CvManGO, a method for leveraging computational predictions to improve literature-based Gene Ontology annotations.CvManGO，一种利用计算预测来改进基于文献的基因本体论注释的方法。

Database (Oxford). 2012 Mar 20;2012:bas001. doi: 10.1093/database/bas001. Print 2012.

Mining GO annotations for improving annotation consistency.挖掘 GO 注释以提高注释一致性。

PLoS One. 2012;7(7):e40519. doi: 10.1371/journal.pone.0040519. Epub 2012 Jul 25.

Assessment of community-submitted ontology annotations from a novel database-journal partnership.评估来自新型数据库-期刊合作的社区提交的本体注释。

Database (Oxford). 2012 Aug 1;2012:bas030. doi: 10.1093/database/bas030. Print 2012.

Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study.利用计算预测改进基于文献的基因本体论注释：一项可行性研究。

Database (Oxford). 2011 Mar 15;2011:bar004. doi: 10.1093/database/bar004. Print 2011.

GOChase-II: correcting semantic inconsistencies from Gene Ontology-based annotations for gene products.GOChase-II：纠正基于基因本体论注释的基因产物中的语义不一致性。

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S40. doi: 10.1186/1471-2105-12-S1-S40.

Cross-organism learning method to discover new gene functionalities.跨生物学习方法发现新基因功能。

Comput Methods Programs Biomed. 2016 Apr;126:20-34. doi: 10.1016/j.cmpb.2015.12.002. Epub 2015 Dec 17.

The UniProt-GO Annotation database in 2011.2011 年的 UniProt-GO Annotation 数据库。

Nucleic Acids Res. 2012 Jan;40(Database issue):D565-70. doi: 10.1093/nar/gkr1048. Epub 2011 Nov 28.

Evaluating Computational Gene Ontology Annotations.评估计算基因本体注释

Methods Mol Biol. 2017;1446:97-109. doi: 10.1007/978-1-4939-3743-1_8.

Gene Ontology annotations: what they mean and where they come from.基因本体论注释：它们的含义及来源

BMC Bioinformatics. 2008 Apr 29;9 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2105-9-S5-S2.

引用本文的文献

A compendium of human gene functions derived from evolutionary modelling.基于进化建模得出的人类基因功能概要。

Nature. 2025 Apr;640(8057):146-154. doi: 10.1038/s41586-025-08592-0. Epub 2025 Feb 26.

Integration of background knowledge for automatic detection of inconsistencies in gene ontology annotation.背景知识的整合用于自动检测基因本体论注释中的不一致性。

Bioinformatics. 2024 Jun 28;40(Suppl 1):i390-i400. doi: 10.1093/bioinformatics/btae246.

Exploring automatic inconsistency detection for literature-based gene ontology annotation.探索基于文献的基因本体论自动标注不一致性检测。

Bioinformatics. 2022 Jun 24;38(Suppl 1):i273-i281. doi: 10.1093/bioinformatics/btac230.

Calculating genetic risk for dysfunction in pleiotropic biological processes using whole exome sequencing data.利用全外显子组测序数据计算多效性生物过程功能障碍的遗传风险。

J Neurodev Disord. 2022 Jun 24;14(1):39. doi: 10.1186/s11689-022-09448-8.

Automatic consistency assurance for literature-based gene ontology annotation.基于文献的基因本体论自动一致性保证。

BMC Bioinformatics. 2021 Nov 25;22(1):565. doi: 10.1186/s12859-021-04479-9.

Crowdsourcing biocuration: The Community Assessment of Community Annotation with Ontologies (CACAO).众包生物注释：使用本体的社区注释评估 (CACAO)。

PLoS Comput Biol. 2021 Oct 28;17(10):e1009463. doi: 10.1371/journal.pcbi.1009463. eCollection 2021 Oct.

Single-cell co-expression analysis reveals that transcriptional modules are shared across cell types in the brain.单细胞共表达分析表明，大脑中的转录模块在细胞类型之间是共享的。

Cell Syst. 2021 Jul 21;12(7):748-756.e3. doi: 10.1016/j.cels.2021.04.010. Epub 2021 May 19.

PhotoModPlus: A web server for photosynthetic protein prediction from genome neighborhood features.PhotoModPlus：一个基于基因组邻近特征预测光合蛋白的网络服务器。

PLoS One. 2021 Mar 17;16(3):e0248682. doi: 10.1371/journal.pone.0248682. eCollection 2021.

Automatic Gene Function Prediction in the 2020's.21 世纪的自动基因功能预测。

Genes (Basel). 2020 Oct 27;11(11):1264. doi: 10.3390/genes11111264.

Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns.术语矩阵：一种基于本体论术语共同注释模式的新型基因本体论注释质量控制系统。

Open Biol. 2020 Sep;10(9):200149. doi: 10.1098/rsob.200149. Epub 2020 Sep 2.

本文引用的文献

Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.基于系统发生的基因本体论联盟功能注释传播。

Brief Bioinform. 2011 Sep;12(5):449-62. doi: 10.1093/bib/bbr042. Epub 2011 Aug 27.

How the gene ontology evolves.基因本体论的演变。

BMC Bioinformatics. 2011 Aug 5;12:325. doi: 10.1186/1471-2105-12-325.

REVIGO summarizes and visualizes long lists of gene ontology terms.REVIGO 对基因本体论术语的长列表进行总结和可视化。

PLoS One. 2011;6(7):e21800. doi: 10.1371/journal.pone.0021800. Epub 2011 Jul 18.

A new approach to assess and predict the functional roles of proteins across all known structures.一种评估和预测所有已知结构中蛋白质功能作用的新方法。

J Struct Funct Genomics. 2011 Mar;12(1):9-20. doi: 10.1007/s10969-011-9105-3. Epub 2011 Mar 29.

The what, where, how and why of gene ontology--a primer for bioinformaticians.基因本体论的是什么、在哪里、如何以及为什么——生物信息学家入门。

Brief Bioinform. 2011 Nov;12(6):723-35. doi: 10.1093/bib/bbr002. Epub 2011 Feb 17.

IntelliGO: a new vector-based semantic similarity measure including annotation origin.IntelliGO：一种新的基于向量的语义相似性度量方法，包含注释来源。

BMC Bioinformatics. 2010 Dec 1;11:588. doi: 10.1186/1471-2105-11-588.

Identifying informative subsets of the Gene Ontology with information bottleneck methods.利用信息瓶颈方法识别基因本体论的信息子集。

Bioinformatics. 2010 Oct 1;26(19):2445-51. doi: 10.1093/bioinformatics/btq449. Epub 2010 Aug 11.

More than 1,001 problems with protein domain databases: transmembrane regions, signal peptides and the issue of sequence homology.蛋白质结构域数据库的 1001 个问题：跨膜区、信号肽和序列同源性问题。

PLoS Comput Biol. 2010 Jul 29;6(7):e1000867. doi: 10.1371/journal.pcbi.1000867.

The Gene Ontology in 2010: extensions and refinements.2010 年的基因本体论：扩展和改进。

Nucleic Acids Res. 2010 Jan;38(Database issue):D331-5. doi: 10.1093/nar/gkp1018. Epub 2009 Nov 17.

The Universal Protein Resource (UniProt) in 2010.2010 年的通用蛋白质资源（UniProt）。

Nucleic Acids Res. 2010 Jan;38(Database issue):D142-8. doi: 10.1093/nar/gkp846. Epub 2009 Oct 20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

计算推断的基因本体论注释的质量。

Quality of computationally inferred gene ontology annotations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献