Suppr
超能文献

基于系统发生的基因本体论联盟功能注释传播。

Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.

机构信息

Swiss Institute for Bioinformatics, CMU, 1 Rue Michel Servet, 1211 Geneva 4, Switzerland.

出版信息

Brief Bioinform. 2011 Sep;12(5):449-62. doi: 10.1093/bib/bbr042. Epub 2011 Aug 27.

DOI:10.1093/bib/bbr042

PMID:21873635

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3178059/

Abstract

The goal of the Gene Ontology (GO) project is to provide a uniform way to describe the functions of gene products from organisms across all kingdoms of life and thereby enable analysis of genomic data. Protein annotations are either based on experiments or predicted from protein sequences. Since most sequences have not been experimentally characterized, most available annotations need to be based on predictions. To make as accurate inferences as possible, the GO Consortium's Reference Genome Project is using an explicit evolutionary framework to infer annotations of proteins from a broad set of genomes from experimental annotations in a semi-automated manner. Most components in the pipeline, such as selection of sequences, building multiple sequence alignments and phylogenetic trees, retrieving experimental annotations and depositing inferred annotations, are fully automated. However, the most crucial step in our pipeline relies on software-assisted curation by an expert biologist. This curation tool, Phylogenetic Annotation and INference Tool (PAINT) helps curators to infer annotations among members of a protein family. PAINT allows curators to make precise assertions as to when functions were gained and lost during evolution and record the evidence (e.g. experimentally supported GO annotations and phylogenetic information including orthology) for those assertions. In this article, we describe how we use PAINT to infer protein function in a phylogenetic context with emphasis on its strengths, limitations and guidelines. We also discuss specific examples showing how PAINT annotations compare with those generated by other highly used homology-based methods.

摘要

GO 项目的目标是提供一种统一的方式来描述来自所有生命领域的生物体的基因产物的功能，从而能够分析基因组数据。蛋白质注释要么基于实验，要么从蛋白质序列预测。由于大多数序列尚未经过实验表征，因此大多数可用的注释需要基于预测。为了尽可能做出准确的推断，GO 联盟的参考基因组项目正在使用明确的进化框架，以半自动化的方式从一组广泛的基因组中推断蛋白质的注释，这些基因组具有实验注释。该管道中的大多数组件，例如序列选择、构建多序列比对和系统发育树、检索实验注释和存储推断注释，都是完全自动化的。然而，我们管道中最关键的步骤依赖于专家生物学家的软件辅助策展。这个策展工具，系统发育注释和推断工具（PAINT），帮助策展人在蛋白质家族成员之间推断注释。PAINT 允许策展人准确地断言在进化过程中何时获得和失去功能，并记录证据（例如，实验支持的 GO 注释和包括同源性的系统发育信息）。在本文中，我们描述了如何在系统发育背景下使用 PAINT 推断蛋白质功能，重点介绍了其优势、限制和指南。我们还讨论了具体示例，展示了 PAINT 注释如何与其他高度使用的基于同源性的方法生成的注释进行比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7ff/3178059/8054bca39e19/bbr042f1.jpg

相似文献

Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.

Brief Bioinform. 2011 Sep;12(5):449-62. doi: 10.1093/bib/bbr042. Epub 2011 Aug 27.

Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw155. Print 2016.

Interpreting Gene Ontology Annotations Derived from Sequence Homology Methods.

Methods Mol Biol. 2024;2836:285-298. doi: 10.1007/978-1-0716-4007-4_15.

CvManGO, a method for leveraging computational predictions to improve literature-based Gene Ontology annotations.

Database (Oxford). 2012 Mar 20;2012:bas001. doi: 10.1093/database/bas001. Print 2012.

zDB: bacterial comparative genomics made easy.

mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28.

A guide to best practices for Gene Ontology (GO) manual annotation.

Database (Oxford). 2013 Jul 9;2013:bat054. doi: 10.1093/database/bat054. Print 2013.

Quality of computationally inferred gene ontology annotations.

PLoS Comput Biol. 2012 May;8(5):e1002533. doi: 10.1371/journal.pcbi.1002533. Epub 2012 May 31.

Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study.

Database (Oxford). 2011 Mar 15;2011:bar004. doi: 10.1093/database/bar004. Print 2011.

PANTHER version 10: expanded protein families and functions, and analysis tools.

Nucleic Acids Res. 2016 Jan 4;44(D1):D336-42. doi: 10.1093/nar/gkv1194. Epub 2015 Nov 17.

引用本文的文献

Photocatalytic labelling-enabled subcellular-resolved RNA profiling and synchronous multi-omics investigation.

Nat Chem. 2025 Sep 16. doi: 10.1038/s41557-025-01946-1.

DIAMOND2GO: rapid Gene Ontology assignment and enrichment detection for functional genomics.

Front Bioinform. 2025 Aug 15;5:1634042. doi: 10.3389/fbinf.2025.1634042. eCollection 2025.

The potassium channel K2.1 shapes the morphology and function of brain endothelial cells via actin network remodeling.

Nat Commun. 2025 Jul 18;16(1):6622. doi: 10.1038/s41467-025-61816-9.

Plasma Proteomic Profiling of a Group of Anxious Dogs by LC-MS/MS: A Case-Control Study.

Proteomics Clin Appl. 2025 Jul 4:e70014. doi: 10.1002/prca.70014.

KChIP3 fosters neuroinflammation and synaptic dysfunction in the 5XFAD mouse model of Alzheimer's disease.

J Neuroinflammation. 2025 Jun 19;22(1):160. doi: 10.1186/s12974-025-03426-2.

Excess Wnt in neurological disease.

Biochem J. 2025 May 16;482(10):601-18. doi: 10.1042/BCJ20240265.

Exploring the Interconnections Between Mitochondrial Dysfunction and Polycystic Ovary Syndrome: A Comprehensive Integrated Analysis.

Biochem Genet. 2025 Apr 21. doi: 10.1007/s10528-025-11104-4.

Genome-wide DNA methylation analysis of CBCVd-infected hop plants ( var. "Celeia") provides novel insights into viroid pathogenesis.

Microbiol Spectr. 2025 Jun 3;13(6):e0039424. doi: 10.1128/spectrum.00394-24. Epub 2025 Apr 16.

PTOV1 interacts with ZNF449 to promote colorectal cancer development.

Commun Biol. 2025 Mar 25;8(1):489. doi: 10.1038/s42003-025-07930-2.

Ltc1 localization by EMC regulates cell membrane fluidity to facilitate membrane protein biogenesis.

iScience. 2025 Feb 24;28(3):112096. doi: 10.1016/j.isci.2025.112096. eCollection 2025 Mar 21.

本文引用的文献

The what, where, how and why of gene ontology--a primer for bioinformaticians.

Brief Bioinform. 2011 Nov;12(6):723-35. doi: 10.1093/bib/bbr002. Epub 2011 Feb 17.

InterPro protein classification.

Methods Mol Biol. 2011;694:37-47. doi: 10.1007/978-1-60761-977-2_3.

Formalization of taxon-based constraints to detect inconsistencies in annotation and ontology development.

BMC Bioinformatics. 2010 Oct 25;11:530. doi: 10.1186/1471-2105-11-530.

GIGA: a simple, efficient algorithm for gene tree inference in the genomic age.

BMC Bioinformatics. 2010 Jun 9;11:312. doi: 10.1186/1471-2105-11-312.

PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium.

Nucleic Acids Res. 2010 Jan;38(Database issue):D204-10. doi: 10.1093/nar/gkp1019. Epub 2009 Dec 16.

The Gene Ontology in 2010: extensions and refinements.

Nucleic Acids Res. 2010 Jan;38(Database issue):D331-5. doi: 10.1093/nar/gkp1018. Epub 2009 Nov 17.

The Gene Ontology's Reference Genome Project: a unified framework for functional annotation across species.

PLoS Comput Biol. 2009 Jul;5(7):e1000431. doi: 10.1371/journal.pcbi.1000431. Epub 2009 Jul 3.

How confident can we be that orthologs are similar, but paralogs differ?

Trends Genet. 2009 May;25(5):210-6. doi: 10.1016/j.tig.2009.03.004. Epub 2009 Apr 14.

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.

Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.

The Princeton Protein Orthology Database (P-POD): a comparative genomics analysis tool for biologists.

PLoS One. 2007 Aug 22;2(8):e766. doi: 10.1371/journal.pone.0000766.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

基于系统发生的基因本体论联盟功能注释传播。

Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译