Suppr超能文献

通过蛋白质的系统发育分布预测其功能的实践与理论进展。

Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution.

作者信息

Kensche Philip R, van Noort Vera, Dutilh Bas E, Huynen Martijn A

机构信息

Centre for Molecular and Biomolecular Informatics/Nijmegen, Centre for Molecular Life Sciences, Radboud University Medical Centre, PO Box 9101, 6500 HB Nijmegen, The Netherlands.

出版信息

J R Soc Interface. 2008 Feb 6;5(19):151-70. doi: 10.1098/rsif.2007.1047.

Abstract

The gap between the amount of genome information released by genome sequencing projects and our knowledge about the proteins' functions is rapidly increasing. To fill this gap, various 'genomic-context' methods have been proposed that exploit sequenced genomes to predict the functions of the encoded proteins. One class of methods, phylogenetic profiling, predicts protein function by correlating the phylogenetic distribution of genes with that of other genes or phenotypic characteristics. The functions of a number of proteins, including ones of medical relevance, have thus been predicted and subsequently confirmed experimentally. Additionally, various approaches to measure the similarity of phylogenetic profiles and to account for the phylogenetic bias in the data have been proposed. We review the successful applications of phylogenetic profiling and analyse the performance of various profile similarity measures with a set of one microsporidial and 25 fungal genomes. In the fungi, phylogenetic profiling yields high-confidence predictions for the highest and only the highest scoring gene pairs illustrating both the power and the limitations of the approach. Both practical examples and theoretical considerations suggest that in order to get a reliable and specific picture of a protein's function, results from phylogenetic profiling have to be combined with other sources of evidence.

摘要

基因组测序项目所公布的基因组信息量与我们对蛋白质功能的了解之间的差距正在迅速扩大。为了填补这一差距,人们提出了各种“基因组上下文”方法,利用已测序的基因组来预测编码蛋白质的功能。其中一类方法,即系统发育谱分析,通过将基因的系统发育分布与其他基因或表型特征的分布相关联来预测蛋白质功能。包括一些具有医学相关性的蛋白质的功能,因此已被预测并随后通过实验得到证实。此外,还提出了各种测量系统发育谱相似性以及考虑数据中系统发育偏差的方法。我们回顾了系统发育谱分析的成功应用,并使用一组微孢子虫基因组和25个真菌基因组分析了各种谱相似性测量方法的性能。在真菌中,系统发育谱分析对得分最高且唯一得分最高的基因对产生了高可信度的预测,这既说明了该方法的强大之处,也说明了其局限性。实际例子和理论思考都表明,为了可靠而具体地了解蛋白质的功能,系统发育谱分析的结果必须与其他证据来源相结合。

相似文献

3
5
Towards validating the hypothesis of phylogenetic profiling.迈向验证系统发育谱分析假说。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S25. doi: 10.1186/1471-2105-8-S7-S25.

引用本文的文献

1
ProTaxoVis-protein taxonomic visualisation of presence.ProTaxoVis——蛋白质分类存在情况的可视化
BMC Bioinformatics. 2025 May 19;26(1):128. doi: 10.1186/s12859-025-06146-9.
2
Assembling bacterial puzzles: piecing together functions into microbial pathways.组装细菌谜题:将功能拼凑成微生物途径。
NAR Genom Bioinform. 2024 Aug 24;6(3):lqae109. doi: 10.1093/nargab/lqae109. eCollection 2024 Sep.
9
Novel metric for hyperbolic phylogenetic tree embeddings.双曲系统发生树嵌入的新度量。
Biol Methods Protoc. 2021 Mar 27;6(1):bpab006. doi: 10.1093/biomethods/bpab006. eCollection 2021.

本文引用的文献

2
LIKELIHOOD OF ANCESTOR STATES IN ADAPTIVE RADIATION.适应性辐射中祖先状态的可能性
Evolution. 1997 Dec;51(6):1699-1711. doi: 10.1111/j.1558-5646.1997.tb05095.x.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验