Suppr超能文献

整合Magnum数据库中的蛋白质结构和预先计算的谱系:以细胞视黄醇结合蛋白为例。

Integrating protein structures and precomputed genealogies in the Magnum database: examples with cellular retinoid binding proteins.

作者信息

Bradley Michael E, Benner Steven A

机构信息

Department of Chemistry, University of Florida, PO Box 117200, Gainesville, FL 32611, USA.

出版信息

BMC Bioinformatics. 2006 Feb 23;7:89. doi: 10.1186/1471-2105-7-89.

Abstract

BACKGROUND

When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use.

RESULTS

The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures.

CONCLUSION

We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins.

摘要

背景

当将蛋白质序列趋异进化的精确模型与互补的生物学信息(如折叠的蛋白质结构)相结合时,对组合数据的分析往往会产生关于分子生理学的新假设。这是生物信息学如何用于指导实验研究的一个绝佳例子。然而,由于缺乏适用于一般用途的公开可用资源,这一方向的进展一直较为缓慢。

结果

预先计算的Magnum数据库为大约1800个具有至少一个晶体结构的全长蛋白质家族解决了这一问题。Magnum提供的成果包括:1)多序列比对;2)比对位点到晶体结构位点的映射;3)系统发育树;4)内部树节点处的推断祖先序列;5)沿树枝的氨基酸替换。综合评估表明,用于构建Magnum的自动化程序产生了蛋白质趋异进化(即谱系)的精确模型,并将这些模型与结构数据正确整合。为了展示Magnum的能力,我们查找了位于蛋白质内部结构位点、需要三个核苷酸替换且发生在短系统发育树分支上的氨基酸替换。在细胞视黄醇结合蛋白家族中,发现了一个可能调节配体结合亲和力的位点。在日行壁虎中,细胞视黄醇结合蛋白被招募来充当晶状体晶状体蛋白,这为展示一个包含与蛋白质结构整合的分支替换模式的可浏览数据库的预测价值提供了另一个机会。

结论

我们大规模整合了蛋白质科学的两个领域——进化和结构,并创建了一个预先计算的数据库Magnum,这是同类中的首个免费可用资源。Magnum提供了进化和结构生物信息学资源,有助于识别关于蛋白质行为和功能分子基础的可实验验证的假设,如细胞视黄醇结合蛋白的例子所示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/7832f0848254/1471-2105-7-89-1.jpg

相似文献

2
Pandit: a database of protein and associated nucleotide domains with inferred trees.
Bioinformatics. 2003 Aug 12;19(12):1556-63. doi: 10.1093/bioinformatics/btg188.
3
Imprint of evolutionary conservation and protein structure variation on the binding function of protein tyrosine kinases.
Bioinformatics. 2006 Aug 1;22(15):1846-54. doi: 10.1093/bioinformatics/btl199. Epub 2006 May 23.
4
Domain-based small molecule binding site annotation.
BMC Bioinformatics. 2006 Mar 17;7:152. doi: 10.1186/1471-2105-7-152.
5
Multiple Alignment of protein structures and sequences for VMD.
Bioinformatics. 2006 Feb 15;22(4):504-6. doi: 10.1093/bioinformatics/bti825. Epub 2005 Dec 8.
6
7
Adding some SPICE to DAS.
Bioinformatics. 2005 Sep 1;21 Suppl 2(Suppl 2):ii40-1. doi: 10.1093/bioinformatics/bti1106.
8
Gecko iota-crystallin: how cellular retinol-binding protein became an eye lens ultraviolet filter.
Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3282-7. doi: 10.1073/pnas.97.7.3282.
9
Using Dali for structural comparison of proteins.
Curr Protoc Bioinformatics. 2006 Jul;Chapter 5:Unit 5.5. doi: 10.1002/0471250953.bi0505s14.
10

引用本文的文献

1
Sulfate activation enzymes: phylogeny and association with pyrophosphatase.
J Mol Evol. 2009 Jan;68(1):1-13. doi: 10.1007/s00239-008-9181-6. Epub 2008 Dec 6.

本文引用的文献

1
Resurrecting ancestral alcohol dehydrogenases from yeast.
Nat Genet. 2005 Jun;37(6):630-5. doi: 10.1038/ng1553. Epub 2005 May 1.
4
Seq2Struct: a resource for establishing sequence-structure links.
Bioinformatics. 2005 Feb 15;21(4):551-3. doi: 10.1093/bioinformatics/bti049. Epub 2004 Sep 28.
5
Reconstruction of ancestral protein sequences and its applications.
BMC Evol Biol. 2004 Sep 17;4:33. doi: 10.1186/1471-2148-4-33.
6
Evolution of coral pigments recreated.
Science. 2004 Sep 3;305(5689):1433. doi: 10.1126/science.1099597.
7
The planetary biology of cytochrome P450 aromatases.
BMC Biol. 2004 Aug 17;2:19. doi: 10.1186/1741-7007-2-19.
8
The iProClass integrated database for protein functional analysis.
Comput Biol Chem. 2004 Feb;28(1):87-96. doi: 10.1016/j.compbiolchem.2003.10.003.
9
Phylogenomic inference of protein molecular function: advances and challenges.
Bioinformatics. 2004 Jan 22;20(2):170-9. doi: 10.1093/bioinformatics/bth021.
10
The Pfam protein families database.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D138-41. doi: 10.1093/nar/gkh121.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验