• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合Magnum数据库中的蛋白质结构和预先计算的谱系:以细胞视黄醇结合蛋白为例。

Integrating protein structures and precomputed genealogies in the Magnum database: examples with cellular retinoid binding proteins.

作者信息

Bradley Michael E, Benner Steven A

机构信息

Department of Chemistry, University of Florida, PO Box 117200, Gainesville, FL 32611, USA.

出版信息

BMC Bioinformatics. 2006 Feb 23;7:89. doi: 10.1186/1471-2105-7-89.

DOI:10.1186/1471-2105-7-89
PMID:16504077
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1475641/
Abstract

BACKGROUND

When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use.

RESULTS

The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures.

CONCLUSION

We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins.

摘要

背景

当将蛋白质序列趋异进化的精确模型与互补的生物学信息(如折叠的蛋白质结构)相结合时,对组合数据的分析往往会产生关于分子生理学的新假设。这是生物信息学如何用于指导实验研究的一个绝佳例子。然而,由于缺乏适用于一般用途的公开可用资源,这一方向的进展一直较为缓慢。

结果

预先计算的Magnum数据库为大约1800个具有至少一个晶体结构的全长蛋白质家族解决了这一问题。Magnum提供的成果包括:1)多序列比对;2)比对位点到晶体结构位点的映射;3)系统发育树;4)内部树节点处的推断祖先序列;5)沿树枝的氨基酸替换。综合评估表明,用于构建Magnum的自动化程序产生了蛋白质趋异进化(即谱系)的精确模型,并将这些模型与结构数据正确整合。为了展示Magnum的能力,我们查找了位于蛋白质内部结构位点、需要三个核苷酸替换且发生在短系统发育树分支上的氨基酸替换。在细胞视黄醇结合蛋白家族中,发现了一个可能调节配体结合亲和力的位点。在日行壁虎中,细胞视黄醇结合蛋白被招募来充当晶状体晶状体蛋白,这为展示一个包含与蛋白质结构整合的分支替换模式的可浏览数据库的预测价值提供了另一个机会。

结论

我们大规模整合了蛋白质科学的两个领域——进化和结构,并创建了一个预先计算的数据库Magnum,这是同类中的首个免费可用资源。Magnum提供了进化和结构生物信息学资源,有助于识别关于蛋白质行为和功能分子基础的可实验验证的假设,如细胞视黄醇结合蛋白的例子所示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/e2da601a669f/1471-2105-7-89-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/7832f0848254/1471-2105-7-89-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/14d92425f778/1471-2105-7-89-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/43a45dab58e7/1471-2105-7-89-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/d1742ca220b7/1471-2105-7-89-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/3036bd5b8b7e/1471-2105-7-89-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/6addf0314652/1471-2105-7-89-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/e2da601a669f/1471-2105-7-89-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/7832f0848254/1471-2105-7-89-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/14d92425f778/1471-2105-7-89-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/43a45dab58e7/1471-2105-7-89-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/d1742ca220b7/1471-2105-7-89-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/3036bd5b8b7e/1471-2105-7-89-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/6addf0314652/1471-2105-7-89-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3dfd/1475641/e2da601a669f/1471-2105-7-89-7.jpg

相似文献

1
Integrating protein structures and precomputed genealogies in the Magnum database: examples with cellular retinoid binding proteins.整合Magnum数据库中的蛋白质结构和预先计算的谱系:以细胞视黄醇结合蛋白为例。
BMC Bioinformatics. 2006 Feb 23;7:89. doi: 10.1186/1471-2105-7-89.
2
Pandit: a database of protein and associated nucleotide domains with inferred trees.潘迪特:一个带有推断树的蛋白质及相关核苷酸结构域数据库。
Bioinformatics. 2003 Aug 12;19(12):1556-63. doi: 10.1093/bioinformatics/btg188.
3
Imprint of evolutionary conservation and protein structure variation on the binding function of protein tyrosine kinases.蛋白质酪氨酸激酶结合功能上的进化保守印记与蛋白质结构变异
Bioinformatics. 2006 Aug 1;22(15):1846-54. doi: 10.1093/bioinformatics/btl199. Epub 2006 May 23.
4
Domain-based small molecule binding site annotation.基于结构域的小分子结合位点注释。
BMC Bioinformatics. 2006 Mar 17;7:152. doi: 10.1186/1471-2105-7-152.
5
Multiple Alignment of protein structures and sequences for VMD.用于VMD的蛋白质结构和序列的多序列比对。
Bioinformatics. 2006 Feb 15;22(4):504-6. doi: 10.1093/bioinformatics/bti825. Epub 2005 Dec 8.
6
L/D Protein Ligand Database (PLD): additional understanding of the nature and specificity of protein-ligand complexes.L/D蛋白质配体数据库(PLD):对蛋白质-配体复合物的性质和特异性的进一步理解。
Bioinformatics. 2003 Sep 22;19(14):1856-7. doi: 10.1093/bioinformatics/btg243.
7
Adding some SPICE to DAS.给数据采集系统增添一些特色。
Bioinformatics. 2005 Sep 1;21 Suppl 2(Suppl 2):ii40-1. doi: 10.1093/bioinformatics/bti1106.
8
Gecko iota-crystallin: how cellular retinol-binding protein became an eye lens ultraviolet filter.壁虎ι-晶体蛋白:细胞视黄醇结合蛋白如何成为晶状体紫外线滤光片。
Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3282-7. doi: 10.1073/pnas.97.7.3282.
9
Using Dali for structural comparison of proteins.使用Dali进行蛋白质的结构比较。
Curr Protoc Bioinformatics. 2006 Jul;Chapter 5:Unit 5.5. doi: 10.1002/0471250953.bi0505s14.
10
Identification of putative domain linkers by a neural network - application to a large sequence database.通过神经网络识别假定的结构域连接子——应用于大型序列数据库
BMC Bioinformatics. 2006 Jun 27;7:323. doi: 10.1186/1471-2105-7-323.

引用本文的文献

1
Sulfate activation enzymes: phylogeny and association with pyrophosphatase.硫酸盐激活酶:系统发育及其与焦磷酸酶的关联
J Mol Evol. 2009 Jan;68(1):1-13. doi: 10.1007/s00239-008-9181-6. Epub 2008 Dec 6.

本文引用的文献

1
Resurrecting ancestral alcohol dehydrogenases from yeast.复活酵母中的祖先乙醇脱氢酶
Nat Genet. 2005 Jun;37(6):630-5. doi: 10.1038/ng1553. Epub 2005 May 1.
2
Phylogenomic approaches to common problems encountered in the analysis of low copy repeats: the sulfotransferase 1A gene family example.针对低拷贝重复序列分析中常见问题的系统发育基因组学方法:磺基转移酶1A基因家族实例
BMC Evol Biol. 2005 Mar 7;5:22. doi: 10.1186/1471-2148-5-22.
3
Generality of the structurally constrained protein evolution model: assessment on representatives of the four main fold classes.
结构受限蛋白质进化模型的通用性:对四大折叠类别的代表进行评估
Gene. 2005 Jan 17;345(1):45-53. doi: 10.1016/j.gene.2004.11.025. Epub 2004 Dec 24.
4
Seq2Struct: a resource for establishing sequence-structure links.Seq2Struct:一个用于建立序列-结构联系的资源。
Bioinformatics. 2005 Feb 15;21(4):551-3. doi: 10.1093/bioinformatics/bti049. Epub 2004 Sep 28.
5
Reconstruction of ancestral protein sequences and its applications.祖先蛋白质序列的重建及其应用。
BMC Evol Biol. 2004 Sep 17;4:33. doi: 10.1186/1471-2148-4-33.
6
Evolution of coral pigments recreated.珊瑚色素的进化过程得以重现。
Science. 2004 Sep 3;305(5689):1433. doi: 10.1126/science.1099597.
7
The planetary biology of cytochrome P450 aromatases.细胞色素P450芳香化酶的行星生物学
BMC Biol. 2004 Aug 17;2:19. doi: 10.1186/1741-7007-2-19.
8
The iProClass integrated database for protein functional analysis.用于蛋白质功能分析的iProClass综合数据库。
Comput Biol Chem. 2004 Feb;28(1):87-96. doi: 10.1016/j.compbiolchem.2003.10.003.
9
Phylogenomic inference of protein molecular function: advances and challenges.蛋白质分子功能的系统发育基因组学推断:进展与挑战
Bioinformatics. 2004 Jan 22;20(2):170-9. doi: 10.1093/bioinformatics/bth021.
10
The Pfam protein families database.Pfam蛋白质家族数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D138-41. doi: 10.1093/nar/gkh121.