来自多序列比对的QR分解的进化概况。

Evolutionary profiles from the QR factorization of multiple sequence alignments.

作者信息

Sethi Anurag, O'Donoghue Patrick, Luthey-Schulten Zaida

机构信息

Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.

出版信息

Proc Natl Acad Sci U S A. 2005 Mar 15;102(11):4045-50. doi: 10.1073/pnas.0409715102. Epub 2005 Mar 1.

DOI:10.1073/pnas.0409715102

PMID:15741270

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC554820/

Abstract

We present an algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of the homologous group. The method, based on the multidimensional QR factorization of numerically encoded multiple sequence alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. We observe a general trend that these smaller, more evolutionarily balanced profiles have comparable and, in many cases, better performance in database searches than conventional profiles containing hundreds of sequences, constructed in an iterative and computationally intensive procedure. For more diverse families or superfamilies, with sequence identity <30%, structural alignments, based purely on the geometry of the protein structures, provide better alignments than pure sequence-based methods. Merging the structure and sequence information allows the construction of accurate profiles for distantly related groups. These structure-based profiles outperformed other sequence-based methods for finding distant homologs and were used to identify a putative class II cysteinyl-tRNA synthetase (CysRS) in several archaea that eluded previous annotation studies. Phylogenetic analysis showed the putative class II CysRSs to be a monophyletic group and homology modeling revealed a constellation of active site residues similar to that in the known class I CysRS.

摘要

我们提出了一种算法，用于生成完整的进化图谱，以表示同源组分子系统发育树的拓扑结构。该方法基于对数字编码的多序列比对进行多维QR分解，去除比对中的冗余，并按线性依赖性增加的顺序排列蛋白质序列，从而确定跨越蛋白质同源组进化空间的最小序列基集。我们观察到一个普遍趋势，即这些更小、进化上更平衡的图谱在数据库搜索中具有与传统图谱相当的性能，并且在许多情况下，比通过迭代和计算密集型程序构建的包含数百个序列的传统图谱表现更好。对于序列同一性<30%的更多样化的家族或超家族，纯粹基于蛋白质结构几何形状的结构比对比基于纯序列的方法提供更好的比对。合并结构和序列信息允许为远缘相关组构建准确的图谱。这些基于结构的图谱在寻找远缘同源物方面优于其他基于序列的方法，并被用于在几个古菌中鉴定出一种先前注释研究中未发现的假定的II类半胱氨酰-tRNA合成酶（CysRS）。系统发育分析表明，假定的II类CysRSs是一个单系群，同源性建模揭示了一组与已知I类CysRS中相似的活性位点残基。

相似文献

Evolutionary profiles from the QR factorization of multiple sequence alignments.

Proc Natl Acad Sci U S A. 2005 Mar 15;102(11):4045-50. doi: 10.1073/pnas.0409715102. Epub 2005 Mar 1.

Evolutionary profiles derived from the QR factorization of multiple structural alignments gives an economy of information.

J Mol Biol. 2005 Feb 25;346(3):875-94. doi: 10.1016/j.jmb.2004.11.053. Epub 2005 Jan 22.

MultiSeq: unifying sequence and structure data for evolutionary analysis.

BMC Bioinformatics. 2006 Aug 16;7:382. doi: 10.1186/1471-2105-7-382.

PASS2: an automated database of protein alignments organised as structural superfamilies.

BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.

An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.

J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.

Sequence and hydropathy profile analysis of two classes of secondary transporters.

Mol Membr Biol. 2005 May-Jun;22(3):177-89. doi: 10.1080/09687860500063324.

PROMALS: towards accurate multiple sequence alignments of distantly related proteins.

Bioinformatics. 2007 Apr 1;23(7):802-8. doi: 10.1093/bioinformatics/btm017. Epub 2007 Jan 31.

Application of protein structure alignments to iterated hidden Markov model protocols for structure prediction.

BMC Bioinformatics. 2006 Sep 14;7:410. doi: 10.1186/1471-2105-7-410.

Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.

Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.

Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments.

J Mol Biol. 1996 Dec 13;264(4):823-38. doi: 10.1006/jmbi.1996.0679.

引用本文的文献

RNA-Dependent Cysteine Biosynthesis in Bacteria and Archaea.

mBio. 2017 May 9;8(3):e00561-17. doi: 10.1128/mBio.00561-17.

Exploring the role of receptor flexibility in structure-based drug discovery.

Biophys Chem. 2014 Feb;186:31-45. doi: 10.1016/j.bpc.2013.10.007. Epub 2013 Nov 9.

Cryo-EM visualization of the ribosome in termination complex with apo-RF3 and RF1.

Elife. 2013 Jun 4;2:e00411. doi: 10.7554/eLife.00411.

Quantifying intramolecular binding in multivalent interactions: a structure-based synergistic study on Grb2-Sos1 complex.

PLoS Comput Biol. 2011 Oct;7(10):e1002192. doi: 10.1371/journal.pcbi.1002192. Epub 2011 Oct 13.

Recognition of the regulatory nascent chain TnaC by the ribosome.

Structure. 2010 May 12;18(5):627-37. doi: 10.1016/j.str.2010.02.011.

Exit strategies for charged tRNA from GluRS.

J Mol Biol. 2010 Apr 16;397(5):1350-71. doi: 10.1016/j.jmb.2010.02.003. Epub 2010 Feb 13.

Horizontal gene transfer of zinc and non-zinc forms of bacterial ribosomal protein S4.

BMC Evol Biol. 2009 Jul 29;9:179. doi: 10.1186/1471-2148-9-179.

Classification and energetics of the base-phosphate interactions in RNA.

Nucleic Acids Res. 2009 Aug;37(15):4898-918. doi: 10.1093/nar/gkp468. Epub 2009 Jun 14.

Dynamical networks in tRNA:protein complexes.

Proc Natl Acad Sci U S A. 2009 Apr 21;106(16):6620-5. doi: 10.1073/pnas.0810961106. Epub 2009 Apr 7.

Frequency and isostericity of RNA base pairs.

Nucleic Acids Res. 2009 Apr;37(7):2294-312. doi: 10.1093/nar/gkp011. Epub 2009 Feb 24.

本文引用的文献

Evolutionary profiles derived from the QR factorization of multiple structural alignments gives an economy of information.

J Mol Biol. 2005 Feb 25;346(3):875-94. doi: 10.1016/j.jmb.2004.11.053. Epub 2005 Jan 22.

The ASTRAL Compendium in 2004.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D189-92. doi: 10.1093/nar/gkh034.

Cysteinyl-tRNA(Cys) formation in Methanocaldococcus jannaschii: the mechanism is still unknown.

J Bacteriol. 2004 Jan;186(1):8-14. doi: 10.1128/JB.186.1.8-14.2004.

On the evolution of structure in aminoacyl-tRNA synthetases.

Microbiol Mol Biol Rev. 2003 Dec;67(4):550-73. doi: 10.1128/MMBR.67.4.550-573.2003.

Zinc-mediated amino acid discrimination in cysteinyl-tRNA synthetase.

J Mol Biol. 2003 Apr 11;327(5):911-7. doi: 10.1016/s0022-2836(03)00241-9.

The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Nucleic Acids Res. 2003 Jan 1;31(1):365-70. doi: 10.1093/nar/gkg095.

Cysteinyl-tRNA formation and prolyl-tRNA synthetase.

FEBS Lett. 2002 Mar 6;514(1):34-6. doi: 10.1016/s0014-5793(02)02331-1.

Functional convergence of two lysyl-tRNA synthetases with unrelated topologies.

Nat Struct Biol. 2002 Apr;9(4):257-62. doi: 10.1038/nsb777.

Cysteinyl-tRNA synthetase is not essential for viability of the archaeon Methanococcus maripaludis.

Proc Natl Acad Sci U S A. 2001 Dec 4;98(25):14292-7. doi: 10.1073/pnas.201540498. Epub 2001 Nov 20.

On the structure of hisH: protein structure prediction in the context of structural and functional genomics.

J Struct Biol. 2001 May-Jun;134(2-3):257-68. doi: 10.1006/jsbi.2001.4390.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

来自多序列比对的QR分解的进化概况。

Evolutionary profiles from the QR factorization of multiple sequence alignments.

作者信息

Sethi Anurag, O'Donoghue Patrick, Luthey-Schulten Zaida

机构信息

Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.

出版信息

Proc Natl Acad Sci U S A. 2005 Mar 15;102(11):4045-50. doi: 10.1073/pnas.0409715102. Epub 2005 Mar 1.

DOI:10.1073/pnas.0409715102

PMID:15741270

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC554820/

Abstract

摘要

来自多序列比对的QR分解的进化概况。

Evolutionary profiles from the QR factorization of multiple sequence alignments.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

来自多序列比对的QR分解的进化概况。

Evolutionary profiles from the QR factorization of multiple sequence alignments.

作者信息

机构信息

出版信息