成对结构比对的进化不准确性。

Evolutionary inaccuracy of pairwise structural alignments.

机构信息

Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London, UK.

出版信息

Bioinformatics. 2012 May 1;28(9):1209-15. doi: 10.1093/bioinformatics/bts103. Epub 2012 Mar 6.

DOI:10.1093/bioinformatics/bts103

PMID:22399676

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3338010/

Abstract

MOTIVATION

Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary history of most proteins. Since homology is an equivalence relation, an upper bound on alignment quality can be found by assessing the consistency of alignments. Measuring the consistency of current methods of structure alignment and determining the causes of inconsistencies can, therefore, provide information on the quality of current methods and suggest possibilities for further improvement.

RESULTS

We analyze the self-consistency of seven widely-used structural alignment methods (SAP, TM-align, Fr-TM-align, MAMMOTH, DALI, CE and FATCAT) on a diverse, non-redundant set of 1863 domains from the SCOP database and demonstrate that even for relatively similar proteins the degree of inconsistency of the alignments on a residue level is high (30%). We further show that levels of consistency vary substantially between methods, with two methods (SAP and Fr-TM-align) producing more consistent alignments than the rest. Inconsistency is found to be higher near gaps and for proteins of low structural complexity, as well as for helices. The ability of the methods to identify good structural alignments is also assessed using geometric measures, for which FATCAT (flexible mode) is found to be the best performer despite being highly inconsistent. We conclude that there is substantial scope for improving the consistency of structural alignment methods.

CONTACT

msadows@nimr.mrc.ac.uk

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

结构比对方法被广泛用于生成金标准比对，以改进多序列比对和功能注释的转移，以及用于分配蛋白质之间的结构距离。然而，这些方法生成的比对的正确性很难客观评估，因为大多数蛋白质的精确进化历史知之甚少。由于同源性是一种等价关系，因此通过评估比对的一致性，可以找到比对质量的上限。因此，衡量当前结构比对方法的一致性并确定不一致的原因，可以提供关于当前方法质量的信息，并为进一步改进提供可能性。

结果

我们分析了七种广泛使用的结构比对方法（SAP、TM-align、Fr-TM-align、MAMMOTH、DALI、CE 和 FATCAT）在 SCOP 数据库中多样化的、非冗余的 1863 个结构域上的自一致性，并证明即使对于相对相似的蛋白质，残基水平上比对的不一致程度也很高（30%）。我们进一步表明，方法之间的一致性水平差异很大，两种方法（SAP 和 Fr-TM-align）产生的比对比其他方法更一致。在缺口附近和结构复杂性低的蛋白质以及螺旋处，发现不一致性更高。我们还使用几何度量评估了这些方法识别良好结构比对的能力，尽管 FATCAT（灵活模式）高度不一致，但发现它是表现最好的方法。我们得出结论，在提高结构比对方法的一致性方面还有很大的改进空间。

联系方式

msadows@nimr.mrc.ac.uk

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6e4/3338010/6c1424a55ad0/bts103f1.jpg

相似文献

Evolutionary inaccuracy of pairwise structural alignments.成对结构比对的进化不准确性。

Bioinformatics. 2012 May 1;28(9):1209-15. doi: 10.1093/bioinformatics/bts103. Epub 2012 Mar 6.

CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对：一种基于残基-残基接触面积的灵活蛋白质结构比对方法。

PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.

Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score.Fr-TM-align：一种基于片段比对和TM分数的新型蛋白质结构比对方法。

BMC Bioinformatics. 2008 Dec 12;9:531. doi: 10.1186/1471-2105-9-531.

Comparative analysis of protein structure alignments.蛋白质结构比对的比较分析

BMC Struct Biol. 2007 Jul 26;7:50. doi: 10.1186/1472-6807-7-50.

Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。

Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.

High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABER-TOOTH.使用 SABER-TOOTH 结合结构轮廓预测和轮廓比对进行高质量蛋白质序列比对。

BMC Bioinformatics. 2010 May 14;11:251. doi: 10.1186/1471-2105-11-251.

Accuracy of structure-based sequence alignment of automatic methods.自动方法的基于结构的序列比对准确性。

BMC Bioinformatics. 2007 Sep 20;8:355. doi: 10.1186/1471-2105-8-355.

SVM-dependent pairwise HMM: an application to protein pairwise alignments.基于 SVM 的成对隐马尔可夫模型：在蛋白质两两比对中的应用。

Bioinformatics. 2017 Dec 15;33(24):3902-3908. doi: 10.1093/bioinformatics/btx391.

Improving the alignment quality of consistency based aligners with an evaluation function using synonymous protein words.利用同义蛋白质词的评估函数提高一致性比对器的比对质量。

PLoS One. 2011;6(12):e27872. doi: 10.1371/journal.pone.0027872. Epub 2011 Dec 2.

Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy.膜蛋白的结构比对：现有工具的准确性及一种共识策略

Proteins. 2015 Sep;83(9):1720-32. doi: 10.1002/prot.24857. Epub 2015 Aug 1.

引用本文的文献

Novel insights into the origin and diversification of photosynthesis based on analyses of conserved indels in the core reaction center proteins.基于对核心反应中心蛋白中保守插入缺失的分析，对光合作用起源和多样化的新见解。

Photosynth Res. 2017 Feb;131(2):159-171. doi: 10.1007/s11120-016-0307-1. Epub 2016 Sep 16.

CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.CAB比对：一种基于残基-残基接触面积的灵活蛋白质结构比对方法。

PLoS One. 2015 Oct 26;10(10):e0141440. doi: 10.1371/journal.pone.0141440. eCollection 2015.

Structural Bridges through Fold Space.穿越折叠空间的结构桥梁。

PLoS Comput Biol. 2015 Sep 15;11(9):e1004466. doi: 10.1371/journal.pcbi.1004466. eCollection 2015 Sep.

Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy.膜蛋白的结构比对：现有工具的准确性及一种共识策略

Proteins. 2015 Sep;83(9):1720-32. doi: 10.1002/prot.24857. Epub 2015 Aug 1.

Consequences of domain insertion on sequence-structure divergence in a superfold.结构域插入对超折叠中序列-结构分歧的影响。

Proc Natl Acad Sci U S A. 2013 Sep 3;110(36):E3381-7. doi: 10.1073/pnas.1305519110. Epub 2013 Aug 19.

本文引用的文献

Exploring the limits of fold discrimination by structural alignment: a large scale benchmark using decoys of known fold.通过结构比对探索折叠分类的极限：利用已知折叠的诱饵进行大规模基准测试。

Comput Biol Chem. 2011 Jun;35(3):174-88. doi: 10.1016/j.compbiolchem.2011.04.008. Epub 2011 May 13.

Protein structures, folds and fold spaces.蛋白质结构、折叠和折叠空间。

J Phys Condens Matter. 2010 Jan 27;22(3):033103. doi: 10.1088/0953-8984/22/3/033103. Epub 2009 Dec 21.

A spectral approach to protein structure alignment.一种基于光谱的蛋白质结构比对方法。

IEEE/ACM Trans Comput Biol Bioinform. 2011 Jul-Aug;8(4):867-75. doi: 10.1109/TCBB.2011.24.

GOSSIP: a method for fast and accurate global alignment of protein structures.闲话：一种快速准确的蛋白质结构全局比对方法。

Bioinformatics. 2011 Apr 1;27(7):925-32. doi: 10.1093/bioinformatics/btr044. Epub 2011 Feb 3.

SUPERFAMILY 1.75 including a domain-centric gene ontology method.超家族1.75，包括一种以结构域为中心的基因本体方法。

Nucleic Acids Res. 2011 Jan;39(Database issue):D427-34. doi: 10.1093/nar/gkq1130. Epub 2010 Nov 9.

Searching protein 3-D structures for optimal structure alignment using intelligent algorithms and data structures.使用智能算法和数据结构搜索蛋白质三维结构以实现最佳结构比对。

IEEE Trans Inf Technol Biomed. 2010 Nov;14(6):1378-86. doi: 10.1109/TITB.2010.2079939. Epub 2010 Sep 27.

On the evolutionary origins of "Fold Space Continuity": a study of topological convergence and divergence in mixed alpha-beta domains.论“折叠空间连续性”的进化起源：混合 α-β 域中拓扑收敛和发散的研究。

J Struct Biol. 2010 Dec;172(3):244-52. doi: 10.1016/j.jsb.2010.07.016. Epub 2010 Aug 5.

FragBag, an accurate representation of protein structure, retrieves structural neighbors from the entire PDB quickly and accurately.FragBag 是一种准确表示蛋白质结构的方法，它可以快速准确地从整个 PDB 中检索结构邻居。

Proc Natl Acad Sci U S A. 2010 Feb 23;107(8):3481-6. doi: 10.1073/pnas.0914097107. Epub 2010 Feb 3.

FlexSnap: flexible non-sequential protein structure alignment.FlexSnap：灵活的非顺序蛋白质结构比对

Algorithms Mol Biol. 2010 Jan 4;5:12. doi: 10.1186/1748-7188-5-12.

Flexible structural protein alignment by a sequence of local transformations.通过一系列局部变换进行灵活的结构蛋白比对。

Bioinformatics. 2009 Jul 1;25(13):1625-31. doi: 10.1093/bioinformatics/btp296. Epub 2009 May 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

成对结构比对的进化不准确性。

Evolutionary inaccuracy of pairwise structural alignments.

机构信息

出版信息

MOTIVATION

RESULTS

CONTACT

SUPPLEMENTARY INFORMATION

动机

结果

联系方式

补充信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献