Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins.

Author information

Gerstein M, Levitt M

Affiliation

Molecular Biophysics & Biochemistry Department, Yale University, New Haven, Connecticut 06520-8114, USA.

Publication information

Protein Sci. 1998 Feb;7(2):445-56. doi: 10.1002/pro.5560070226.

Abstract

We apply a simple method for aligning protein sequences on the basis of a 3D structure, on a large scale, to the proteins in the scop classification of fold families. This allows us to assess, understand, and improve our automatic method against an objective, manually derived standard, a type of comprehensive evaluation that has not yet been possible for other structural alignment algorithms. Our basic approach directly matches the backbones of two structures, using repeated cycles of dynamic programming and least-squares fitting to determine an alignment minimizing coordinate difference. Because of simplicity, our method can be readily modified to take into account additional features of protein structure such as the orientation of side chains or the location-dependent cost of opening a gap. Our basic method, augmented by such modifications, can find reasonable alignments for all but 1.5% of the known structural similarities in scop, i.e., all but 32 of the 2,107 superfamily pairs. We discuss the specific protein structural features that make these 32 pairs so difficult to align and show how our procedure effectively partitions the relationships in scop into different categories, depending on what aspects of protein structure are involved (e.g., depending on whether or not consideration of side-chain orientation is necessary for proper alignment). We also show how our pairwise alignment procedure can be extended to generate a multiple alignment for a group of related structures. We have compared these alignments in detail with corresponding manual ones culled from the literature. We find good agreement (to within 95% for the core regions), and detailed comparison highlights how particular protein structural features (such as certain strands) are problematical to align, giving somewhat ambiguous results. With these improvements and systematic tests, our procedure should be useful for the development of scop and the future classification of protein folds.
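The cycle of dynamic programming and least-squares fitting described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names (`kabsch_fit`, `dp_align`, `iterative_align`), the similarity form `M / (1 + (d/d0)^2)`, the parameter values `M`, `d0`, and `gap`, the constant gap penalty, and the diagonal starting equivalence are all assumptions made for the example. Inputs are assumed to be N x 3 arrays of C-alpha coordinates.

```python
import numpy as np

def kabsch_fit(P, Q):
    """Least-squares rotation R and translation t so that P @ R.T + t ~ Q."""
    Pc, Qc = P.mean(axis=0), Q.mean(axis=0)
    H = (P - Pc).T @ (Q - Qc)                  # 3x3 covariance matrix
    U, _, Vt = np.linalg.svd(H)
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0  # avoid a reflection
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    return R, Qc - R @ Pc

def dp_align(S, gap):
    """Global dynamic programming over score matrix S with a constant gap cost."""
    n, m = S.shape
    F = np.zeros((n + 1, m + 1))
    F[1:, 0] = -gap * np.arange(1, n + 1)
    F[0, 1:] = -gap * np.arange(1, m + 1)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            F[i, j] = max(F[i - 1, j - 1] + S[i - 1, j - 1],
                          F[i - 1, j] - gap,
                          F[i, j - 1] - gap)
    # Trace back from the corner to recover the matched residue pairs.
    pairs, i, j = [], n, m
    while i > 0 and j > 0:
        if F[i, j] == F[i - 1, j - 1] + S[i - 1, j - 1]:
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif F[i, j] == F[i - 1, j] - gap:
            i -= 1
        else:
            j -= 1
    return pairs[::-1]

def iterative_align(A, B, n_cycles=10, M=20.0, d0=2.24, gap=10.0):
    """Alternate superposition and re-alignment until the pairing stops changing.

    M, d0, gap and the diagonal starting guess are illustrative assumptions.
    """
    k = min(len(A), len(B))
    pairs = [(i, i) for i in range(k)]         # assumed initial equivalence
    for _ in range(n_cycles):
        ia, ib = zip(*pairs)
        R, t = kabsch_fit(A[list(ia)], B[list(ib)])
        A_fit = A @ R.T + t
        # Inter-structure distances after fitting, turned into similarity scores.
        d = np.linalg.norm(A_fit[:, None, :] - B[None, :, :], axis=2)
        S = M / (1.0 + (d / d0) ** 2)
        new_pairs = dp_align(S, gap)
        if new_pairs == pairs or len(new_pairs) < 3:
            break
        pairs = new_pairs
    ia, ib = zip(*pairs)
    R, t = kabsch_fit(A[list(ia)], B[list(ib)])  # final fit on the final pairing
    return pairs, R, t
```

Calling `iterative_align(A, B)` returns the matched index pairs together with the rotation and translation that superimpose `A` on `B`. The modifications mentioned in the abstract, such as side-chain orientation or a location-dependent gap-opening cost, would enter by changing how `S` and the gap term are computed; they are not modeled in this sketch.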
