目的确定蛋白质结构叠加的残基范围。

Objective identification of residue ranges for the superposition of protein structures.

机构信息

Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, and Frankfurt Institute for Advanced Studies, Goethe University Frankfurt am Main, Max-von-Laue-Str, 9, 60438 Frankfurt am Main, Germany.

出版信息

BMC Bioinformatics. 2011 May 18;12:170. doi: 10.1186/1471-2105-12-170.

DOI:10.1186/1471-2105-12-170

PMID:21592348

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3120703/

Abstract

BACKGROUND

The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace.

RESULTS

We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank.

CONCLUSIONS

The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures.

摘要

背景

客观选择氨基酸残基范围进行结构叠加对于有意义且一致的蛋白质结构分析非常重要。到目前为止，对于实验确定的蛋白质结构，还没有广泛使用的标准来选择这些残基范围，手动选择残基范围或使用非最佳标准仍然很常见。

结果

我们提出了一种自动且客观的方法，用于为蛋白质结构的叠加和分析寻找氨基酸残基范围，特别是对于 NMR 结构计算产生的结构束。该方法在算法 CYRANGE 中实现，在大多数常见情况下，包括低精度结构束、多域蛋白、对称多聚体和蛋白质复合物，无需蛋白质特定参数调整，就可以生成合适的残基范围。残基范围的选择是为了包含尽可能多的蛋白质结构域残基，如果增加其数量会导致 RMSD 值急剧上升。残基范围是通过首先根据距离方差矩阵将残基聚类为结构域，然后针对每个结构域逐个排除残基来细化初始残基选择，直到 RMSD 值的相对降低变得不显著为止。为了获得尽可能简单但不是更简单的结果，对打开间隙的惩罚有利于连续的残基范围。结果针对一组 37 个蛋白质进行了给出，并与常用的蛋白质结构验证包的结果进行了比较。我们还为蛋白质数据库中的 6351 个 NMR 结构提供了残基范围。

结论

CYRANGE 方法能够自动确定用于各种蛋白质结构叠加的蛋白质结构束的残基范围。该方法正确识别有序区域。基于 CYRANGE 残基范围的全局结构叠加允许清晰地呈现结构，并且所选范围内不存在不必要的小间隙。在大多数情况下，与其他方法相比，CYRANGE 的残基范围包含更少的间隙，并且覆盖了序列的更大部分，而不会显著增加 RMSD 值。因此，CYRANGE 为蛋白质结构叠加中残基范围的选择提供了一种客观且自动的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e04/3120703/305b87e4e66b/1471-2105-12-170-1.jpg

相似文献

Objective identification of residue ranges for the superposition of protein structures.目的确定蛋白质结构叠加的残基范围。

BMC Bioinformatics. 2011 May 18;12:170. doi: 10.1186/1471-2105-12-170.

Robust probabilistic superposition and comparison of protein structures.蛋白质结构的稳健概率叠加和比较。

BMC Bioinformatics. 2010 Jul 1;11:363. doi: 10.1186/1471-2105-11-363.

Overcoming sequence misalignments with weighted structural superposition.利用加权结构叠加克服序列不对齐。

Proteins. 2012 Nov;80(11):2523-35. doi: 10.1002/prot.24134. Epub 2012 Jul 28.

Clustering algorithms for identifying core atom sets and for assessing the precision of protein structure ensembles.用于识别核心原子集和评估蛋白质结构集合精度的聚类算法。

Proteins. 2005 Jun 1;59(4):673-86. doi: 10.1002/prot.20402.

MUSTANG-MR structural sieving server: applications in protein structural analysis and crystallography. MUSTANG-MR 结构筛析服务器：在蛋白质结构分析和晶体学中的应用。

PLoS One. 2010 Apr 6;5(4):e10048. doi: 10.1371/journal.pone.0010048.

Algorithms for optimal protein structure alignment.最优蛋白质结构比对算法。

Bioinformatics. 2009 Nov 1;25(21):2751-6. doi: 10.1093/bioinformatics/btp530. Epub 2009 Sep 4.

CATHEDRAL: a fast and effective algorithm to predict folds and domain boundaries from multidomain protein structures.大教堂：一种从多结构域蛋白质结构预测折叠和结构域边界的快速有效算法。

PLoS Comput Biol. 2007 Nov;3(11):e232. doi: 10.1371/journal.pcbi.0030232.

Structure determination of symmetric homo-oligomers by a complete search of symmetry configuration space, using NMR restraints and van der Waals packing.通过对对称构型空间进行完全搜索，并利用核磁共振约束和范德华堆积来确定对称同型寡聚体的结构。

Proteins. 2006 Oct 1;65(1):203-19. doi: 10.1002/prot.21091.

Tertiary structure predictions on a comprehensive benchmark of medium to large size proteins.对中大型蛋白质综合基准进行三级结构预测。

Biophys J. 2004 Oct;87(4):2647-55. doi: 10.1529/biophysj.104.045385.

A large data set comparison of protein structures determined by crystallography and NMR: statistical test for structural differences and the effect of crystal packing.通过晶体学和核磁共振确定的蛋白质结构的大数据集比较：结构差异的统计检验及晶体堆积的影响

Proteins. 2007 Nov 15;69(3):449-65. doi: 10.1002/prot.21507.

引用本文的文献

Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR.通过AlphaFold-NMR构象选择揭示的蛋白质隐藏结构状态

Res Sq. 2025 Feb 19:rs.3.rs-5994356. doi: 10.21203/rs.3.rs-5994356/v1.

Integrative Modeling of Protein-Polypeptide Complexes by Bayesian Model Selection using AlphaFold and NMR Chemical Shift Perturbation Data.利用AlphaFold和核磁共振化学位移扰动数据通过贝叶斯模型选择对蛋白质-多肽复合物进行整合建模

bioRxiv. 2024 Sep 22:2024.09.19.613999. doi: 10.1101/2024.09.19.613999.

Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR.通过AlphaFold-NMR构象选择揭示的蛋白质隐藏结构状态

bioRxiv. 2025 Feb 26:2024.06.26.600902. doi: 10.1101/2024.06.26.600902.

Restraint validation of biomolecular structures determined by NMR in the Protein Data Bank.利用 NMR 确定的生物分子结构在蛋白质数据库中的约束验证。

Structure. 2024 Jun 6;32(6):824-837.e1. doi: 10.1016/j.str.2024.02.011. Epub 2024 Mar 14.

The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis.100 种蛋白质 NMR 波谱数据集：生物分子 NMR 数据分析的资源。

Sci Data. 2024 Jan 4;11(1):30. doi: 10.1038/s41597-023-02879-5.

Time-optimized protein NMR assignment with an integrative deep learning approach using AlphaFold and chemical shift prediction.基于 AlphaFold 和化学位移预测的集成深度学习方法实现蛋白质 NMR 谱峰的时间优化分配。

Sci Adv. 2023 Nov 24;9(47):eadi9323. doi: 10.1126/sciadv.adi9323. Epub 2023 Nov 22.

Representing structures of the multiple conformational states of proteins.表示蛋白质的多种构象状态的结构。

Curr Opin Struct Biol. 2023 Dec;83:102703. doi: 10.1016/j.sbi.2023.102703. Epub 2023 Sep 28.

Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data.使用实验 NMR 数据对单体 AlphaFold2 蛋白质结构模型进行盲评估。

J Magn Reson. 2023 Jul;352:107481. doi: 10.1016/j.jmr.2023.107481. Epub 2023 May 20.

Blind Assessment of Monomeric AlphaFold2 Protein Structure Models with Experimental NMR Data.利用实验核磁共振数据对单体AlphaFold2蛋白质结构模型进行盲评估。

bioRxiv. 2023 Jan 22:2023.01.22.525096. doi: 10.1101/2023.01.22.525096.

Identification of resistance gene analogs of the NBS-LRR family through transcriptome probing and prediction of the expressome of under dieback disease stress.通过转录组探测鉴定NBS-LRR家族的抗性基因类似物，并预测死于枯萎病胁迫下的表达组。

Front Genet. 2022 Oct 7;13:1036029. doi: 10.3389/fgene.2022.1036029. eCollection 2022.

本文引用的文献

Robust probabilistic superposition and comparison of protein structures.蛋白质结构的稳健概率叠加和比较。

BMC Bioinformatics. 2010 Jul 1;11:363. doi: 10.1186/1471-2105-11-363.

Structural investigation of the C-terminal catalytic fragment of presenilin 1.早老素1 C末端催化片段的结构研究

Proc Natl Acad Sci U S A. 2010 May 25;107(21):9644-9. doi: 10.1073/pnas.1000778107. Epub 2010 May 5.

MolProbity: all-atom contacts and structure validation for proteins and nucleic acids.MolProbity：蛋白质和核酸的全原子接触与结构验证

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W375-83. doi: 10.1093/nar/gkm216. Epub 2007 Apr 22.

Solution structure of an atypical WW domain in a novel beta-clam-like dimeric form.一种新型β-蛤样二聚体形式的非典型WW结构域的溶液结构。

FEBS Lett. 2007 Feb 6;581(3):462-8. doi: 10.1016/j.febslet.2007.01.008. Epub 2007 Jan 16.

Evaluating protein structures determined by structural genomics consortia.评估由结构基因组学联盟确定的蛋白质结构。

Proteins. 2007 Mar 1;66(4):778-95. doi: 10.1002/prot.21165.

Automated protein structure determination from NMR spectra.通过核磁共振光谱自动测定蛋白质结构。

J Am Chem Soc. 2006 Oct 11;128(40):13112-22. doi: 10.1021/ja061136l.

Solution structures of the first and fourth TSR domains of F-spondin.F-spondin的第一个和第四个TSR结构域的溶液结构

Proteins. 2006 Aug 15;64(3):665-72. doi: 10.1002/prot.21030.

Is one solution good enough?一个解决方案就足够了吗？

Nat Struct Mol Biol. 2006 Mar;13(3):184-5; discussion 185. doi: 10.1038/nsmb0306-184.

Optimal isotope labelling for NMR protein structure determinations.用于核磁共振蛋白质结构测定的最佳同位素标记

Nature. 2006 Mar 2;440(7080):52-7. doi: 10.1038/nature04525.

Solution structure of the Src homology 2 domain from the human feline sarcoma oncogene Fes.源自人类猫肉瘤癌基因Fes的Src同源2结构域的溶液结构。

J Biomol NMR. 2005 Apr;31(4):357-61. doi: 10.1007/s10858-005-0946-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

目的确定蛋白质结构叠加的残基范围。

Objective identification of residue ranges for the superposition of protein structures.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献