Department of Biochemistry and Molecular Biology, College of Medicine, Pennsylvania State University, Hershey, PA 17033, USA.
J Biomol NMR. 2011 Feb;49(2):85-98. doi: 10.1007/s10858-010-9467-z. Epub 2010 Dec 30.
Experimental residual dipolar couplings (RDCs) in combination with structural models have the potential for accelerating the protein backbone resonance assignment process because RDCs can be measured accurately and interpreted quantitatively. However, this application has been limited due to the need for very high-resolution structural templates. Here, we introduce a new approach to resonance assignment based on optimal agreement between the experimental and calculated RDCs from a structural template that contains all assignable residues. To overcome the inherent computational complexity of such a global search, we have adopted an efficient two-stage search algorithm and included connectivity data from conventional assignment experiments. In the first stage, a list of strings of resonances (CA-links) is generated via exhaustive searches for short segments of sequentially connected residues in a protein (local templates), and then ranked by the agreement of the experimental (13)C(α) chemical shifts and (15)N-(1)H RDCs to the predicted values for each local template. In the second stage, the top CA-links for different local templates in stage I are combinatorially connected to produce CA-links for all assignable residues. The resulting CA-links are ranked for resonance assignment according to their measured RDCs and predicted values from a tertiary structure. Since the final RDC ranking of CA-links includes all assignable residues and the assignment is derived from a "global minimum", our approach is far less reliant on the quality of experimental data and structural templates. The present approach is validated with the assignments of several proteins, including a 42 kDa maltose binding protein (MBP) using RDCs and structural templates of varying quality. Since backbone resonance assignment is an essential first step for most of biomolecular NMR applications and is often a bottleneck for large systems, we expect that this new approach will improve the efficiency of the assignment process for small and medium size proteins and will extend the size limits assignable by current methods for proteins with structural models.
实验残余偶极耦合(RDC)与结构模型相结合,具有加速蛋白质骨架共振分配过程的潜力,因为 RDC 可以被准确测量并进行定量解释。然而,由于需要非常高分辨率的结构模板,这种应用受到了限制。在这里,我们介绍了一种新的基于实验和计算 RDC 之间最佳一致性的共振分配方法,该方法使用包含所有可分配残基的结构模板。为了克服这种全局搜索固有的计算复杂性,我们采用了一种高效的两阶段搜索算法,并包括了来自传统分配实验的连通性数据。在第一阶段,通过对蛋白质中顺序连接的短片段进行穷举搜索,生成一系列共振(CA 链接)的列表(局部模板),然后根据实验(13)C(α)化学位移和(15)N-(1)H RDC 与每个局部模板的预测值之间的一致性对其进行排序。在第二阶段,将第一阶段中不同局部模板的前 CA 链接进行组合连接,以生成所有可分配残基的 CA 链接。根据实测 RDC 和三级结构预测值,对生成的 CA 链接进行共振分配排序。由于最终 CA 链接的 RDC 排序包含所有可分配残基,并且分配是从“全局最小值”得出的,因此我们的方法对实验数据和结构模板的质量依赖性要小得多。该方法已通过对几种蛋白质的分配进行验证,包括使用 RDC 和不同质量的结构模板的 42 kDa 麦芽糖结合蛋白(MBP)的分配。由于骨架共振分配是大多数生物分子 NMR 应用的基本第一步,并且通常是大型系统的瓶颈,因此我们预计这种新方法将提高小分子和中等大小蛋白质分配过程的效率,并扩展当前方法可分配的蛋白质的大小限制具有结构模型的蛋白质。