Meiler Jens, Baker David
Department of Biochemistry, University of Washington, BOX 357350, Seattle, WA 98195, USA.
J Magn Reson. 2005 Apr;173(2):310-6. doi: 10.1016/j.jmr.2004.11.031.
We illustrate how moderate resolution protein structures can be rapidly obtained by interlinking computational prediction methodologies with un- or partially assigned NMR data. To facilitate the application of our recently described method of ranking and subsequent refining alternative structural models using unassigned NMR data [Proc. Natl. Acad. Sci. USA 100 (2003) 15404] for such "structural genomics"-type experiments it is combined with protein models from several prediction techniques, enhanced to utilize partial assignments, and applied on a protein with an unknown structure and fold. From the original NMR spectra obtained for the 140 residue fumarate sensor DcuS, 1100 1H, 13C, and 15N chemical shift signals, 3000 1H-1H NOESY cross peak intensities, and 209 backbone residual dipolar couplings were extracted and used to rank models produced by de novo structure prediction and comparative modeling methods. The ranking proceeds in two steps: first, an optimal assignment of the NMR peaks to atoms is found for each model independently, and second, the models are ranked based on the consistency between the NMR data and the model assuming these optimal assignments. The low-resolution model selected using this ranking procedure had the correct overall fold and a global backbone RMSD of 6.0 angstrom, and was subsequently refined to 3.7 angstrom RMSD. With the incorporation of a small number of NOE and residual dipolar coupling constraints available very early in the traditional spectral assignment process, a model with an RMSD of 2.8 angstrom could rapidly be built. The ability to generate moderate resolution models within days of NMR data collection should facilitate large scale NMR structure determination efforts.
我们展示了如何通过将计算预测方法与未分配或部分分配的核磁共振(NMR)数据相联系,快速获得中等分辨率的蛋白质结构。为便于将我们最近描述的使用未分配NMR数据对替代结构模型进行排序及后续优化的方法[《美国国家科学院院刊》100 (2003) 15404]应用于此类“结构基因组学”类型的实验,该方法与多种预测技术得到的蛋白质模型相结合,进行了改进以利用部分分配信息,并应用于一种结构和折叠未知的蛋白质。从为140个残基的富马酸传感器DcuS获得的原始NMR谱中,提取了1100个氢、碳和氮化学位移信号、3000个氢 - 氢核欧沃豪斯效应(NOESY)交叉峰强度以及209个主链残余偶极耦合,并用于对从头结构预测和比较建模方法产生的模型进行排序。排序分两步进行:首先,为每个模型独立找到NMR峰到原子的最佳分配;其次,基于NMR数据与假设这些最佳分配的模型之间的一致性对模型进行排序。使用该排序程序选择的低分辨率模型具有正确的整体折叠,主链全局均方根偏差(RMSD)为6.0埃,随后优化至RMSD为3.7埃。在传统谱图分配过程非常早期就纳入少量可用的NOE和残余偶极耦合约束,能够快速构建一个RMSD为2.8埃的模型。在NMR数据收集后的数天内生成中等分辨率模型的能力应有助于大规模NMR结构测定工作。