Langmead Christopher James, Donald Bruce Randall
Dartmouth Computer Science Department, Hanover, NH 03755, USA.
J Biomol NMR. 2004 Jun;29(2):111-38. doi: 10.1023/B:JNMR.0000019247.89110.e6.
We report an automated procedure for high-throughput NMR resonance assignment for a protein of known structure, or of an homologous structure. Our algorithm performs Nuclear Vector Replacement (NVR) by Expectation/Maximization (EM) to compute assignments. NVR correlates experimentally-measured NH residual dipolar couplings (RDCs) and chemical shifts to a given a priori whole-protein 3D structural model. The algorithm requires only uniform (15)N-labelling of the protein, and processes unassigned H(N)-(15)N HSQC spectra, H(N)-(15)N RDCs, and sparse H(N)-H(N) NOE's (d(NN)s). NVR runs in minutes and efficiently assigns the (H(N),(15)N) backbone resonances as well as the sparse d(NN)s from the 3D (15)N-NOESY spectrum, in O (n(3)) time. The algorithm is demonstrated on NMR data from a 76-residue protein, human ubiquitin, matched to four structures, including one mutant (homolog), determined either by X-ray crystallography or by different NMR experiments (without RDCs). NVR achieves an average assignment accuracy of over 99%. We further demonstrate the feasibility of our algorithm for different and larger proteins, using different combinations of real and simulated NMR data for hen lysozyme (129 residues) and streptococcal protein G (56 residues), matched to a variety of 3D structural models.
我们报告了一种针对已知结构或同源结构蛋白质的高通量核磁共振共振归属的自动化程序。我们的算法通过期望最大化(EM)执行核向量替换(NVR)来计算归属。NVR将实验测量的NH残余偶极耦合(RDC)和化学位移与给定的先验全蛋白三维结构模型相关联。该算法仅需要蛋白质的均匀(15)N标记,并处理未归属的H(N)-(15)N HSQC谱、H(N)-(15)N RDC和稀疏的H(N)-H(N)核Overhauser效应(d(NN))。NVR在数分钟内运行,并能在O(n(3))时间内有效地归属(H(N),(15)N)主链共振以及来自三维(15)N-NOESY谱的稀疏d(NN)。该算法在来自76个残基的蛋白质人泛素的核磁共振数据上得到了验证,该数据与四种结构匹配,包括一种通过X射线晶体学或不同的核磁共振实验(无RDC)确定的突变体(同源物)。NVR实现了超过99%的平均归属准确率。我们还使用与各种三维结构模型匹配的鸡溶菌酶(129个残基)和链球菌蛋白G(56个残基)的真实和模拟核磁共振数据的不同组合,进一步证明了我们的算法对不同的更大蛋白质的可行性。