Simon Katya, Xu Jun, Kim Chinpal, Skrynnikov Nikolai R
Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA.
J Biomol NMR. 2005 Oct;33(2):83-93. doi: 10.1007/s10858-005-2601-7.
It has been commonly recognized that residual dipolar coupling data provide a measure of quality for protein structures. To quantify this observation, a database of 100 single-domain proteins has been compiled where each protein was represented by two independently solved structures. Backbone 1H-15N dipolar couplings were simulated for the target structures and then fitted to the model structures. The fits were characterized by an R-factor which was corrected for the effects of non-uniform distribution of dipolar vectors on a unit sphere. The analyses show that favorable R values virtually guarantee high accuracy of the model structure (where accuracy is defined as the backbone coordinate rms deviation). On the other hand, unfavorable R values do not necessarily suggest low accuracy. Based on the simulated data, a simple empirical formula is proposed to estimate the accuracy of protein structures. The method is illustrated with a number of examples, including PDZ2 domain of human phosphatase hPTP1E.
人们普遍认识到,剩余偶极耦合数据可作为衡量蛋白质结构质量的指标。为了量化这一观察结果,已编制了一个包含100个单结构域蛋白质的数据库,其中每个蛋白质由两个独立解析的结构表示。针对目标结构模拟了主链1H-15N偶极耦合,然后将其拟合到模型结构中。拟合结果通过一个R因子来表征,该因子针对偶极矢量在单位球面上的非均匀分布效应进行了校正。分析表明,良好的R值实际上保证了模型结构的高精度(其中精度定义为主链坐标均方根偏差)。另一方面,不利的R值并不一定意味着精度低。基于模拟数据,提出了一个简单的经验公式来估计蛋白质结构的精度。该方法通过多个例子进行了说明,包括人磷酸酶hPTP1E的PDZ2结构域。