Shen Yang, Delaglio Frank, Cornilescu Gabriel, Bax Ad
Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0520, USA.
J Biomol NMR. 2009 Aug;44(4):213-23. doi: 10.1007/s10858-009-9333-z. Epub 2009 Jun 23.
NMR chemical shifts in proteins depend strongly on local structure. The program TALOS establishes an empirical relation between 13C, 15N and 1H chemical shifts and backbone torsion angles phi and psi (Cornilescu et al. J Biomol NMR 13 289-302, 1999). Extension of the original 20-protein database to 200 proteins increased the fraction of residues for which backbone angles could be predicted from 65 to 74%, while reducing the error rate from 3 to 2.5%. Addition of a two-layer neural network filter to the database fragment selection process forms the basis for a new program, TALOS+, which further enhances the prediction rate to 88.5%, without increasing the error rate. Excluding the 2.5% of residues for which TALOS+ makes predictions that strongly differ from those observed in the crystalline state, the accuracy of predicted phi and psi angles, equals +/-13 degrees . Large discrepancies between predictions and crystal structures are primarily limited to loop regions, and for the few cases where multiple X-ray structures are available such residues are often found in different states in the different structures. The TALOS+ output includes predictions for individual residues with missing chemical shifts, and the neural network component of the program also predicts secondary structure with good accuracy.
蛋白质中的核磁共振化学位移在很大程度上取决于局部结构。TALOS程序建立了13C、15N和1H化学位移与主链扭转角phi和psi之间的经验关系(Cornilescu等人,《生物分子核磁共振杂志》13:289 - 302,1999年)。将原始的20个蛋白质数据库扩展到200个蛋白质后,可预测主链角度的残基比例从65%提高到了74%,同时错误率从3%降低到了2.5%。在数据库片段选择过程中添加一个两层神经网络过滤器构成了新程序TALOS +的基础,该程序进一步将预测率提高到88.5%,且不增加错误率。排除TALOS +做出的预测与晶体状态下观察到的结果有很大差异的2.5%的残基后,预测的phi和psi角的准确度为±13度。预测结果与晶体结构之间的较大差异主要局限于环区,并且在少数有多个X射线结构的情况下,这些残基在不同结构中常常处于不同状态。TALOS +的输出包括对化学位移缺失的单个残基的预测,并且该程序的神经网络组件还能以较高的准确度预测二级结构。