Winn P J, Battey J N D, Schleinkofer K, Banerjee A, Wade R C
European Molecular Biology Laboratory, Heidelberg, Germany.
Proteins. 2005 Feb 1;58(2):367-75. doi: 10.1002/prot.20318.
Sequences of the ubiquitin-conjugating enzyme (UBC or E2) family were used as a test set to investigate issues associated with the high-throughput comparative modelling of protein structures. A semi-automatic method was initially developed with particular emphasis on producing models of a quality suitable for structural comparison. Structural and sequence features of the E2 family were used to improve the sequence alignment and the quality of the structural templates. Initially, failure to correct for subtle structural inconsistencies between templates lead to problems in the comparative analysis of the UBC electrostatic potentials. Modelling of known UBC structures using Modeller 4.0 showed that multiple templates produced, on average, no better models than the use of just one template, as judged by the root-mean-squared deviation between the comparative model and crystal structure backbones. Using four different quality-checking methods, for a given target sequence, it was not possible to distinguish the model most similar to the experimental structure. The UBC models were thus finally modelled using only the crystal structure template with the highest sequence identity to the target to be modelled, and producing only one model solution. Quality checking was used to reject models with obvious structural anomalies (e.g., bad side-chain packing). The resulting models have been used for a comparison of UBC structural features and of their electrostatic potentials. The work was extended through the development of a fully automated pipeline that identifies E2 sequences in the sequence databases, aligns and models them, and calculates the associated electrostatic potential.
泛素结合酶(UBC或E2)家族的序列被用作测试集,以研究与蛋白质结构高通量比较建模相关的问题。最初开发了一种半自动方法,特别强调生成适合结构比较的质量模型。利用E2家族的结构和序列特征来改进序列比对和结构模板的质量。最初,未能校正模板之间细微的结构不一致导致了UBC静电势比较分析中的问题。使用Modeller 4.0对已知的UBC结构进行建模表明,根据比较模型与晶体结构主链之间的均方根偏差判断,多个模板平均产生的模型并不比仅使用一个模板产生的模型更好。对于给定的目标序列,使用四种不同的质量检查方法无法区分与实验结构最相似的模型。因此,最终仅使用与待建模目标序列同一性最高的晶体结构模板对UBC模型进行建模,并仅生成一个模型解。使用质量检查来剔除具有明显结构异常(例如,侧链堆积不良)的模型。所得模型已用于比较UBC的结构特征及其静电势。通过开发一个全自动流程,该工作得到了扩展,该流程可在序列数据库中识别E2序列,对其进行比对和建模,并计算相关的静电势。