Chakravarty Suvobrata, Godbole Sucheta, Zhang Bing, Berger Seth, Sanchez Roberto
Department of Structural and Chemical Biology, Mount Sinai School of Medicine, 1425 Madison Avenue, New York, NY 10029, USA.
BMC Struct Biol. 2008 Jul 16;8:31. doi: 10.1186/1472-6807-8-31.
Although multiple templates are frequently used in comparative modeling, the effect of inclusion of additional template(s) on model accuracy (when compared to that of corresponding single-template based models) is not clear. To address this, we systematically analyze two-template models, the simplest case of multiple-template modeling. For an existing target-template pair (single-template modeling), a two-template based model of the target sequence is constructed by including an additional template without changing the original alignment to measure the effect of the second template on model accuracy.
Even though in a large number of cases a two-template model showed higher accuracy than the corresponding one-template model, over the entire dataset only a marginal improvement was observed on average, as there were many cases where no change or the reverse change was observed. The increase in accuracy due to the structural complementarity of the templates increases at higher alignment accuracies. The combination of templates showing the highest potential for improvement is that where both templates share similar and low (less than 30%) sequence identity with the target, as well as low sequence identity with each other. The structural similarity between the templates also helps in identifying template combinations having a higher chance of resulting in an improved model.
Inclusion of additional template(s) does not necessarily improve model quality, but there are distinct combinations of the two templates, which can be selected a priori, that tend to show improvement in model quality over the single template model. The benefit derived from the structural complementarity is dependent on the accuracy of the modeling alignment. The study helps to explain the observation that a careful selection of templates together with an accurate target:template alignment are necessary to the benefit from using multiple templates in comparative modeling and provides guidelines to maximize the benefit from using multiple templates. This enables formulation of simple template selection rules to rank targets of a protein family in the context of structural genomics.
尽管在比较建模中经常使用多个模板,但添加额外模板对模型准确性的影响(与相应的基于单模板的模型相比)尚不清楚。为了解决这个问题,我们系统地分析了双模板模型,这是多模板建模中最简单的情况。对于现有的目标 - 模板对(单模板建模),通过添加一个额外的模板来构建目标序列的双模板模型,同时不改变原始比对,以测量第二个模板对模型准确性的影响。
尽管在大量情况下,双模板模型显示出比相应的单模板模型更高的准确性,但在整个数据集中,平均仅观察到微小的改进,因为有许多情况未观察到变化或出现反向变化。由于模板的结构互补性导致的准确性提高在比对准确性较高时会增加。显示出最大改进潜力的模板组合是两个模板与目标序列具有相似且低(小于30%)的序列同一性,并且彼此之间也具有低序列同一性。模板之间的结构相似性也有助于识别更有可能产生改进模型的模板组合。
添加额外模板不一定能提高模型质量,但存在两种模板的特定组合,可以预先选择,这些组合往往比单模板模型在模型质量上有所提高。从结构互补性中获得的益处取决于建模比对的准确性。该研究有助于解释这样的观察结果,即在比较建模中,仔细选择模板以及准确的目标:模板比对对于从使用多个模板中受益是必要的,并提供了最大化使用多个模板益处的指导方针。这使得能够制定简单的模板选择规则,以便在结构基因组学的背景下对蛋白质家族的目标进行排名。