Nayeem Akbar, Sitkoff Doree, Krystek Stanley
Computer-Assisted Drug Design, Pharmaceutical Research Institute, Bristol-Myers Squibb, Princeton, New Jersey 08543, USA.
Protein Sci. 2006 Apr;15(4):808-24. doi: 10.1110/ps.051892906.
An open question in protein homology modeling is, how well do current modeling packages satisfy the dual criteria of quality of results and practical ease of use? To address this question objectively, we examined homology-built models of a variety of therapeutically relevant proteins. The sequence identities across these proteins range from 19% to 76%. A novel metric, the difference alignment index (DAI), is developed to aid in quantifying the quality of local sequence alignments. The DAI is also used to construct the relative sequence alignment (RSA), a new representation of global sequence alignment that facilitates comparison of sequence alignments from different methods. Comparisons of the sequence alignments in terms of the RSA and alignment methodologies are made to better understand the advantages and caveats of each method. All sequence alignments and corresponding 3D models are compared to their respective structure-based alignments and crystal structures. A variety of protein modeling software was used. We find that at sequence identities >40%, all packages give similar (and satisfactory) results; at lower sequence identities (<25%), the sequence alignments generated by Profit and Prime, which incorporate structural information in their sequence alignment, stand out from the rest. Moreover, the model generated by Prime in this low sequence identity region is noted to be superior to the rest. Additionally, we note that DSModeler and MOE, which generate reasonable models for sequence identities >25%, are significantly more functional and easier to use when compared with the other structure-building software.
蛋白质同源性建模中一个悬而未决的问题是,当前的建模软件包在多大程度上满足结果质量和实际易用性这两个双重标准?为了客观地解决这个问题,我们研究了多种与治疗相关的蛋白质的同源性构建模型。这些蛋白质之间的序列同一性范围从19%到76%。我们开发了一种新的指标,即差异比对指数(DAI),以帮助量化局部序列比对的质量。DAI还用于构建相对序列比对(RSA),这是一种全局序列比对的新表示形式,便于比较来自不同方法的序列比对。通过RSA和比对方法对序列比对进行比较,以更好地理解每种方法的优缺点。将所有序列比对和相应的3D模型与其各自基于结构的比对和晶体结构进行比较。我们使用了多种蛋白质建模软件。我们发现,在序列同一性>40%时,所有软件包都给出了相似(且令人满意)的结果;在较低的序列同一性(<25%)时,Profit和Prime生成的序列比对在其余软件中脱颖而出,它们在序列比对中纳入了结构信息。此外,在这个低序列同一性区域,Prime生成的模型被认为优于其他模型。此外,我们注意到,对于序列同一性>25%能生成合理模型的DSModeler和MOE,与其他结构构建软件相比,功能更强大且更易于使用。