Koretke K K, Luthey-Schulten Z, Wolynes P G
School of Chemical Sciences, University of Illinois, Urbana 61801, USA.
Protein Sci. 1996 Jun;5(6):1043-59. doi: 10.1002/pro.5560050607.
A quantitative form of the principle of minimal frustration is used to obtain from a database analysis statistical mechanical energy functions and gap parameters for aligning sequences to three-dimensional structures. The analysis that partially takes into account correlations in the energy landscape improves upon the previous approximations of Goldstein et al. (1994, 1995) (Goldstein R, Luthey-Schulten Z, Wolynes P, 1994, Proceedings of the 27th Hawaii International Conference on System Sciences. Los Alamitos, California: IEEE Computer Society Press. pp 306-315; Goldstein R, Luthey-Schulten Z, Wolynes P, 1995, In: Elber R, ed. New developments in theoretical studies of proteins. Singapore: World Scientific). The energy function allows for ordering of alignments based on the compatibility of a sequence to be in a given structure (i.e., lowest energy) and therefore removes the necessity of using percent identity or similarity as scoring parameters. The alignments produced by the energy function on distant homologues with low percent identity (less than 21%) are generally better than those generated with evolutionary information. The lowest energy alignment generated with the energy function for sequences containing prosite signatures but unknown structures is a structure containing the same prosite signature, providing a check on the robustness of the algorithm. Finally, the energy function can make use of known experimental evidence as constraints within the alignment algorithm to aid in finding the correct structural alignment.
最小受挫原则的一种定量形式被用于从数据库分析中获得统计力学能量函数和用于将序列与三维结构进行比对的间隙参数。这种部分考虑能量景观中相关性的分析改进了Goldstein等人(1994年、1995年)之前的近似方法(Goldstein R, Luthey-Schulten Z, Wolynes P, 1994, 《第27届夏威夷国际系统科学会议论文集》。加利福尼亚州洛斯阿拉米托斯:IEEE计算机学会出版社。第306 - 315页;Goldstein R, Luthey-Schulten Z, Wolynes P, 1995, 载于Elber R编著的《蛋白质理论研究的新进展》。新加坡:世界科学出版社)。该能量函数允许基于序列与给定结构的兼容性(即最低能量)来对比对进行排序,因此消除了使用百分比一致性或相似性作为评分参数的必要性。由能量函数对低百分比一致性(小于21%)的远源同源物产生的比对通常比利用进化信息生成的比对更好。对于包含Prosite特征但结构未知的序列,用能量函数生成的最低能量比对是一个包含相同Prosite特征的结构,这为算法的稳健性提供了检验。最后,能量函数可以在比对算法中利用已知的实验证据作为约束条件,以帮助找到正确的结构比对。