Faulon Jean-Loup, Sale Ken, Young Malin
Sandia National Laboratories, PO Box 969, MS 9951, Livermore, CA 94551, USA.
Protein Sci. 2003 Aug;12(8):1750-61. doi: 10.1110/ps.0305003.
Herein we present a computational technique for generating helix-membrane protein folds matching a predefined set of distance constraints, such as those obtained from NMR NOE, chemical cross-linking, dipolar EPR, and FRET experiments. The purpose of the technique is to provide initial structures for local conformational searches based on either energetic considerations or ad-hoc scoring criteria. In order to properly screen the conformational space, the technique generates an exhaustive list of conformations within a specified root-mean-square deviation (RMSD) where the helices are positioned in order to match the provided distances. Our results indicate that the number of structures decreases exponentially as the number of distances increases, and increases exponentially as the errors associated with the distances increases. We also found the number of solutions to be smaller when all the distances share one helix in common, compared to the case where the distances connect helices in a daisy-chain manner. We found that for 7 helices, at least 15 distances with errors up to 8 A are needed to produce a number of solutions that is not too large to be processed by local search refinement procedures. Finally, without energetic considerations, our enumeration technique retrieved the transmembrane domains of Bacteriorhodopsin (PDB entry1c3w), Halorhodopsin (1e12), Rhodopsin (1f88), Aquaporin-1 (1fqy), Glycerol uptake facilitator protein (1fx8), Sensory Rhodopsin (1jgj), and a subunit of Fumarate reductase flavoprotein (1qlaC) with Calpha level RMSDs of 3.0 A, 2.3 A, 3.2 A, 4.6 A, 6.0 A, 3.7 A, and 4.4 A, respectively.
在此,我们提出一种计算技术,用于生成与预定义的一组距离约束相匹配的螺旋 - 膜蛋白折叠结构,这些距离约束例如是从核磁共振(NMR)的核Overhauser效应(NOE)、化学交联、双极电子顺磁共振(EPR)和荧光共振能量转移(FRET)实验中获得的。该技术的目的是基于能量考量或临时评分标准,为局部构象搜索提供初始结构。为了恰当地筛选构象空间,该技术会生成在指定均方根偏差(RMSD)内的详尽构象列表,其中螺旋的位置要与提供的距离相匹配。我们结果表明,随着距离数量的增加,结构数量呈指数级减少,而随着与距离相关的误差增加,结构数量呈指数级增加。我们还发现,当所有距离共享一个共同螺旋时,与距离以雏菊链方式连接螺旋的情况相比,解的数量更少。我们发现,对于7个螺旋,至少需要15个误差高达8埃的距离,才能产生数量不至于太大而无法通过局部搜索优化程序处理的解。最后,在未考虑能量因素的情况下,我们的枚举技术检索到了细菌视紫红质(PDB编号1c3w)、嗜盐菌视紫红质(1e12)、视紫红质(1f88)、水通道蛋白 - 1(1fqy)、甘油摄取促进蛋白(1fx8)、感官视紫红质(1jgj)以及延胡索酸还原酶黄素蛋白的一个亚基(1qlaC)的跨膜结构域,其Cα水平的均方根偏差分别为3.0埃、2.3埃、3.2埃、4.6埃、6.0埃、3.7埃和4.4埃。