Liu Yongxing, Beveridge D L
Chemistry Department and Molecular Biophysics Program, Wesleyan University, Middletown, Connecticut 06457, USA.
Proteins. 2002 Jan 1;46(1):128-46. doi: 10.1002/prot.10020.
A theoretical and computational approach to ab initio structure prediction for polypeptides in water is described and applied to selected amino acid sequences for testing and preliminary validation. The method builds systematically on the extensive efforts applied to parameterization of molecular dynamics (MD) force fields, employs an empirically well-validated continuum dielectric model for solvation, and an eminently parallelizable approach to conformational search. The effective free energy of polypeptide chains is estimated from AMBER united atom potential functions, with internal degrees of freedom for both backbone and amino acid side chains explicitly treated. The hydration free energy of each structure is determined using the Generalized Born/Solvent Accessibility (GBSA) method, modified and reparameterized to include atom types consistent with the AMBER force field. The conformational search procedure employs a multiple copy, Monte Carlo simulated annealing (MCSA) protocol in full torsion angle space, applied iteratively on sets of structures of progressively lower free energy until a prediction of a structure with lowest effective free energy is obtained. Calibration tests for the effective energy function and search algorithm are performed on the alanine dipeptide, selected protein crystal structures, and united atom decoys on barnase, crambin, and six examples from the Rosetta set. Specific demonstration cases of the method are provided for the 8-mer sequence of Ala residues, a 12-residue peptide with longer side chains QLLKKLLQQLKQ, a de novo designed 16 residue peptide of sequence (AAQAA)3Y, a 15-residue sequence with a beta sheet motif, GEWTWDATKTFTVTE, and a 36 residue small protein, Villin headpiece. The Ala 8-mer readily formed an alpha-helix. An alpha-helix structure was predicted for the 16-mer, consistent with observed results from IR and CD spectroscopy and with the pattern in psi/straight phi angles of known protein structures. The predicted structure for the 12-mer, composed of a mix of helix and less regular elements of secondary structure, lies 2.65 A RMS from the observed crystal structure. Structure prediction for the 8-mer beta-motif resulted in form 4.50 A RMS from the crystal geometry. For Villin, the predicted native form is very close to the crystal structure, RMS values of 3.5 A (including sidechains), and 1.01 A (main chain only). The methodology permits a detailed analysis of the molecular forces which dominate various segments of the predicted folding trajectory. Analysis of the results in terms of internal torsional, electrostatic and van der Waals and the electrostatic and non-electrostatic contributions to hydration, including the hydrophobic effect, is presented.
本文描述了一种用于预测水中多肽从头结构的理论和计算方法,并将其应用于选定的氨基酸序列进行测试和初步验证。该方法系统地建立在对分子动力学(MD)力场参数化所做的大量工作基础之上,采用了经验证有效的连续介质介电模型进行溶剂化处理,并采用了一种高度可并行化的构象搜索方法。多肽链的有效自由能由AMBER联合原子势函数估算,其中主链和氨基酸侧链的内部自由度都得到了明确处理。每种结构的水合自由能使用广义玻恩/溶剂可及性(GBSA)方法确定,并经过修改和重新参数化,以纳入与AMBER力场一致的原子类型。构象搜索过程在全扭转角空间中采用多重复制蒙特卡罗模拟退火(MCSA)协议,对自由能逐渐降低的结构集进行迭代应用,直到获得具有最低有效自由能的结构预测。对丙氨酸二肽、选定的蛋白质晶体结构以及来自巴那斯酶、克拉宾和罗塞塔集的六个示例的联合原子诱饵进行了有效能量函数和搜索算法的校准测试。该方法针对丙氨酸残基的8聚体序列、具有较长侧链的12残基肽QLLKKLLQQLKQ、从头设计的序列为(AAQAA)3Y的16残基肽、具有β-折叠基序的15残基序列GEWTWDATKTFTVTE以及36残基的小蛋白维林头部结构域提供了具体的示范案例。丙氨酸8聚体很容易形成α-螺旋。预测16聚体为α-螺旋结构,这与红外光谱和圆二色光谱的观察结果以及已知蛋白质结构的ψ/直φ角模式一致。预测的12残基肽结构由螺旋和不太规则的二级结构元素混合组成,与观察到的晶体结构的均方根偏差(RMS)为2.65 Å。对8聚体β-基序的结构预测与晶体几何结构的RMS为4.50 Å。对于维林,预测的天然形式与晶体结构非常接近,RMS值为3.5 Å(包括侧链)和1.01 Å(仅主链)。该方法允许对主导预测折叠轨迹各个部分的分子力进行详细分析。本文给出了根据内部扭转、静电和范德华力以及对水合作用的静电和非静电贡献(包括疏水效应)对结果的分析。