Department of Biochemistry, Duke University, Durham, North Carolina.
Program in Computational Biology and Bioinformatics, Duke University, Durham, North Carolina.
Biophys J. 2024 Feb 6;123(3):317-333. doi: 10.1016/j.bpj.2023.12.021. Epub 2023 Dec 29.
Helix-coil models are routinely used to interpret circular dichroism data of helical peptides or predict the helicity of naturally-occurring and designed polypeptides. However, a helix-coil model contains significantly more information than mean helicity alone, as it defines the entire ensemble-the equilibrium population of every possible helix-coil configuration-for a given sequence. Many desirable quantities of this ensemble are either not obtained as ensemble averages or are not available using standard helicity-averaging calculations. Enumeration of the entire ensemble can allow calculation of a wider set of ensemble properties, but the exponential size of the configuration space typically renders this intractable. We present an algorithm that efficiently approximates the helix-coil ensemble to arbitrary accuracy by sequentially generating a list of the M highest populated configurations in descending order of population. Truncating this list of (configuration, population) pairs at a desired accuracy provides an approximating sub-ensemble. We demonstrate several uses of this approach for providing insight into helix-coil ensembles and folding mechanisms, including landscape visualization.
螺旋-卷曲模型通常用于解释螺旋肽的圆二色性数据或预测天然存在和设计的多肽的螺旋性。然而,与平均螺旋度相比,螺旋-卷曲模型包含的信息量要大得多,因为它为给定的序列定义了整个集合——每个可能的螺旋-卷曲构象的平衡种群。该集合中的许多理想数量要么不能作为集合平均值获得,要么不能使用标准的螺旋平均值计算获得。对整个集合进行枚举可以允许计算更广泛的集合属性,但构象空间的指数大小通常使得这难以处理。我们提出了一种算法,通过顺序生成按种群递减顺序排列的 M 个最高种群的配置列表,以任意精度有效地逼近螺旋-卷曲集合。在所需精度处截断此(构象,种群)对列表提供了一个近似的子集合。我们展示了这种方法在提供对螺旋-卷曲集合和折叠机制的洞察力方面的几种用途,包括景观可视化。