Analyse algébrique, Institut de Mathématiques de Jussieu, Université Pierre et Marie Curie, Paris VI, France.
Centre d'Informatique pour la Biologie, Institut Pasteur, Paris, France.
Comput Struct Biotechnol J. 2013 Jun 19;7:e201305001. doi: 10.5936/csbj.201305001. eCollection 2013.
There is a need for rules allowing three-dimensional structure information to be derived from protein sequences. In this work, consideration of an elementary protein folding step allows protein sub-sequences which optimize folding to be derived for any given protein sequence. Classical mechanics applied to this system and the energy conservation law during the elementary folding step yields an equation whose solutions are taken over the field of rational numbers. This formalism is applied to beta-sheets containing two edge strands and at least two central strands. The number of protein sub-sequences optimized for folding per amino acid in beta-strands is shown in particular to predict edge strands from protein sequences. Topological information on beta-strands and loops connecting them is derived for protein sequences with a prediction accuracy of 75%. The statistical significance of the finding is given. Applications in protein structure prediction are envisioned such as for the quality assessment of protein structure models.
需要制定规则,允许从蛋白质序列中推导出三维结构信息。在这项工作中,考虑到一个基本的蛋白质折叠步骤,可以为任何给定的蛋白质序列推导出优化折叠的蛋白质子序列。将经典力学应用于该系统,并在基本折叠步骤中考虑能量守恒定律,得到一个方程,其解取有理数域。该形式体系应用于包含两条边缘链和至少两条中心链的β-折叠。特别地,该方法用于根据蛋白质序列预测边缘链,展示了每一个氨基酸折叠优化的蛋白质子序列的数量。为具有预测精度为 75%的蛋白质序列推导出β-折叠和连接它们的环的拓扑信息。给出了该发现的统计学意义。可以预见该方法在蛋白质结构预测中的应用,例如用于评估蛋白质结构模型的质量。