Suppr超能文献

在疏水-亲水模型中,蛋白质在八分之三的最优条件下快速折叠。

Fast protein folding in the hydrophobic-hydrophilic model within three-eighths of optimal.

作者信息

Hart W E, Istrail S C

机构信息

Sandia National Laboratories, Massively Parallel Computing Research Laboratory, Albuquerque, NM 87185-1110, USA.

出版信息

J Comput Biol. 1996 Spring;3(1):53-96. doi: 10.1089/cmb.1996.3.53.

Abstract

We present performance-guaranteed approximation algorithms for the protein folding problem in the hydrophobic-hydrophilic model (Dill, 1985). Our algorithms are the first approximation algorithms in the literature with guaranteed performance for this model (Dill, 1994). The hydrophobic-hydrophilic model abstracts the dominant force of protein folding: the hydrophobic interaction. The protein is modeled as a chain of amino acids of length n that are of two types; H (hydrophobic, i.e., nonpolar) and P (hydrophilic, i.e., polar). Although this model is a simplification of more complex protein folding models, the protein folding structure prediction problem is notoriously difficult for this model. Our algorithms have linear (3n) or quadratic time and achieve a three-dimensional protein conformation that has a guaranteed free energy no worse than three-eighths of optimal. This result answers the open problem of Ngo et al. (1994) about the possible existence of an efficient approximation algorithm with guaranteed performance for protein structure prediction in any well-studied model of protein folding. By achieving speed and near-optimality simultaneously, our algorithms rigorously capture salient features of the recently proposed framework of protein folding by Sali et al. (1994). Equally important, the final conformations of our algorithms have significant secondary structure (antiparallel sheets, beta-sheets, compact hydrophobic core). Furthermore, hypothetical folding pathways can be described for our algorithms that fit within the framework of diffusion-collision protein folding proposed by Karplus and Weaver (1979). Computational limitations of algorithms that compute the optimal conformation have restricted their applicability to short sequences (length < or = 90). Because our algorithms trade computational accuracy for speed, they can construct near-optimal conformations in linear time for sequences of any size.

摘要

我们提出了用于疏水-亲水平衡模型(迪尔,1985年)中蛋白质折叠问题的性能保证近似算法。我们的算法是文献中针对该模型具有性能保证的首批近似算法(迪尔,1994年)。疏水-亲水平衡模型抽象出了蛋白质折叠的主导力量:疏水相互作用。蛋白质被建模为长度为n的氨基酸链,这些氨基酸分为两种类型:H(疏水的,即非极性的)和P(亲水的,即极性的)。尽管此模型是对更复杂的蛋白质折叠模型的简化,但蛋白质折叠结构预测问题对于该模型来说仍然非常困难。我们的算法具有线性(3n)或二次时间复杂度,并能得到一种三维蛋白质构象,其保证的自由能不超过最优值的八分之三。这一结果回答了恩戈等人(1994年)关于在任何经过充分研究的蛋白质折叠模型中是否可能存在具有性能保证的高效近似算法来进行蛋白质结构预测的开放性问题。通过同时实现速度和接近最优性,我们的算法严格捕捉了萨利等人(1994年)最近提出的蛋白质折叠框架的显著特征。同样重要的是,我们算法的最终构象具有显著的二级结构(反平行片层、β-片层、紧密的疏水核心)。此外,可以为我们的算法描述符合卡尔普斯和韦弗(1979年)提出的扩散-碰撞蛋白质折叠框架的假设折叠途径。计算最优构象的算法的计算限制使其仅适用于短序列(长度≤90)。由于我们的算法以计算精度换取速度,它们可以在线性时间内为任何大小的序列构建接近最优的构象。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验