Suppr超能文献

基于空位比对和经验性配对得分函数的全局最优蛋白质穿线法

Global optimum protein threading with gapped alignment and empirical pair score functions.

作者信息

Lathrop R H, Smith T F

机构信息

Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge 02139, USA.

出版信息

J Mol Biol. 1996 Feb 2;255(4):641-65. doi: 10.1006/jmbi.1996.0053.

Abstract

We describe a branch-and-bound search algorithm for finding the exact global optimum gapped sequence-structure alignment ("threading") between a protein sequence and a protein core or structural model, using an arbitrary amino acid pair score function (e.g. contact potentials, knowledge-based potentials, potentials of mean force, etc.). The search method imposes minimal conditions on how structural environments are defined or the form of the score function, and allows arbitrary sequence-specific functions for scoring loops and active site residues. Consequently the search method can be used with many different score functions and threading methodologies; this paper illustrates five from the literature. On a desktop workstation running LISP, we have found the global optimum protein sequence-structure alignment in NP-hard search spaces as large as 9.6 x 10(31), at rates ranging as high as 6.8 x 10(28) equivalent threadings per second (most of which are pruned before they ever are examined explicitly). Continuing the procedure past the global optimum enumerates successive candidate threadings in monotonically increasing score order. We give efficient algorithms for search space size, uniform random sampling, segment placement probabilities, mean, standard deviation and partition function. The method should prove useful for structure prediction, as well as for critical evaluation of new pair score functions.

摘要

我们描述了一种分支定界搜索算法,用于在蛋白质序列与蛋白质核心或结构模型之间找到精确的全局最优带空位序列 - 结构比对(“穿线法”),该算法使用任意氨基酸对评分函数(例如接触势、基于知识的势、平均力势等)。该搜索方法对结构环境的定义方式或评分函数的形式施加了最小条件,并允许使用任意的序列特异性函数对环和活性位点残基进行评分。因此,该搜索方法可与许多不同的评分函数和穿线方法一起使用;本文举例说明了文献中的五种方法。在运行LISP的桌面工作站上,我们在高达9.6×10³¹的NP难搜索空间中找到了全局最优蛋白质序列 - 结构比对,速率高达每秒6.8×10²⁸次等效穿线(其中大部分在被明确检查之前就被修剪掉了)。在找到全局最优解之后继续该过程,会按分数单调递增的顺序枚举连续的候选穿线。我们给出了搜索空间大小、均匀随机抽样、片段放置概率、均值、标准差和配分函数的高效算法。该方法对于结构预测以及新的对评分函数的关键评估应该是有用的。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验