Suppr超能文献

基于位点比对概率的蛋白质序列-结构比对

Protein sequence-structure alignment based on site-alignment probabilities.

作者信息

Miyazawa S

机构信息

Faculty of Technology, Gunma University, Kiryu, Gunma 376, Japan.

出版信息

Genome Inform Ser Workshop Genome Inform. 2000;11:141-50.

Abstract

A protein sequence-structure alignment method for database searches is examined on how effectively this method together with a simple scoring function previously developed can identify compatibilities between sequences and structures of proteins. The scoring function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. Results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold. Probability alignments consisting of most reliable site pairs only can yield small root mean square deviations, and including less reliable pairs increases the deviations. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.

摘要

一种用于数据库搜索的蛋白质序列 - 结构比对方法,被研究其与先前开发的简单评分函数一起,能多有效地识别蛋白质序列与结构之间的兼容性。该评分函数由成对接触能量、残基因排列过密的排斥堆积势以及二级结构的短程势组成。序列 - 结构比对中的成对接触相互作用基于待比对位点对的概率,在平均场近似下进行评估。间隙罚分假定与每个残基位置的接触数成比例,结果间隙将更频繁地出现在蛋白质表面而非核心区域。除了最小能量比对,我们还使用通过按成对比对概率依次比对位点对得到的概率比对。结果表明,当前的能量函数和比对方法既能很好地检测与给定序列兼容的折叠,反之亦然,即能检测与给定折叠兼容的序列。仅由最可靠位点对组成的概率比对能产生较小的均方根偏差,而包含不太可靠的位点对会增加偏差。值得注意的是,通过这种方法能检测到一些序列 - 结构对,它们的序列同一性仅为5 - 20%。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验