Suppr超能文献

疏水性氨基酸的周期性分布有助于定义基本构建模块,从而比对远缘相关的蛋白质。

Periodic distributions of hydrophobic amino acids allows the definition of fundamental building blocks to align distantly related proteins.

作者信息

Baussand J, Deremble C, Carbone A

机构信息

Génomique Analytique, INSERM UMRS511, Université Pierre et Marie Curie-Paris 6, 91, Bd de l'Hôpital, 75013 Paris, France.

出版信息

Proteins. 2007 May 15;67(3):695-708. doi: 10.1002/prot.21319.

Abstract

Several studies on large and small families of proteins proved in a general manner that hydrophobic amino acids are globally conserved even if they are subjected to high rate substitution. Statistical analysis of amino acids evolution within blocks of hydrophobic amino acids detected in sequences suggests their usage as a basic structural pattern to align pairs of proteins of less than 25% sequence identity, with no need of knowing their 3D structure. The authors present a new global alignment method and an automatic tool for Proteins with HYdrophobic Blocks ALignment (PHYBAL) based on the combinatorics of overlapping hydrophobic blocks. Two substitution matrices modeling a different selective pressure inside and outside hydrophobic blocks are constructed, the Inside Hydrophobic Blocks Matrix and the Outside Hydrophobic Blocks Matrix, and a 4D space of gap values is explored. PHYBAL performance is evaluated against Needleman and Wunsch algorithm run with Blosum 30, Blosum 45, Blosum 62, Gonnet, HSDM, PAM250, Johnson and Remote Homo matrices. PHYBAL behavior is analyzed on eight randomly selected pairs of proteins of >30% sequence identity that cover a large spectrum of structural properties. It is also validated on two large datasets, the 127 pairs of the Domingues dataset with >30% sequence identity, and 181 pairs issued from BAliBASE 2.0 and ranked by percentage of identity from 7 to 25%. Results confirm the importance of considering substitution matrices modeling hydrophobic contexts and a 4D space of gap values in aligning distantly related proteins. Two new notions of local and global stability are defined to assess the robustness of an alignment algorithm and the accuracy of PHYBAL. A new notion, the SAD-coefficient, to assess the difficulty of structural alignment is also introduced. PHYBAL has been compared with Hydrophobic Cluster Analysis and HMMSUM methods.

摘要

多项针对大小不同蛋白质家族的研究普遍证明,即使疏水氨基酸经历高替换率,它们在整体上仍是保守的。对序列中检测到的疏水氨基酸块内氨基酸进化的统计分析表明,它们可作为一种基本结构模式,用于比对序列同一性低于25%的蛋白质对,而无需了解其三维结构。作者提出了一种新的全局比对方法以及一种基于重叠疏水块组合学的用于具有疏水块比对的蛋白质(PHYBAL)的自动工具。构建了两个分别模拟疏水块内部和外部不同选择压力的替换矩阵,即疏水块内部矩阵和疏水块外部矩阵,并探索了一个四维空位值空间。将PHYBAL的性能与使用Blosum 30、Blosum 45、Blosum 62、Gonnet、HSDM、PAM250、Johnson和远程同源矩阵运行的Needleman和Wunsch算法进行了评估。对八对随机选择的序列同一性大于30%且涵盖广泛结构特性的蛋白质对分析了PHYBAL的行为。还在两个大型数据集上进行了验证,一个是序列同一性大于30%的Domingues数据集中的127对,另一个是来自BAliBASE 2.0且同一性百分比从7%到25%排序的181对数据。结果证实了在比对远缘相关蛋白质时考虑模拟疏水环境的替换矩阵和四维空位值空间的重要性。定义了局部稳定性和全局稳定性这两个新概念,以评估比对算法的稳健性和PHYBAL的准确性。还引入了一个用于评估结构比对难度的新概念——SAD系数。已将PHYBAL与疏水簇分析和HMMSUM方法进行了比较。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验