Friedberg Iddo, Margalit Hanah
Department of Molecular Genetics and Biotechnology, The Hebrew University-Hadassah Medical School, Jerusalem 91120, Israel.
Protein Sci. 2002 Feb;11(2):350-60. doi: 10.1110/ps.18602.
Many protein pairs that share the same fold do not have any detectable sequence similarity, providing a valuable source of information for studying sequence-structure relationship. In this study, we use a stringent data set of structurally similar, sequence-dissimilar protein pairs to characterize residues that may play a role in the determination of protein structure and/or function. For each protein in the database, we identify amino-acid positions that show residue conservation within both close and distant family members. These positions are termed "persistently conserved". We then proceed to determine the "mutually" persistently conserved (MPC) positions: those structurally aligned positions in a protein pair that are persistently conserved in both pair mates. Because of their intra- and interfamily conservation, these positions are good candidates for determining protein fold and function. We find that 45% of the persistently conserved positions are mutually conserved. A significant fraction of them are located in critical positions for secondary structure determination, they are mostly buried, and many of them form spatial clusters within their protein structures. A substitution matrix based on the subset of MPC positions shows two distinct characteristics: (i) it is different from other available matrices, even those that are derived from structural alignments; (ii) its relative entropy is high, emphasizing the special residue restrictions imposed on these positions. Such a substitution matrix should be valuable for protein design experiments.
许多具有相同折叠结构的蛋白质对没有任何可检测到的序列相似性,这为研究序列-结构关系提供了宝贵的信息来源。在本研究中,我们使用了一组结构相似但序列不相似的蛋白质对的严格数据集,以表征可能在蛋白质结构和/或功能的确定中起作用的残基。对于数据库中的每个蛋白质,我们确定在近缘和远缘家族成员中均显示残基保守性的氨基酸位置。这些位置被称为“持续保守”位置。然后,我们继续确定“相互”持续保守(MPC)位置:蛋白质对中那些在两个配对伙伴中均持续保守的结构对齐位置。由于它们在家族内和家族间的保守性,这些位置是确定蛋白质折叠和功能的良好候选者。我们发现4'5%的持续保守位置是相互保守的。其中很大一部分位于二级结构确定的关键位置,它们大多被埋藏,并且其中许多在其蛋白质结构内形成空间簇。基于MPC位置子集的替换矩阵显示出两个明显的特征:(i)它与其他可用矩阵不同,甚至与那些从结构比对中衍生出来的矩阵也不同;(ii)其相对熵很高,强调了对这些位置施加的特殊残基限制。这样的替换矩阵对于蛋白质设计实验应该是有价值的。