Blouin Christian, Butt Davin, Roger Andrew James
Genome Atlantic, Department of Biochemistry and Molecular Biology, and Faculty of Computer Science, Dalhousie University, 6050 University Avenue, Halifax, NS, Canada B3H 1W5.
Protein Sci. 2004 Mar;13(3):608-16. doi: 10.1110/ps.03299804.
The rapidly evolving subsets of a protein are often evident in multiple sequence alignments as poorly defined, gap-containing regions. We investigated the 3D context of these regions observed in 28 protein structures containing a GTP-binding domain assumed to be homologous to the transforming factor p21-RAS. The phylogenetic depth of this data set is such that it is possible to observe lineages sharing a common protein core that diverged early in the eukaryotic cell history. The sequence variability among these homolog proteins is directly linked to the structural variability of surface loops. We demonstrate that these regions are self-contained and thus mostly free of the evolutionary constraints imposed by the conserved core of the domain. These intraloop interactions have the property to create stem-like structures. Interestingly, these stem-like structures can be observed in loops of varying size, up to the size of small protein domains. We propose a model under which the diversity of protein topologies observed in these loops can be the product of a stochastic sampling of sequence and conformational space in a near-neutral fashion, while the proximity of the functional features of the domain core allows novel beneficial traits to be fixed. Our comparative observations, limited here to the proteins containing the RAS-like GTP-binding domain, suggest that a stochastic process of insertion/deletion analogous to "budding" of loops is a likely mechanism of structural innovation. Such a framework could be experimentally exploited to investigate the folding of increasingly complex model inserts.
蛋白质快速进化的亚群在多序列比对中通常表现为定义不明确、含有缺口的区域。我们研究了在28个含有假定与转化因子p21-RAS同源的GTP结合结构域的蛋白质结构中观察到的这些区域的三维背景。该数据集的系统发育深度使得有可能观察到在真核细胞历史早期就分化的、共享共同蛋白质核心的谱系。这些同源蛋白质之间的序列变异性与表面环的结构变异性直接相关。我们证明这些区域是自成一体的,因此大多不受该结构域保守核心所施加的进化限制。这些环内相互作用具有形成茎状结构的特性。有趣的是,这些茎状结构可以在大小各异的环中观察到,直至小蛋白质结构域的大小。我们提出了一个模型,在该模型中,在这些环中观察到的蛋白质拓扑结构的多样性可能是以近中性方式对序列和构象空间进行随机采样的产物,而结构域核心功能特征的临近性则允许新的有益性状得以固定。我们在此仅限于对含有RAS样GTP结合结构域的蛋白质的比较观察表明,类似于环“出芽”的插入/缺失随机过程可能是结构创新的一种机制。这样一个框架可通过实验加以利用,以研究日益复杂的模型插入片段的折叠情况。