Mariño-Ramírez Leonardo, Jordan I King, Landsman David
Computational Biology Branch, National Center for Biotechnology Information, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894-6075, USA.
Genome Biol. 2006;7(12):R122. doi: 10.1186/gb-2006-7-12-r122.
Core histone genes are periodically expressed along the cell cycle and peak during S phase. Core histone gene expression is deeply evolutionarily conserved from the yeast Saccharomyces cerevisiae to human.
We evaluated the evolutionary dynamics of the specific regulatory mechanisms that give rise to the conserved histone regulatory phenotype. In contrast to the conservation of core histone gene expression patterns, the core histone regulatory machinery is highly divergent between species. There has been substantial evolutionary turnover of cis-regulatory sequence motifs along with the transcription factors that bind them. The regulatory mechanisms employed by members of the four core histone families are more similar within species than within gene families. The presence of species-specific histone regulatory mechanisms is opposite to what is seen at the protein sequence level. Core histone proteins are more similar within families, irrespective of their species of origin, than between families, which is consistent with the shared common ancestry of the members of individual histone families. Structure and sequence comparisons between histone families reveal that H2A and H2B form one related group whereas H3 and H4 form a distinct group, which is consistent with the nucleosome assembly dynamics.
The dissonance between the evolutionary conservation of the core histone gene regulatory phenotypes and the divergence of their regulatory mechanisms indicates a highly dynamic mode of regulatory evolution. This distinct mode of regulatory evolution is probably facilitated by a solution space for promoter sequences, in terms of functionally viable cis-regulatory sites, that is substantially greater than that of protein sequences.
核心组蛋白基因在细胞周期中呈周期性表达,并在S期达到峰值。从酿酒酵母到人类,核心组蛋白基因表达在进化上高度保守。
我们评估了产生保守的组蛋白调控表型的特定调控机制的进化动态。与核心组蛋白基因表达模式的保守性相反,核心组蛋白调控机制在不同物种间高度分化。顺式调控序列基序以及与之结合的转录因子发生了大量的进化更替。四个核心组蛋白家族成员所采用的调控机制在物种内比在基因家族内更为相似。物种特异性组蛋白调控机制的存在与蛋白质序列水平上的情况相反。核心组蛋白在家族内比在家族间更为相似,无论其起源物种如何,这与单个组蛋白家族成员的共同祖先一致。组蛋白家族之间的结构和序列比较表明,H2A和H2B形成一个相关组,而H3和H4形成一个不同的组,这与核小体组装动态一致。
核心组蛋白基因调控表型的进化保守性与其调控机制的分化之间的不一致表明了一种高度动态的调控进化模式。就功能可行的顺式调控位点而言,启动子序列的解决方案空间可能比蛋白质序列的解决方案空间大得多,这可能促进了这种独特的调控进化模式。