Taylor Martin S, Kai Chikatoshi, Kawai Jun, Carninci Piero, Hayashizaki Yoshihide, Semple Colin A M
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom.
PLoS Genet. 2006 Apr;2(4):e30. doi: 10.1371/journal.pgen.0020030. Epub 2006 Apr 28.
We have surveyed the evolutionary trends of mammalian promoters and upstream sequences, utilising large sets of experimentally supported transcription start sites (TSSs). With 30,969 well-defined TSSs from mouse and 26,341 from human, there are sufficient numbers to draw statistically meaningful conclusions and to consider differences between promoter types. Unlike previous smaller studies, we have considered the effects of insertions, deletions, and transposable elements as well as nucleotide substitutions. The rate of promoter evolution relative to that of control sequences has not been consistent between lineages nor within lineages over time. The most pronounced manifestation of this heterotachy is the increased rate of evolution in primate promoters. This increase is seen across different classes of mutation, including substitutions and micro-indel events. We investigated the relationship between promoter and coding sequence selective constraint and suggest that they are generally uncorrelated. This analysis also identified a small number of mouse promoters associated with the immune response that are under positive selection in rodents. We demonstrate significant differences in divergence between functional promoter categories and identify a category of promoters, not associated with conventional protein-coding genes, that has the highest rates of divergence across mammals. We find that evolutionary rates vary both on a fine scale within mammalian promoters and also between different functional classes of promoters. The discovery of heterotachy in promoter evolution, in particular the accelerated evolution of primate promoters, has important implications for our understanding of human evolution and for strategies to detect primate-specific regulatory elements.
我们利用大量经实验支持的转录起始位点(TSS),研究了哺乳动物启动子和上游序列的进化趋势。小鼠有30969个定义明确的TSS,人类有26341个,数量足以得出具有统计学意义的结论,并考虑启动子类型之间的差异。与之前规模较小的研究不同,我们考虑了插入、缺失、转座元件以及核苷酸替换的影响。启动子相对于对照序列的进化速率在不同谱系之间以及同一谱系内随时间并不一致。这种异速进化最明显的表现是灵长类启动子进化速率的增加。在包括替换和微插入/缺失事件在内的不同类型突变中都能看到这种增加。我们研究了启动子与编码序列选择约束之间的关系,并表明它们通常不相关。该分析还确定了少数与免疫反应相关的小鼠启动子在啮齿动物中受到正选择。我们证明了功能启动子类别之间在分歧上存在显著差异,并确定了一类与传统蛋白质编码基因无关的启动子,其在哺乳动物中的分歧率最高。我们发现,进化速率在哺乳动物启动子内部的精细尺度上以及不同功能类别的启动子之间都有所不同。启动子进化中异速进化的发现,特别是灵长类启动子的加速进化,对我们理解人类进化以及检测灵长类特异性调控元件的策略具有重要意义。