Guo Zhenhua, Stiller John W
Department of Biology, East Carolina University, Howell Science Complex N 108, Greenville, NC 27858, USA.
BMC Genomics. 2004 Sep 20;5:69. doi: 10.1186/1471-2164-5-69.
Cyclin-dependent kinases (CDKs) are a large family of proteins that function in a variety of key regulatory pathways in eukaryotic cells, including control over the cell cycle and gene transcription. Among the most important and broadly studied of these roles is reversible phosphorylation of the C-terminal domain (CTD) of RNA polymerase II, part of a complex array of CTD/protein interactions that coordinate the RNAP II transcription cycle. The RNAP CTD is strongly conserved in some groups of eukaryotes, but highly degenerate or absent in others; the reasons for these differences in stabilizing selection on CTD structure are not clear. Given the importance of reversible phosphorylation for CTD-based transcription, the distribution and evolutionary history of CDKs may be a key to understanding differences in constraints on CTD structure; however, the origins and evolutionary relationships of CTD kinases have not been investigated thoroughly. Moreover, although the functions of most CDKs are reasonably well studied in mammals and yeasts, very little is known from most other eukaryotes.
Here we identify 123 CDK family members from animals, plants, yeasts, and four protists from which genome sequences have been completed, and 10 additional CDKs from incomplete genome sequences of organisms with known CTD sequences. Comparative genomic and phylogenetic analyses suggest that cell-cycle CDKs are present in all organisms sampled in this study. In contrast, no clear orthologs of transcription-related CDKs are identified in the most putatively ancestral eukaryotes, Trypanosoma or Giardia. Kinases involved in CTD phosphorylation, CDK7, CDK8 and CDK9, all are recovered as well-supported and distinct orthologous families, but their relationships to each other and other CDKs are not well-resolved. Significantly, clear orthologs of CDK7 and CDK8 are restricted to only those organisms belonging to groups in which the RNAP II CTD is strongly conserved.
The apparent origins of CDK7 and CDK8, or at least their conservation as clearly recognizable orthologous families, correlate with strong stabilizing selection on RNAP II CTD structure. This suggests co-evolution of the CTD and these CTD-directed CDKs. This observation is consistent with the hypothesis that CDK7 and CDK8 originated at about the same time that the CTD was canalized as the staging platform RNAP II transcription. Alternatively, extensive CTD phosphorylation may occur in only a subset of eukaryotes and, when present, this interaction results in greater stabilizing selection on both CTD and CDK sequences. Overall, our results suggest that transcription-related kinases originated after cell-cycle related CDKs, and became more evolutionarily and functionally diverse as transcriptional complexity increased.
细胞周期蛋白依赖性激酶(CDK)是一大类蛋白质,在真核细胞的多种关键调控途径中发挥作用,包括对细胞周期和基因转录的控制。在这些作用中,最重要且研究最广泛的是RNA聚合酶II C末端结构域(CTD)的可逆磷酸化,这是一系列协调RNA聚合酶II转录周期的CTD/蛋白质相互作用复合体的一部分。RNA聚合酶CTD在某些真核生物群体中高度保守,但在其他群体中高度退化或缺失;CTD结构稳定选择存在这些差异的原因尚不清楚。鉴于可逆磷酸化对基于CTD的转录的重要性,CDK的分布和进化历史可能是理解CTD结构限制差异的关键;然而,CTD激酶的起源和进化关系尚未得到充分研究。此外,尽管大多数CDK在哺乳动物和酵母中的功能已得到较好研究,但对于大多数其他真核生物却知之甚少。
在这里,我们从动物、植物、酵母以及已完成基因组测序的四种原生生物中鉴定出123个CDK家族成员,并从具有已知CTD序列的生物体的不完整基因组序列中鉴定出另外10个CDK。比较基因组学和系统发育分析表明,细胞周期CDK存在于本研究采样的所有生物体中。相比之下,在最具推测性的原始真核生物锥虫或贾第虫中未鉴定出与转录相关的CDK的明确直系同源物。参与CTD磷酸化的激酶CDK7、CDK8和CDK9,均被鉴定为得到充分支持且不同的直系同源家族,但它们彼此之间以及与其他CDK的关系尚未得到很好的解析。值得注意的是,CDK7和CDK8的明确直系同源物仅限于那些RNA聚合酶II CTD高度保守的生物群体。
CDK7和CDK8的明显起源,或者至少它们作为可清晰识别的直系同源家族的保守性,与RNA聚合酶II CTD结构上的强烈稳定选择相关。这表明CTD与这些CTD导向的CDK共同进化。这一观察结果与CDK7和CDK8大约在CTD被确定为RNA聚合酶II转录的分期平台时起源的假设一致。或者,广泛的CTD磷酸化可能仅发生在真核生物的一个子集中,并且当存在时,这种相互作用会导致对CTD和CDK序列的更大稳定选择。总体而言,我们的结果表明与转录相关的激酶起源于与细胞周期相关的CDK之后,并随着转录复杂性的增加在进化和功能上变得更加多样化。