Marsh T L, Reich C I, Whitelock R B, Olsen G J
Department of Microbiology, University of Illinois, Urbana 61801.
Proc Natl Acad Sci U S A. 1994 May 10;91(10):4180-4. doi: 10.1073/pnas.91.10.4180.
The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archael and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archael protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archael protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
真核生物转录起始的第一步由TATA结合蛋白介导,它是转录因子IID复合物的一个亚基。我们已经从嗜热栖热菌(古菌的一种,以前称为古细菌)中克隆并测序了一种假定的该真核蛋白同源物的基因。古菌基因编码的蛋白质是一个保守结构域的串联重复序列,与真核生物对应物中的重复结构域相对应。对该重复序列两半部分的分子系统发育分析表明,这种重复发生在古菌域和真核域分化之前。结合之前关于RNA聚合酶亚基组成和序列相似性的观察,以及在沃氏火球菌(嗜热栖热菌的亲缘物种)中发现的类转录因子IIB序列,似乎真核转录装置的主要特征在真核细胞组织起源之前就已确立。古菌蛋白两半部分之间的差异小于单个真核序列两半部分之间的差异,这表明古菌蛋白的序列变化平均速率低于其真核生物对应物。就这种较低速率适用于整个基因组而言,相比于对真核生物相应序列的研究,通过对古菌序列的研究更有可能清晰地展现出产生当今基因组的早期基因(和基因家族)的情况。