Taylor F M, Martindale D W
Department of Natural Resource Sciences, McGill University, Ste Anne de Bellevue, Quebec, Canada.
Nucleic Acids Res. 1993 Sep 25;21(19):4610-4. doi: 10.1093/nar/21.19.4610.
We have determined the nucleotide sequence of the cnjB gene from the ciliate Tetrahymena thermophila. This gene is transcriptionally active only during early conjugation, peaking in meiotic prophase. It contains 13 introns, four transcription start points and codes for a putative polypeptide (CnjB) of 1748 amino acids with a calculated molecular weight of 200 kilodaltons and a pl of 7.9. The coding region of cnjB has a low GC content (32% GC) and unusual codon usage. The C-terminal one-third of CnjB consists of three repetitive domains. Introns were absent in this region of cnjB. One of the repetitive domains consists of seven CCHC or retroviral-type zinc fingers, a motif found in one or two copies in retroviral nucleocapsid proteins. This motif has also been found recently in seven copies in the human nucleic-acid binding protein CNBP, in an apparent CNBP homologue in Schizosaccharomyces pombe and in one copy in a Xenopus gene active in early embryos. The other two domains are on either side of the zinc finger domain and contain a repeated glycine-rich motif seen in the heterogeneous nuclear ribonuclear proteins A1 and A2/B1 as well as other proteins. Both CCHC zinc fingers and glycine-rich repeats have been found in proteins with single-stranded nucleic acid-binding activity as well as strand-annealing activity. CnjB is, to our knowledge, the first protein found to contain both types of motifs.
我们已经确定了嗜热四膜虫(Tetrahymena thermophila)中cnjB基因的核苷酸序列。该基因仅在接合前期早期具有转录活性,在减数分裂前期达到峰值。它包含13个内含子、四个转录起始点,编码一个推定的由1748个氨基酸组成的多肽(CnjB),计算分子量为200千道尔顿,等电点为7.9。cnjB的编码区GC含量较低(32%GC),密码子使用情况也不寻常。CnjB的C端三分之一由三个重复结构域组成。cnjB的这一区域没有内含子。其中一个重复结构域由七个CCHC或逆转录病毒型锌指组成,这种基序在逆转录病毒核衣壳蛋白中以一到两个拷贝的形式存在。最近在人类核酸结合蛋白CNBP中也发现了七个拷贝的这种基序,在粟酒裂殖酵母(Schizosaccharomyces pombe)中一个明显的CNBP同源物中以及在非洲爪蟾早期胚胎中活跃的一个基因中发现了一个拷贝。另外两个结构域位于锌指结构域的两侧,包含在不均一核核糖核蛋白A1和A2/B1以及其他蛋白质中可见的富含甘氨酸的重复基序。CCHC锌指和富含甘氨酸的重复序列都已在具有单链核酸结合活性以及链退火活性的蛋白质中发现。据我们所知,CnjB是第一个被发现同时包含这两种基序的蛋白质。