St-Jacques B, Han T H, MacMurray A, Shin H S
Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge 02142.
Mol Cell Biol. 1990 Jan;10(1):138-45. doi: 10.1128/mcb.10.1.138-145.1990.
The H-2 complex of mice contains many genes in addition to the gene families involved in immune reactions. Some of them are believed to function in mouse development, as suggested by the findings that several embryonic lethal mutations map within or near the H-2 complex. We have analyzed the H-2K/tw5 region in an attempt to study non-H-2 genes encoded in this region. Overlapping cosmid clones spanning about 170 kilobase pairs of DNA, including the H-2K/tw5 region of the mouse, have been screened for genes expressed in embryonic carcinoma cells. A transcript of 2.8 kilobase pairs (K. Abe. J.-F. Wei, F.-S. Wei, Y.-C. Hsu, H. Uehara, K. Artzt, and D. Bennett, EMBO J. 7:3441-3449, 1988) encoded by the KE 4 gene flanking H-2K distally was identified. The transcript was abundantly expressed in embryonic carcinoma cells but was present at low levels in other tissues in adults. A cDNA for this transcript was isolated from the F9 embryonic carcinoma cell line and sequenced. It potentially encodes a protein of 436 amino acids with several interesting features. First, it contains two regions made of well-conserved repeats unusually rich in histidine residues. In the repeats, histidine alternates with other amino acids, notably glycine or serine. Second, the two histidine-rich regions are separated by three putative membrane-spanning domains. Third, the N-terminal part of the sequence shows characteristics of a signal peptide. The results indicate that the protein coded by the gene may be a transmembrane protein with histidine-rich charge clusters. A similar sequence motif found in other known genes allows speculation on the possible functional of this gene.
小鼠的H-2复合体除了包含参与免疫反应的基因家族外,还含有许多其他基因。其中一些基因被认为在小鼠发育过程中发挥作用,这是由几个胚胎致死突变定位在H-2复合体内或其附近的研究结果所表明的。我们分析了H-2K/tw5区域,试图研究该区域编码的非H-2基因。我们筛选了跨越约170千碱基对DNA的重叠黏粒克隆,包括小鼠的H-2K/tw5区域,以寻找在胚胎癌细胞中表达的基因。我们鉴定出了由位于H-2K远端侧翼的KE 4基因编码的一个2.8千碱基对的转录本(K. Abe、J.-F. Wei、F.-S. Wei、Y.-C. Hsu、H. Uehara、K. Artzt和D. Bennett,《欧洲分子生物学组织杂志》7:3441-3449,1988年)。该转录本在胚胎癌细胞中大量表达,但在成年动物的其他组织中含量较低。我们从F9胚胎癌细胞系中分离出了该转录本的cDNA并进行了测序。它可能编码一个含有几个有趣特征的436个氨基酸的蛋白质。首先,它包含两个由保守重复序列组成的区域,这些重复序列中组氨酸残基异常丰富。在这些重复序列中,组氨酸与其他氨基酸交替出现,尤其是甘氨酸或丝氨酸。其次,这两个富含组氨酸的区域被三个假定的跨膜结构域隔开。第三,该序列的N端部分显示出信号肽的特征。结果表明,该基因编码的蛋白质可能是一种具有富含组氨酸电荷簇的跨膜蛋白。在其他已知基因中发现的类似序列基序使我们能够推测该基因可能的功能。