Chu X, Rittner C, Schneider P M
Institut für Rechtsmedizin, Johannes Gutenberg-Universität, Mainz, Germany.
Exp Clin Immunogenet. 1995;12(2):74-81.
The fourth component of the complement system, C4, is encoded by two highly homologous MHC-linked genes expressing the two isotypes C4A and C4B. A gene size polymorphism (either 22.5 or 16 kb) has been described which depends on the presence or absence of a 6.5-kb insertion in intron 9 of the C4 gene. By sequencing a C4A-specific lambda clone from a human genomic library containing the long intron 9 as well as PCR-amplified DNA containing the short intron, the DNA sequences of both introns were determined. The long and short introns have lengths of 6,787 bp and 415 bp, respectively. The sequence of the short intron is almost identical (96%) to the corresponding parts of the long intron. At position 282 of the short intron, a 6,372-bp insertion is present in the long intron which has all characteristics of a full-length endogenous retrovirus. The proviral DNA is flanked by two 6-bp target site repeats. The orientation of the proviral sequence is opposite to that of the C4 coding strand. Long terminal repeats (LTRs) of 548 bp were found at both ends of the provirus. A TATA box and an SV40 enhancer core as well as a polyadenylation signal are present in the LTR. A 5' primer binding site for lysine tRNA was identified. The strongest sequence homologies were found in comparison to human endogenous retrovirus (HERV-K): between 65-88% for gag, pol and env genes. However, a search for open reading frames in these regions indicated the presence of multiple stop codons in all three reading frames.(ABSTRACT TRUNCATED AT 250 WORDS)
补体系统的第四种成分C4由两个高度同源的主要组织相容性复合体(MHC)连锁基因编码,表达两种同种型C4A和C4B。已经描述了一种基因大小多态性(22.5 kb或16 kb),其取决于C4基因内含子9中是否存在6.5 kb的插入片段。通过对来自包含长内含子9的人类基因组文库的C4A特异性λ克隆以及包含短内含子的PCR扩增DNA进行测序,确定了两个内含子的DNA序列。长内含子和短内含子的长度分别为6787 bp和415 bp。短内含子的序列与长内含子的相应部分几乎相同(96%)。在短内含子的第282位,长内含子中存在一个6372 bp的插入片段,它具有全长内源性逆转录病毒的所有特征。前病毒DNA两侧是两个6 bp的靶位点重复序列。前病毒序列的方向与C4编码链的方向相反。在前病毒的两端发现了548 bp的长末端重复序列(LTR)。LTR中存在一个TATA盒、一个SV40增强子核心以及一个多聚腺苷酸化信号。鉴定出了赖氨酸tRNA的5'引物结合位点。与人类内源性逆转录病毒(HERV-K)相比,发现了最强的序列同源性:gag、pol和env基因的同源性在65%-88%之间。然而,在这些区域搜索开放阅读框表明,所有三个阅读框中都存在多个终止密码子。(摘要截断于250字)