Suppr超能文献

长末端重复样元件位于一个缺乏内含子的人类免疫球蛋白ε假基因两侧。

Long terminal repeat-like elements flank a human immunoglobulin epsilon pseudogene that lacks introns.

作者信息

Ueda S, Nakai S, Nishida Y, Hisajima H, Honjo T

出版信息

EMBO J. 1982;1(12):1539-44. doi: 10.1002/j.1460-2075.1982.tb01352.x.

Abstract

There are at least three immunoglobulin epsilon genes (C epsilon 1, C epsilon 2, and C epsilon 3) in the human genome. The nucleotide sequences of the expressed epsilon gene (C epsilon 1) and one (C epsilon 3) of the two epsilon pseudogenes were compared. The results show that the C epsilon 3 gene lacks the three intervening sequences entirely and has a 31-base A-rich sequence 16 bases 3' to the putative poly(A) addition signal, indicating that the C epsilon 3 gene is a processed gene. The C epsilon 3 gene sequence is homologous to the five separate DNA segments of the C epsilon 1 gene; namely, a segment in the 5'-flanking region (100 bases) and four exons, which are interrupted by a spacer region or intervening sequences. Long terminal repeat (LTR)-like sequences which contain TATAAA and AATAAA sequences as well as terminal inverted repeats are present in both 5'- and 3'-flanking regions. The 5' and 3' LTR-like sequences do not, however, constitute a direct repeat, unlike transposable elements of eukaryotes and retroviruses. The 3' LTR-like sequence is repetitive in the human genome, but is not homologous to the Alu family DNA. Models for the evolutionary origin of the processed gene flanked by the LTR-like sequences are discussed. The C epsilon 3 gene has a new open frame which codes potentially for an unknown protein of 292 amino acid residues.

摘要

人类基因组中至少存在三个免疫球蛋白ε基因(Cε1、Cε2和Cε3)。对已表达的ε基因(Cε1)和两个ε假基因之一(Cε3)的核苷酸序列进行了比较。结果显示,Cε3基因完全缺失三个间隔序列,并且在假定的多聚腺苷酸化信号下游16个碱基处有一个31个碱基的富含A的序列,这表明Cε3基因是一个已加工基因。Cε3基因序列与Cε1基因的五个独立DNA片段同源;即5'侧翼区域的一个片段(100个碱基)和四个外显子,它们被一个间隔区域或间隔序列中断。5'和3'侧翼区域都存在类似长末端重复序列(LTR)的序列,其中包含TATAAA和AATAAA序列以及末端反向重复序列。然而,与真核生物和逆转录病毒的转座元件不同,5'和3'类似LTR的序列并不构成直接重复。3'类似LTR的序列在人类基因组中是重复的,但与Alu家族DNA不同源。讨论了由类似LTR序列侧翼的已加工基因的进化起源模型。Cε3基因有一个新的开放阅读框,可能编码一个由292个氨基酸残基组成的未知蛋白质。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8abe/553248/2641d91b4c73/emboj00304-0066-a.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验