Suppr超能文献

黑腹果蝇原肌球蛋白I基因的结构与DNA序列

Structure and DNA sequence of the tropomyosin I gene from Drosophila melanogaster.

作者信息

Basi G S, Storti R V

出版信息

J Biol Chem. 1986 Jan 15;261(2):817-27.

PMID:3079761
Abstract

The muscle tropomyosin I gene of Drosophila melanogaster undergoes alternative splicing in different muscles of the fly to generate two isoforms of the same protein. We report here the structural analysis and DNA sequence of the tropomyosin I gene. The gene spans 5 kilobase of DNA and is comprised of five exons and four introns. Exon 4 is alternatively spliced in RNA of different muscle, resulting in two isoforms of the same protein. The gene lacks a "TATA" box homology at the map position; it is usually found in the vast majority of eukaryotic genes characterized thus far. Instead, a series of three alternating TG stretches are located upstream from the site of initiation of transcription. The gene encodes a 5' untranslated leader of 103 base pairs, and the 3' untranslated region comprises between 30 and 50% of the transcripts. The DNA sequence is extremely G + C rich in the protein coding regions of the gene, and A + T rich in the non-coding, flanking, and intron regions. The DNA sequence upstream of the acceptor sites in the two introns which are subject to alternative splicing displays a stretch of homology which is noted. The 3' untranslated region of the fifth exon contains multiple polyadenylation sites. The 284 amino acid protein encoded by the gene is split by introns between residues 198/199 and 257/258. These sites correlate closely with two important functional domains in the tropomyosin molecule. A comparison of the first 257 amino acids and the carboxyl-terminal 27 amino acids of the Drosophila and vertebrate tropomyosins together, shows two distinct and mutually exclusive classes for these domains. The functional significance of the Drosophila tropomyosin isoforms is discussed.

摘要

黑腹果蝇的肌肉原肌球蛋白I基因在果蝇的不同肌肉中进行可变剪接,以产生同一蛋白质的两种异构体。我们在此报告原肌球蛋白I基因的结构分析和DNA序列。该基因跨度为5千碱基,由五个外显子和四个内含子组成。外显子4在不同肌肉的RNA中进行可变剪接,产生同一蛋白质的两种异构体。该基因在图谱位置缺乏“TATA”框同源性;而“TATA”框同源性通常在迄今为止已鉴定的绝大多数真核基因中存在。相反,在转录起始位点上游有一系列三个交替的TG序列。该基因编码一个103个碱基对的5'非翻译前导序列,3'非翻译区占转录本的30%至50%。该基因的蛋白质编码区的DNA序列富含G + C,而非编码、侧翼和内含子区域富含A + T。在两个可变剪接的内含子的受体位点上游的DNA序列显示出一段值得注意的同源性。第五个外显子的3'非翻译区包含多个聚腺苷酸化位点。该基因编码的284个氨基酸的蛋白质在内含子处被分割在第198/199位和第257/258位氨基酸之间。这些位点与原肌球蛋白分子中的两个重要功能域密切相关。将果蝇和脊椎动物原肌球蛋白的前257个氨基酸和羧基末端的27个氨基酸放在一起比较,显示出这些结构域有两个不同且相互排斥的类别。文中还讨论了果蝇原肌球蛋白异构体的功能意义。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验