Roy K, Mitsugi K, Sirotnak F M
Program in Molecular Pharmacology and Therapeutics, Memorial Sloan-Kettering Cancer Center, Cornell University, New York, New York 10021, USA.
J Biol Chem. 1996 Sep 27;271(39):23820-7. doi: 10.1074/jbc.271.39.23820.
The organization of the murine folylpolyglutamate synthetase (FPGS) gene has been determined by sequence analysis that also revealed an interesting complexity in the case of exon 1. The entire nucleotide sequence of the L1210 FPGS cDNA, the 3'- and 5'-untranslated regions, the mitochondrial leader sequence, and the coding region were found to be distributed on 15 exons with an overall length of 10.358 kilobases. Two splice variants of exon 1 were identified by screening of an L1210 cell cDNA library. Variant I (exons 1a + 1b plus 2-15) incorporates all of the sequence homologous to the recently reported (Taylor, S. M., Freemantle, S. J., and Moran, R. G. (1995) Cancer Res. 55, 6030-6034) human exon 1, including two ATG start codons at positions +1 and +126, and encodes both mitochondrial and cytosolic form of FPGS. The most prevalent variant, Variant II (exons 1b plus 2 to 15), incorporates only a portion (92 nucleotides at the 3' end) of this sequence, incorporates only one ATG start codon at position +126, and encodes only cytosolic FPGS. The existence of this variant is consistent with the identification of an appropriately situated internal donor/acceptor site in what was believed to be exon 1. A third related variant (Variant III) with a novel 5' termini was originally identified by screening of a mouse liver cDNA library. This variant, which occurs at moderately low frequency in the L1210 cell cDNA library, incorporates an alternate to exon 1a (exon 1c) spliced to exon 1b plus exons 2-15 and encodes a different mitochondrial leader peptide than Variant I. The identification of these variants suggests another possible mechanism, i.e. at the level of precursor mRNA splicing, for regulating synthesis of mitochondrial versus cytosolic forms of FPGS in the cell. Exon 1c is positioned in the gene upstream of exon 1a separated by an intron of 56 nucleotides within a region of DNA sequence that like the homologous human sequence is distinctly promoter-like. However, the sequence of this region differs from the human sequence in terms of the number, position, and type of putative regulatory elements, particularly in regard to the number of SP-1 binding sites and the position of multiple transcription start sites as determined by enzymatic primer extension.
通过序列分析确定了小鼠叶酰聚谷氨酸合成酶(FPGS)基因的组织方式,该分析还揭示了外显子1情况中有趣的复杂性。发现L1210 FPGS cDNA的完整核苷酸序列、3'和5'非翻译区、线粒体前导序列以及编码区分布在15个外显子上,总长为10.358千碱基。通过筛选L1210细胞cDNA文库鉴定出了外显子1的两种剪接变体。变体I(外显子1a + 1b加上2 - 15)包含了与最近报道的(泰勒,S.M.,弗里曼特尔,S.J.,和莫兰,R.G.(1995)癌症研究55,6030 - 6034)人类外显子1同源的所有序列,包括位于+1和+126位置的两个ATG起始密码子,并编码FPGS的线粒体和胞质形式。最常见的变体,变体II(外显子1b加上2至15),仅包含该序列的一部分(3'端的92个核苷酸),仅在+126位置包含一个ATG起始密码子,并且仅编码胞质FPGS。该变体的存在与在被认为是外显子1的区域中鉴定到的适当位置的内部供体/受体位点一致。通过筛选小鼠肝脏cDNA文库最初鉴定出了第三种相关变体(变体III),其具有新的5'末端。该变体在L1210细胞cDNA文库中出现频率适中较低,包含与外显子1a交替的外显子1c(外显子1c)剪接到外显子1b加上外显子2 - 15,并编码与变体I不同的线粒体前导肽。这些变体的鉴定表明了另一种可能的机制,即在细胞中调节FPGS线粒体形式与胞质形式合成的前体mRNA剪接水平。外显子1c位于基因中外显子1a的上游,由56个核苷酸的内含子隔开,位于一个DNA序列区域内,该区域与同源人类序列一样明显类似启动子。然而,该区域的序列在推定调控元件的数量、位置和类型方面与人类序列不同,特别是在SP - 1结合位点的数量以及通过酶促引物延伸确定的多个转录起始位点的位置方面。