Avraham S, Austen K F, Nicodemus C F, Gartner M C, Stevens R L
Department of Medicine, Harvard Medical School, Boston, Massachusetts.
J Biol Chem. 1989 Oct 5;264(28):16719-26.
A mouse liver genomic library was probed with a 450-base pair AccI----3' gene-specific fragment of a mouse bone marrow-derived mast cell proteoglycan cDNA to isolate 15-18-kilobase (kb) genomic clones containing the gene that encodes the peptide core of mouse secretory granule proteoglycans. Based on the nucleotide sequences of its 2.0-3.5-kb subcloned fragments, this mouse gene consists of three exons. The first exon contains 41 base pairs of untranslated nucleotides that are present in the 5' region of the transcript and also encodes the hydrophobic 25-amino acid signal peptide. The second exon encodes a 48-amino acid sequence that would be predicted to be the N terminus of the peptide core after the signal peptide has been removed in the endoplasmic reticulum. The third exon encodes a 79-amino acid sequence that includes the 15 amino acids immediately preceding an alternating serine-glycine 21-amino acid sequence for glycosaminoglycan attachment, and the subsequent C-terminal 43 amino acids; this exon also contains the 424 untranslated nucleotides present in the 3' region of the transcript. Primer extension and S1 nuclease protection analyses were performed to determine the transcription initiation site of the mouse gene. Rat-1 fibroblasts were cotransfected with the selectable marker pSV2 neo and a lambda clone (lambda MG-PG1) to obtain two rat-1 fibroblast cell lines that had the mouse secretory granule proteoglycan gene integrated into their genomes. RNA blot analysis of both cell lines revealed the presence of the 1.0-kb secretory granule proteoglycan peptide core mRNA transcript, indicating that lambda MG-PG1 contained the entire mouse secretory granule proteoglycan peptide core gene including some of the regulatory elements in its promoter region. The gene that encodes the peptide core of mouse secretory granule proteoglycans is the first proteoglycan gene to have its complete exon/intron organization determined and to be transfected and expressed in another cell type.
用小鼠骨髓来源的肥大细胞蛋白聚糖cDNA的一个450碱基对的AccI----3'基因特异性片段探测小鼠肝脏基因组文库,以分离出15 - 18千碱基(kb)的基因组克隆,这些克隆含有编码小鼠分泌颗粒蛋白聚糖肽核心的基因。根据其2.0 - 3.5 kb亚克隆片段的核苷酸序列,该小鼠基因由三个外显子组成。第一个外显子包含41个碱基对的非翻译核苷酸,存在于转录本的5'区域,还编码25个氨基酸的疏水信号肽。第二个外显子编码一个48个氨基酸的序列,预计在内质网中信号肽被去除后,该序列将是肽核心的N端。第三个外显子编码一个79个氨基酸的序列,包括紧接在用于糖胺聚糖附着的交替丝氨酸 - 甘氨酸21个氨基酸序列之前的15个氨基酸,以及随后的C端43个氨基酸;该外显子还包含转录本3'区域中存在的424个非翻译核苷酸。进行引物延伸和S1核酸酶保护分析以确定小鼠基因的转录起始位点。将选择标记pSV2 neo与一个λ克隆(λMG - PG1)共转染大鼠 - 1成纤维细胞,以获得两个大鼠 - 1成纤维细胞系,它们的基因组中整合了小鼠分泌颗粒蛋白聚糖基因。对这两个细胞系的RNA印迹分析显示存在1.0 kb的分泌颗粒蛋白聚糖肽核心mRNA转录本,表明λMG - PG1包含整个小鼠分泌颗粒蛋白聚糖肽核心基因,包括其启动子区域中的一些调控元件。编码小鼠分泌颗粒蛋白聚糖肽核心的基因是第一个其完整外显子/内含子结构被确定并在另一种细胞类型中被转染和表达的蛋白聚糖基因。