Bobek L A, Tsai H, Biesbrock A R, Levine M J
Department of Oral Biology, School of Dental Medicine, State University of New York at Buffalo 14214.
J Biol Chem. 1993 Sep 25;268(27):20563-9.
Previous biochemical studies have determined that human saliva contains high and low molecular weight mucin glycoproteins (MG1 and MG2, respectively) that are structurally distinct. In this study, we describe the isolation and characterization of overlapping cDNA clones that encode the MG2 protein core. DNA sequencing revealed a translated region of 1131 nucleotides encoding a protein of 377 amino acid residues with a molecular mass of 39 kDa. The first 20 N-terminal residues were very hydrophobic and probably comprise the MG2 leader peptide. The region encoding the secreted protein can be divided into three distinct domains: unique 5'- and 3'-translated regions containing 4 and 1 potential N-glycosylation sites, respectively, and a central region of six almost perfect tandem repeats of 23 amino acid residues with a high number of Thr and Ser. No sequence homology with any other human or animal mucin, and no significant homology to any other protein, was found. MG2 mRNA is about 2.5 kilobases long, and its expression appears to be species-, tissue-, and cell-specific. We propose to name this gene MUC7, in accordance with the mucin genes cloned to date, which are named MUC1-MUC6.