Nollet S, Moniaux N, Maury J, Petitprez D, Degand P, Laine A, Porchet N, Aubert J P
INSERM Unité 377, Place de Verdun, 59045 Lille Cedex, France.
Biochem J. 1998 Jun 15;332 (Pt 3):739-48. doi: 10.1042/bj3320739.
In a previous study we isolated a partial cDNA containing a 48 bp tandem repeat, which allowed us to map a novel human mucin gene, MUC4, to chromosome 3q29. Here we report the organization and sequence of the 5'-region of MUC4 and its junction with the tandem repeat array. Analysis of three overlapping genomic clones allowed us to obtain a partial restriction map of MUC4 and to locate the complete 48 bp tandem repeat domain on a PstI/EcoRI genomic fragment whose size varies widely (7-19 kb) with the number of tandem repeats. cDNA clonal extension allowed us to obtain the entire 5' coding region of MUC4. Exon 1 consists of a 5' untranslated region and an 82 bp fragment encoding the signal peptide. The signal peptide shows a high degree of similarity to that of another apomucin, ASGP-1. Exon 2 is extremely large and contains a unique sequence followed by the whole tandem repeat domain. It encodes only one cysteine residue, which distinguishes MUC4 from the mucin genes of the 11p15.5 family. Moreover, an intron downstream from the tandem repeat array consists mainly of a 15 bp tandem repeat that is polymorphic in its number of repeat units.
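As a rough illustration of the scale implied by the 7-19 kb PstI/EcoRI fragment, the arithmetic below converts fragment size into a count of 48 bp repeat units. It is only a sketch: the function name is hypothetical, and it assumes the entire fragment consists of repeats, which the abstract does not state. Any flanking unique sequence on the fragment would lower the real counts, so the figures are upper bounds.

```python
def repeat_unit_bounds(fragment_min_bp: int, fragment_max_bp: int, unit_bp: int) -> tuple[int, int]:
    """Upper-bound repeat counts for the smallest and largest fragment sizes.

    Assumption (not from the abstract): the whole fragment is made of
    repeat units, with no flanking unique sequence.
    """
    if unit_bp <= 0:
        raise ValueError("unit_bp must be positive")
    # Integer division: only whole repeat units are counted.
    return fragment_min_bp // unit_bp, fragment_max_bp // unit_bp


# Fragment sizes (7-19 kb) and repeat unit length (48 bp) are from the abstract.
low, high = repeat_unit_bounds(7_000, 19_000, 48)
print(f"At most about {low} to {high} repeat units of 48 bp")
```

Under that assumption, the fragment size range corresponds to at most roughly 145 to 395 repeat units.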