Escande F, Aubert J P, Porchet N, Buisine M P
INSERM Unité 377, place de Verdun, 59045 Lille Cedex, France.
Biochem J. 2001 Sep 15;358(Pt 3):763-72. doi: 10.1042/0264-6021:3580763.
Human mucin gene MUC5AC is clustered with MUC2, MUC5B and MUC6 on chromosome 11p15.5. We report here the full length cDNA sequence upstream of the repetitive region of human MUC5AC. We have also determined the sequence of its large central tandem repeat array. The 5'-region reveals high degree of sequence similarity with MUC2 and MUC5B and codes for 1336 amino acids organized into a signal peptide, four pro-von Willebrand factor-like D domains (D1, D2, D' and D3) and a short domain which connects to the central repetitive region. In the central region, 17 major domains have been identified. Nine code for cysteine-rich domains (Cys-domains 1-9) and exhibit high sequence similarity to the cysteine-rich domains described in the central region of MUC2 and MUC5B. Cys-domains 1-5 are interspersed by domains enriched with serine, threonine, and proline residues. Cys-domains 1-9 are interspersed by four domains (TR1-TR4) composed of various numbers of MUC5AC-type repeats. Southern-blot analyses reveal allelic variations both in length and nucleotide sequence. The length polymorphism which is due to variable numbers of tandem repeats is located in TR1 and TR4, whereas a mutation polymorphism detected with TaqI is located in Cys-domain 6. In this study, the organization of MUC5AC has been entirely elucidated showing extensive similarity to the other chromosome 11p15 MUC genes, particularly MUC5B, and providing additional arguments for common evolution from a single ancestral gene.
人类粘蛋白基因MUC5AC与MUC2、MUC5B和MUC6聚集在11号染色体的p15.5区域。我们在此报告人类MUC5AC重复区域上游的全长cDNA序列。我们还确定了其大型中央串联重复序列阵列的序列。5'区域与MUC2和MUC5B具有高度的序列相似性,编码1336个氨基酸,这些氨基酸组成一个信号肽、四个类血管性血友病因子D结构域(D1、D2、D'和D3)以及一个连接到中央重复区域的短结构域。在中央区域,已鉴定出17个主要结构域。其中九个编码富含半胱氨酸的结构域(半胱氨酸结构域1-9),与MUC2和MUC5B中央区域描述的富含半胱氨酸的结构域具有高度的序列相似性。半胱氨酸结构域1-5之间散布着富含丝氨酸、苏氨酸和脯氨酸残基的结构域。半胱氨酸结构域1-9之间散布着四个由不同数量的MUC5AC型重复序列组成的结构域(TR1-TR4)。Southern杂交分析揭示了长度和核苷酸序列上的等位基因变异。由于串联重复序列数量可变导致的长度多态性位于TR1和TR4中,而用TaqI检测到的突变多态性位于半胱氨酸结构域6中。在本研究中,MUC5AC的组织已完全阐明,显示出与其他11号染色体p15 MUC基因,特别是MUC5B有广泛的相似性,并为从单个祖先基因的共同进化提供了额外的证据。