Human mucin gene MUC5B, the 10.7-kb large central exon encodes various alternate subdomains resulting in a super-repeat. Structural evidence for a 11p15.5 gene family.

Author information

Desseyn J L, Guyonnet-Dupérat V, Porchet N, Aubert J P, Laine A

Affiliation

INSERM 377 Laboratoire Gérard Biserte, place de Verdun, 59045 Lille Cedex, France.

Publication information

J Biol Chem. 1997 Feb 7;272(6):3168-78. doi: 10.1074/jbc.272.6.3168.

Abstract

The human mucin gene MUC5B maps to chromosome 11p15.5, where it is clustered with MUC6, MUC2, and MUC5AC. We report here the isolation of three overlapping genomic clones of human MUC5B spanning approximately 40 kilobases. We have determined their partial restriction maps and the intron-exon boundaries of the central region, which encodes a single open reading frame. This coding region has been completely sequenced. Its length is 10,713 base pairs, and it encodes a 3570-amino acid peptide. Nineteen subdomains have been identified. Some subdomains show similarity to each other, creating larger composite repeat units that we have called super-repeats. Four super-repeats of 528 amino acid residues are thus observed within the central exon. Each comprises (i) a subdomain composed of 11 copies of an irregular repeat of 29 amino acid residues, (ii) a unique conserved subdomain with no typical repeat, and (iii) a cysteine-rich subdomain. This last subdomain has high sequence similarity to the cysteine-rich domains described in MUC2 and MUC5AC. Sequence data for these three genes, together with their clustered organization, lead us to suggest that they may be part of a multigene family. The super-repeat present in MUC5B is the largest determined so far in mucin genes, and the central exon of this gene is, by far, the largest reported for a vertebrate gene.
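
The sizes quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is only an illustration built from the stated figures: the constant and function names are hypothetical, and because the 29-residue repeat is described as irregular, the per-subdomain lengths it derives are nominal values, not measurements from the sequence.

```python
# Figures quoted in the abstract (the names are illustrative, not from the paper).
EXON_BP = 10_713            # length of the central exon, base pairs
PEPTIDE_AA = 3_570          # length of the encoded peptide, amino acids
SUPER_REPEATS = 4           # number of super-repeats in the central exon
SUPER_REPEAT_AA = 528       # residues per super-repeat
REPEATS_PER_SUBDOMAIN = 11  # copies of the irregular repeat per subdomain
REPEAT_UNIT_AA = 29         # residues per irregular repeat (nominal length)


def summarize():
    """Derive simple totals from the abstract's stated sizes."""
    codons, leftover_bp = divmod(EXON_BP, 3)
    super_total = SUPER_REPEATS * SUPER_REPEAT_AA
    repeat_subdomain = REPEATS_PER_SUBDOMAIN * REPEAT_UNIT_AA
    return {
        # Whole codons that fit in the exon, and any remaining bases.
        "codons_in_exon": codons,
        "leftover_bp": leftover_bp,
        # Difference between that codon count and the stated peptide length;
        # the abstract does not say how this difference arises.
        "codons_minus_peptide": codons - PEPTIDE_AA,
        # Residues covered by the four super-repeats, and their share.
        "super_repeat_aa_total": super_total,
        "super_repeat_fraction": super_total / PEPTIDE_AA,
        # Nominal size of subdomain (i), and what remains for (ii) + (iii).
        "nominal_repeat_subdomain_aa": repeat_subdomain,
        "nominal_other_subdomains_aa": SUPER_REPEAT_AA - repeat_subdomain,
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value}")
```

On these figures, the four super-repeats account for 2112 of the 3570 residues, a little under 60% of the peptide encoded by the central exon.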
