Department of Biological Sciences, State University of New York at Buffalo, New York 14260, USA.
Institute of Computer Science (ICS), Foundation of Research and Technology-Hellas, Heraklion, Crete, Greece.
Sci Rep. 2016 Aug 25;6:31791. doi: 10.1038/srep31791.
Genomic structural variants constitute the majority of variable base pairs in primate genomes and affect gene function in multiple ways. While whole gene duplications and deletions are relatively well-studied, the biology of subexonic (i.e., within coding exon sequences), copy number variation remains elusive. The salivary MUC7 gene provides an opportunity for studying such variation, as it harbors copy number variable subexonic repeat sequences that encode for densely O-glycosylated domains (PTS-repeats) with microbe-binding properties. To understand the evolution of this gene, we analyzed mammalian and primate genomes within a comparative framework. Our analyses revealed that (i) MUC7 has emerged in the placental mammal ancestor and rapidly gained multiple sites for O-glycosylation; (ii) MUC7 has retained its extracellular activity in saliva in placental mammals; (iii) the anti-fungal domain of the protein was remodified under positive selection in the primate lineage; and (iv) MUC7 PTS-repeats have evolved recurrently and under adaptive constraints. Our results establish MUC7 as a major player in salivary adaptation, likely as a response to diverse pathogenic exposure in primates. On a broader scale, our study highlights variable subexonic repeats as a primary source for modular evolutionary innovation that lead to rapid functional adaptation.
基因组结构变异构成了灵长类基因组中大多数可变碱基对,并以多种方式影响基因功能。虽然全基因重复和缺失已经得到了相对较好的研究,但亚外显子(即在编码外显子序列内)的拷贝数变异仍然难以捉摸。唾液 MUC7 基因为研究这种变异提供了机会,因为它含有拷贝数可变的亚外显子重复序列,这些序列编码具有微生物结合特性的高度 O-糖基化结构域(PTS 重复)。为了了解该基因的进化,我们在比较框架内分析了哺乳动物和灵长类动物的基因组。我们的分析表明:(i) MUC7 出现在胎盘哺乳动物的祖先中,并迅速获得了多个 O-糖基化位点;(ii) MUC7 在胎盘哺乳动物的唾液中保留了其细胞外活性;(iii) 该蛋白的抗真菌结构域在灵长类动物谱系中受到正选择的重塑;(iv) MUC7 PTS 重复序列经历了反复的适应性进化。我们的研究结果确立了 MUC7 作为唾液适应性的主要参与者,可能是对灵长类动物中多种致病性暴露的反应。从更广泛的角度来看,我们的研究强调了可变亚外显子重复作为导致快速功能适应的模块化进化创新的主要来源。