Graduate Program in Cellular and Molecular Biology, University of Michigan Medical School, Ann Arbor, MI, USA.
Neurosciences Graduate Program, University of Michigan Medical School, Ann Arbor, MI, USA.
BMC Biol. 2023 Aug 23;21(1):179. doi: 10.1186/s12915-023-01673-4.
The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms-as diverse as bats, dolphins, and elephants-in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these "outward-looking" families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression.
Here, we show that these tandem arrays of outward-looking genes occupy AT-biased isochores and comprise the "tissue-specific" gene class that lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT- versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and allelic diversity, we use population genetic data and comparative analysis. First, we find that AT bias can emerge during evolutionary expansion of gene families in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for non-synonymous variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination.
Our analyses suggest that tolerance of non-synonymous variation and low recombination are two forces that have produced the depletion of GC bases in outward-facing gene arrays. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.
哺乳动物在恐龙灭绝时的辐射产生了大量的新形式,如蝙蝠、海豚和大象,仅在 1000 万到 2000 万年之间。在幕后,对新生态位的适应伴随着大量基因家族的广泛创新,这些基因使动物能够与环境接触,包括化学感受器、外源性酶和免疫及屏障蛋白。这些“外向型”家族中的基因在人类中存在等位基因多样性,并表现出组织特异性,有时还具有随机表达。
在这里,我们表明,这些外向型基因的串联阵列占据了 AT 偏向的同型异位区,并构成了“组织特异性”基因类,它们的启动子中缺乏 CpG 岛。哺乳动物基因组进化模型没有纳入 AT 与 GC 偏向区域中基因的功能和转录模式的显著差异。为了研究基因家族扩张、序列内容和等位基因多样性之间的关系,我们使用了群体遗传学数据和比较分析。首先,我们发现,基因家族在顺式进化扩张过程中会出现 AT 偏向。其次,在 AT 偏向的同型异位区或 GC 含量低的启动子中的人类基因,今天经历的新点突变率相对较低,但富含非同义变异。最后,我们发现包含基因簇的同型异位区表现出低的重组率。
我们的分析表明,非同义变异的耐受性和低重组率是导致外向型基因阵列中 GC 碱基耗竭的两种力量。反过来,高 AT 含量对其染色质组织和转录调控产生了深远的影响。