Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA.
Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA.
Mol Biol Evol. 2021 Aug 23;38(9):3681-3696. doi: 10.1093/molbev/msab138.
Despite the importance of gene regulatory enhancers in human biology and evolution, we lack a comprehensive model of enhancer evolution and function. This substantially limits our understanding of the genetic basis of species divergence and our ability to interpret the effects of noncoding variants on human traits. To explore enhancer sequence evolution and its relationship to regulatory function, we traced the evolutionary origins of transcribed human enhancer sequences with activity across diverse tissues and cellular contexts from the FANTOM5 consortium. The transcribed enhancers are enriched for sequences of a single evolutionary age ("simple" evolutionary architectures) compared with enhancers that are composites of sequences of multiple evolutionary ages ("complex" evolutionary architectures), likely indicating constraint against genomic rearrangements. Complex enhancers are older, more pleiotropic, and more active across species than simple enhancers. Genetic variants within complex enhancers are also less likely to associate with human traits and biochemical activity. Transposable-element-derived sequences (TEDS) have made diverse contributions to enhancers of both architectures; the majority of TEDS are found in enhancers with simple architectures, while a minority have remodeled older sequences to create complex architectures. Finally, we compare the evolutionary architectures of transcribed enhancers with histone-mark-defined enhancers. Our results reveal that most human transcribed enhancers are ancient sequences of a single age, and thus the evolution of most human enhancers was not driven by increases in evolutionary complexity over time. Our analyses further suggest that considering enhancer evolutionary histories provides context that can aid interpretation of the effects of variants on enhancer function. Based on these results, we propose a framework for analyzing enhancer evolutionary architecture.
尽管基因调控增强子在人类生物学和进化中具有重要意义,但我们缺乏对增强子进化和功能的全面了解。这极大地限制了我们对物种分化遗传基础的理解,以及我们解释非编码变异对人类特征影响的能力。为了探索增强子序列进化及其与调控功能的关系,我们追踪了 FANTOM5 联盟具有跨多种组织和细胞背景活性的转录人类增强子序列的进化起源。与由多个进化时期的序列组成的增强子(“复杂”进化结构)相比,转录增强子富含单一进化时期的序列(“简单”进化结构),这可能表明对基因组重排的限制。复杂增强子比简单增强子更古老、更多效性,在物种间也更活跃。复杂增强子内的遗传变异与人类特征和生化活性的关联也较少。转座元件衍生序列(TEDs)对两种结构的增强子都有多种贡献;大多数 TEDs 存在于具有简单结构的增强子中,而少数 TEDs 则重塑了较旧的序列以创建复杂结构。最后,我们比较了转录增强子与组蛋白标记定义的增强子的进化结构。我们的结果表明,大多数人类转录增强子是单一年龄的古老序列,因此,大多数人类增强子的进化不是随着时间的推移而增加进化复杂性驱动的。我们的分析还表明,考虑增强子进化历史可以提供有助于解释变异对增强子功能影响的背景。基于这些结果,我们提出了一种分析增强子进化结构的框架。