Institute Curie, Paris Sciences-et-Lettres Research University, Institut National de la Santé et de la Recherche Médicale, U932, 75005 Paris.
Genomics Core, Baylor Scott & White Research Institute, Dallas, TX 75204.
Proc Natl Acad Sci U S A. 2020 Apr 7;117(14):7905-7916. doi: 10.1073/pnas.1912008117. Epub 2020 Mar 19.
Transposable elements (TEs) compose nearly half of mammalian genomes and provide building blocks for -regulatory elements. Using high-throughput sequencing, we show that 84 TE subfamilies are overrepresented, and distributed in a lineage-specific fashion in core and boundary domains of CD8 T cell enhancers. Endogenous retroviruses are most significantly enriched in core domains with accessible chromatin, and bear recognition motifs for immune-related transcription factors. In contrast, short interspersed elements (SINEs) are preferentially overrepresented in nucleosome-containing boundaries. A substantial proportion of these SINEs harbor a high density of the enhancer-specific histone mark H3K4me1 and carry sequences that match enhancer boundary nucleotide composition. Motifs with regulatory features are better preserved within enhancer-enriched TE copies compared to their subfamily equivalents located in gene deserts. TE-rich and TE-poor enhancers associate with both shared and unique gene groups and are enriched in overlapping functions related to lymphocyte and leukocyte biology. The majority of T cell enhancers are shared with other immune lineages and are accessible in common hematopoietic progenitors. A higher proportion of immune tissue-specific enhancers are TE-rich compared to enhancers specific to other tissues, correlating with higher TE occurrence in immune gene-associated genomic regions. Our results suggest that during evolution, TEs abundant in these regions and carrying motifs potentially beneficial for enhancer architecture and immune functions were particularly frequently incorporated by evolving enhancers. Their putative selection and regulatory cooption may have accelerated the evolution of immune regulatory networks.
转座元件 (TEs) 构成了近一半的哺乳动物基因组,为 - 调控元件提供了构建模块。我们使用高通量测序表明,84 个 TE 亚家族过表达,并以谱系特异性的方式分布在 CD8 T 细胞增强子的核心和边界区域。内源性逆转录病毒在具有可及染色质的核心区域中最为丰富,并且具有免疫相关转录因子的识别基序。相比之下,短散布元件 (SINEs) 优先在富含核小体的边界中过表达。这些 SINE 中有相当大的一部分含有高密度的增强子特异性组蛋白标记 H3K4me1,并携带与增强子边界核苷酸组成匹配的序列。与位于基因沙漠中的亚家族等效物相比,这些具有调节特征的基序在增强子富集的 TE 拷贝中得到更好的保留。富含 TE 和 TE 贫乏的增强子与共享和独特的基因群相关,并富集与淋巴细胞和白细胞生物学相关的重叠功能。大多数 T 细胞增强子与其他免疫谱系共享,并且在常见的造血祖细胞中可访问。与其他组织特异性增强子相比,免疫组织特异性增强子中富含 TE 的比例更高,这与免疫基因相关基因组区域中 TE 发生率较高相关。我们的研究结果表明,在进化过程中,这些区域中丰富的 TE 及其携带的可能对增强子结构和免疫功能有益的基序,被进化中的增强子特别频繁地整合。它们的潜在选择和调控共适应可能加速了免疫调控网络的进化。