Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA.
Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA.
Genome Biol. 2021 Jun 29;22(1):193. doi: 10.1186/s13059-021-02409-9.
Transposable elements are increasingly recognized as a source of cis-regulatory variation. Previous studies have revealed that transposons are often bound by transcription factors and some have been co-opted into functional enhancers regulating host gene expression. However, the process by which transposons mature into complex regulatory elements, like enhancers, remains poorly understood. To investigate this process, we examined the contribution of transposons to the cis-regulatory network controlling circadian gene expression in the mouse liver, a well-characterized network serving an important physiological function.
ChIP-seq analyses reveal that transposons and other repeats contribute ~ 14% of the binding sites for core circadian regulators (CRs) including BMAL1, CLOCK, PER1/2, and CRY1/2, in the mouse liver. RSINE1, an abundant murine-specific SINE, is the only transposon family enriched for CR binding sites across all datasets. Sequence analyses and reporter assays reveal that the circadian regulatory activity of RSINE1 stems from the presence of imperfect CR binding motifs in the ancestral RSINE1 sequence. These motifs matured into canonical motifs through point mutations after transposition. Furthermore, maturation occurred preferentially within elements inserted in the proximity of ancestral CR binding sites. RSINE1 also acquired motifs that recruit nuclear receptors known to cooperate with CRs to regulate circadian gene expression specifically in the liver.
Our results suggest that the birth of enhancers from transposons is predicated both by the sequence of the transposon and by the cis-regulatory landscape surrounding their genomic integration site.
转座元件越来越被认为是顺式调控变异的来源。先前的研究表明,转座子通常被转录因子结合,有些已被内源性利用成为调节宿主基因表达的功能增强子。然而,转座子成熟为复杂调控元件(如增强子)的过程仍知之甚少。为了研究这个过程,我们研究了转座子对控制小鼠肝脏生物钟基因表达的顺式调控网络的贡献,该网络是一个具有重要生理功能的特征明确的网络。
ChIP-seq 分析显示,转座子和其他重复序列大约占了核心生物钟调控因子(CRs)BMAL1、CLOCK、PER1/2 和 CRY1/2 在小鼠肝脏中的结合位点的 14%。RSINE1 是一种丰富的、具有小鼠特异性的 SINE,是唯一在所有数据集的 CR 结合位点中富集的转座子家族。序列分析和报告基因检测表明,RSINE1 的生物钟调控活性源于其在祖先 RSINE1 序列中存在不完整的 CR 结合基序。这些基序在转座后通过点突变成熟为典型基序。此外,成熟过程优先发生在插入到祖先 CR 结合位点附近的元件中。RSINE1 还获得了招募核受体的基序,这些核受体已知与 CR 合作调节生物钟基因表达,特别是在肝脏中。
我们的结果表明,增强子的产生既取决于转座子的序列,也取决于其基因组整合位点周围的顺式调控景观。