Sundaram Vasavi, Cheng Yong, Ma Zhihai, Li Daofeng, Xing Xiaoyun, Edge Peter, Snyder Michael P, Wang Ting
Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA;
Department of Genetics, Stanford University, Stanford, California 94305, USA;
Genome Res. 2014 Dec;24(12):1963-76. doi: 10.1101/gr.168872.113. Epub 2014 Oct 15.
Transposable elements (TEs) have been shown to contain functional binding sites for certain transcription factors (TFs). However, the extent to which TEs contribute to the evolution of TF binding sites is not well known. We comprehensively mapped binding sites for 26 pairs of orthologous TFs in two pairs of human and mouse cell lines (representing two cell lineages), along with epigenomic profiles, including DNA methylation and six histone modifications. Overall, we found that 20% of binding sites were embedded within TEs. This number varied across different TFs, ranging from 2% to 40%. We further identified 710 TF-TE relationships in which genomic copies of a TE subfamily contributed a significant number of binding peaks for a TF, and we found that LTR elements dominated these relationships in human. Importantly, TE-derived binding peaks were strongly associated with open and active chromatin signatures, including reduced DNA methylation and increased enhancer-associated histone marks. On average, 66% of TE-derived binding events were cell type-specific with a cell type-specific epigenetic landscape. Most of the binding sites contributed by TEs were species-specific, but we also identified binding sites conserved between human and mouse, the functional relevance of which was supported by a signature of purifying selection on DNA sequences of these TEs. Interestingly, several TFs had significantly expanded binding site landscapes only in one species, which were linked to species-specific gene functions, suggesting that TEs are an important driving force for regulatory innovation. Taken together, our data suggest that TEs have significantly and continuously shaped gene regulatory networks during mammalian evolution.
转座元件(TEs)已被证明含有某些转录因子(TFs)的功能性结合位点。然而,TEs对TF结合位点进化的贡献程度尚不清楚。我们全面绘制了两对人类和小鼠细胞系(代表两个细胞谱系)中26对直系同源TFs的结合位点,以及表观基因组图谱,包括DNA甲基化和六种组蛋白修饰。总体而言,我们发现20%的结合位点嵌入在TEs中。这个数字在不同的TFs中有所不同,范围从2%到40%。我们进一步确定了710种TF-TE关系,其中一个TE亚家族的基因组拷贝为一个TF贡献了大量的结合峰,并且我们发现LTR元件在人类中主导了这些关系。重要的是,TE衍生的结合峰与开放和活跃的染色质特征密切相关,包括DNA甲基化减少和增强子相关组蛋白标记增加。平均而言,66%的TE衍生的结合事件具有细胞类型特异性的表观遗传景观。TEs贡献的大多数结合位点是物种特异性的,但我们也确定了人类和小鼠之间保守的结合位点,这些位点的功能相关性得到了对这些TEs的DNA序列进行纯化选择的特征的支持。有趣的是,几个TFs仅在一个物种中具有显著扩展的结合位点景观,这与物种特异性基因功能相关,表明TEs是调控创新的重要驱动力。综上所述,我们的数据表明,TEs在哺乳动物进化过程中显著且持续地塑造了基因调控网络。