Cheng Yong, Ma Zhihai, Kim Bong-Hyun, Wu Weisheng, Cayting Philip, Boyle Alan P, Sundaram Vasavi, Xing Xiaoyun, Dogan Nergiz, Li Jingjing, Euskirchen Ghia, Lin Shin, Lin Yiing, Visel Axel, Kawli Trupti, Yang Xinqiong, Patacsil Dorrelyn, Keller Cheryl A, Giardine Belinda, Kundaje Anshul, Wang Ting, Pennacchio Len A, Weng Zhiping, Hardison Ross C, Snyder Michael P
Department of Genetics, Stanford University, Stanford, CA 94305, USA.
Program in Bioinformatics and Integrative Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
Nature. 2014 Nov 20;515(7527):371-375. doi: 10.1038/nature13985.
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
为了拓宽我们对基因调控机制进化的理解,我们生成了人类-小鼠红系祖细胞、淋巴母细胞和胚胎干细胞系中34种直系同源转录因子(TF)的占据图谱。通过整合全基因组转录因子占据库、相关的表观遗传信号和共关联模式,我们在此推断出自小鼠和人类谱系分化以来起作用的基因调控特征的几个进化原则。TF占据序列的基因组分布图谱、主要结合基序、染色质状态和DNA甲基化偏好都得到了很好的保守。然而,直系同源DNA片段被直系同源TF结合的程度在TF之间以及基因组位置上都有所不同:启动子处的结合比远端元件处的结合更保守。值得注意的是,占据保守的TF占据序列往往具有多效性;它们在多个组织中发挥作用,并且还与许多TF共关联。具有潜在调控功能位点的单核苷酸变体在占据保守的TF占据序列中富集。