Department of Molecular Biotechnology and Health Sciences, University of Turin, 10126 Turin, Italy.
Center for Translational Genomics and Bioinformatics, San Raffaele Scientific Institute IRCCS, 20132 Milan, Italy.
Am J Hum Genet. 2018 Feb 1;102(2):207-218. doi: 10.1016/j.ajhg.2017.12.014. Epub 2018 Jan 18.
Genome expansion is believed to be an important driver of the evolution of gene regulation. To investigate the role of a newly arising sequence in rewiring regulatory networks, we estimated the age of each region of the human genome by applying maximum parsimony to genome-wide alignments with 100 vertebrates. We then studied the age distribution of several types of functional regions, with a focus on regulatory elements. The age distribution of regulatory elements reveals the extensive use of newly formed genomic sequence in the evolution of regulatory interactions. Many transcription factors have expanded their repertoire of targets through waves of genomic expansions that can be traced to specific evolutionary times. Repeated elements contributed a major part of such expansion: many classes of such elements are enriched in binding sites of one or a few specific transcription factors, whose binding sites are localized in specific portions of the element and characterized by distinctive motif words. These features suggest that the binding sites were available as soon as the new sequence entered the genome, rather than being created later by accumulation of point mutations. By comparing the age of regulatory regions to the evolutionary shift in expression of nearby genes, we show that rewiring through genome expansion played an important role in shaping human regulatory networks.
基因组扩张被认为是基因调控进化的重要驱动因素。为了研究新出现的序列在重新构建调控网络中的作用,我们通过对 100 种脊椎动物的全基因组比对应用最大简约法来估计人类基因组中每个区域的年龄。然后,我们研究了几种功能区域的年龄分布,重点是调控元件。调控元件的年龄分布揭示了在调控相互作用的进化过程中广泛利用新形成的基因组序列。许多转录因子通过可以追溯到特定进化时期的基因组扩张,扩大了其靶基因的范围。重复元件在这种扩张中起了主要作用:许多类别的重复元件在一个或几个特定转录因子的结合位点中富集,这些转录因子的结合位点定位于元件的特定部分,并具有独特的基序词。这些特征表明,这些结合位点在新序列进入基因组时就已经存在,而不是通过点突变的积累后来创建的。通过比较调控区域的年龄与附近基因表达的进化变化,我们表明,通过基因组扩张进行的重新布线在塑造人类调控网络方面发挥了重要作用。