Brand Colin M, Kuang Shuzhen, Gilbertson Erin N, McArthur Evonne, Pollard Katherine S, Webster Timothy H, Capra John A
Bakar Computational Health Sciences Institute, University of California, San Francisco, CA.
Department of Epidemiology and Biostatistics, University of California, San Francisco, CA.
bioRxiv. 2023 Oct 26:2023.10.26.564272. doi: 10.1101/2023.10.26.564272.
Phenotypic divergence between closely related species, including bonobos and chimpanzees (genus Pan), is largely driven by variation in gene regulation. The 3D structure of the genome mediates gene expression; however, genome folding differences in Pan are not well understood. Here, we apply machine learning to predict genome-wide 3D genome contact maps from DNA sequence for 56 bonobos and chimpanzees, encompassing all five extant lineages. We use a pairwise approach to estimate 3D divergence between individuals from the resulting contact maps in 4,420 1 Mb genomic windows. While most pairs were similar, about 17% were predicted to be substantially divergent in genome folding. The most dissimilar maps were largely driven by single individuals with rare variants that produce unique 3D genome folding in a region. We also identified 89 genomic windows where bonobo and chimpanzee contact maps substantially diverged, including several windows harboring genes associated with traits implicated in phenotypic divergence. We used in silico mutagenesis to identify 51 3D-modifying variants in these bonobo-chimpanzee divergent windows, finding that 34 (66.67%) induce genome folding changes via CTCF binding motif disruption. Our results reveal 3D genome variation at the population level and identify genomic regions where changes in 3D folding may contribute to phenotypic differences in our closest living relatives.
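The pairwise comparison of contact maps described above can be illustrated with a minimal sketch. The abstract does not state which divergence metric was used, so this example assumes one plausible choice: 1 minus the Spearman rank correlation between the flattened contact maps of two individuals in the same window. The function names (`ranks`, `spearman`, `divergence`, `pairwise_divergence`) and the toy 2x2 maps are hypothetical and are not taken from the study.

```python
from itertools import combinations
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on ranks.

    Undefined (division by zero) if either input is constant.
    """
    rx, ry = ranks(x), ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def divergence(map_a, map_b):
    """Assumed metric: 1 - Spearman rho over all cells of two maps."""
    flat_a = [v for row in map_a for v in row]
    flat_b = [v for row in map_b for v in row]
    return 1.0 - spearman(flat_a, flat_b)


def pairwise_divergence(maps):
    """Compute divergence for every unordered pair of individuals."""
    return {
        (a, b): divergence(maps[a], maps[b])
        for a, b in combinations(sorted(maps), 2)
    }


# Toy contact maps for one genomic window (hypothetical values).
toy_maps = {
    "ind1": [[1, 2], [3, 4]],
    "ind2": [[2, 4], [6, 8]],  # same rank order as ind1
    "ind3": [[4, 3], [2, 1]],  # reversed rank order
}
scores = pairwise_divergence(toy_maps)
```

Under this assumed metric, identical rank structure gives a divergence of 0 and fully reversed structure gives 2. With 56 individuals, each window would involve 1,540 unordered pairs.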