Zhuo Xiaoyu, Du Alan Y, Pehrsson Erica C, Li Daofeng, Wang Ting
Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri 63110, USA.
The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.
Genome Res. 2021 Feb;31(2):279-290. doi: 10.1101/gr.263491.120. Epub 2020 Dec 10.
Structural variation (SV), including insertions and deletions (indels), is a primary mechanism of genome evolution. However, the mechanism by which SV contributes to epigenome evolution is poorly understood. In this study, we characterized the association between lineage-specific indels and epigenome differences between human and chimpanzee to investigate how SVs might have shaped the epigenetic landscape. By intersecting medium-to-large human-chimpanzee indels (20 bp-50 kb) with putative promoters and enhancers in cranial neural crest cells (CNCCs) and repressed regions in induced pluripotent cells (iPSCs), we found that 12% of indels overlap putative regulatory and repressed regions (RRRs), and 15% of these indels are associated with lineage-biased RRRs. Indel-associated putative enhancer and repressive regions are approximately 1.3 times and approximately three times as likely to be lineage-biased, respectively, as those not associated with indels. We found a twofold enrichment of medium-sized indels (20-50 bp) in CpG island (CGI)-containing promoters than expected by chance. Lastly, from human-specific transposable element insertions, we identified putative regulatory elements, including NR2F1-bound putative CNCC enhancers derived from SVAs and putative iPSC promoters derived from LTR5s. Our results show that different types of indels are associated with specific epigenomic diversity between human and chimpanzee.
结构变异(SV),包括插入和缺失(indels),是基因组进化的主要机制。然而,人们对SV促成表观基因组进化的机制了解甚少。在本研究中,我们对人类和黑猩猩之间谱系特异性indels与表观基因组差异之间的关联进行了表征,以研究SVs可能如何塑造表观遗传景观。通过将中等至大型人类 - 黑猩猩indels(20 bp - 50 kb)与颅神经嵴细胞(CNCCs)中的推定启动子和增强子以及诱导多能细胞(iPSCs)中的抑制区域进行交叉分析,我们发现12%的indels与推定的调控和抑制区域(RRRs)重叠,其中15%的indels与谱系偏向的RRRs相关。与indels相关的推定增强子和抑制区域出现谱系偏向的可能性分别约为与indels不相关区域的1.3倍和约三倍。我们发现,在含CpG岛(CGI)的启动子中,中等大小的indels(20 - 50 bp)的富集程度比随机预期的高两倍。最后,从人类特异性转座元件插入中,我们鉴定出推定的调控元件,包括源自SVA的NR2F1结合的推定CNCC增强子和源自LTR5的推定iPSC启动子。我们的结果表明,不同类型的indels与人类和黑猩猩之间特定的表观基因组多样性相关。