Petersen Rachel M, Vockley Christopher M, Lea Amanda J
Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee 37232, USA.
Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee 37232, USA.
Genome Res. 2025 Aug 1;35(8):1781-1793. doi: 10.1101/gr.279957.124.
A major goal in evolutionary biology and biomedicine is to understand the complex interactions between genetic variants, the epigenome, and gene expression. However, the causal relationships between these factors remain poorly understood. mSTARR-seq, a methylation-sensitive massively parallel reporter assay, is capable of identifying methylation-dependent regulatory activity at many thousands of genomic regions simultaneously and allows for the testing of causal relationships between DNA methylation and gene expression on a region-by-region basis. Here, we develop a multiplexed mSTARR-seq protocol to assay naturally occurring human genetic variation from 25 individuals from 10 localities in Europe and Africa. We identify 6957 regulatory elements in either the unmethylated or methylated state, and this set was enriched for enhancer and promoter chromatin annotations, as expected. The expression of 58% of these regulatory elements is modulated by methylation, which is generally associated with decreased transcription. Within our set of regulatory elements, we use allele-specific expression analyses to identify 8020 sites with genetic effects on gene regulation; further, we find that 42.3% of these genetic effects vary in direction or magnitude between methylated and unmethylated states. Sites exhibiting methylation-dependent genetic effects are enriched for GWAS and EWAS annotations, implicating them in human disease. Compared with data sets that assay DNA from a single European ancestry individual, our multiplexed assay is able to uncover more genetic effects and methylation-dependent genetic effects, highlighting the importance of including diverse genomes in assays that aim to understand gene regulatory processes.
进化生物学和生物医学的一个主要目标是了解基因变异、表观基因组和基因表达之间的复杂相互作用。然而,这些因素之间的因果关系仍知之甚少。mSTARR-seq是一种甲基化敏感的大规模平行报告基因检测方法,能够同时在数千个基因组区域鉴定甲基化依赖性调控活性,并允许逐区域测试DNA甲基化与基因表达之间的因果关系。在这里,我们开发了一种多重mSTARR-seq方案,以检测来自欧洲和非洲10个地区的25个人的自然发生的人类遗传变异。我们在未甲基化或甲基化状态下鉴定出6957个调控元件,正如预期的那样,这一组元件富含增强子和启动子染色质注释。这些调控元件中58%的表达受甲基化调节,这通常与转录减少有关。在我们的调控元件集合中,我们使用等位基因特异性表达分析来鉴定8020个对基因调控有遗传效应的位点;此外,我们发现这些遗传效应中有42.3%在甲基化和未甲基化状态之间的方向或大小有所不同。表现出甲基化依赖性遗传效应的位点富含全基因组关联研究(GWAS)和全表观基因组关联研究(EWAS)注释,表明它们与人类疾病有关。与检测来自单一欧洲血统个体DNA的数据集相比,我们的多重检测能够发现更多的遗传效应和甲基化依赖性遗传效应,突出了在旨在了解基因调控过程的检测中纳入多样化基因组的重要性。