Department of Genome Sciences, University of Washington, Seattle, WA, USA.
Altius Institute for Biomedical Sciences, Seattle, WA, USA.
Nat Commun. 2021 May 14;12(1):2850. doi: 10.1038/s41467-021-23139-3.
Functional assessment of disease-associated sequence variation at non-coding regulatory elements is complicated by their high degree of context sensitivity to both the local chromatin and nuclear environments. Allelic profiling of DNA accessibility across individuals has shown that only a select minority of sequence variation affects transcription factor (TF) occupancy, yet low sequence diversity in human populations means that no experimental assessment is available for the majority of disease-associated variants. Here we describe high-resolution in vivo maps of allelic DNA accessibility in liver, kidney, lung and B cells from 5 increasingly diverged strains of F1 hybrid mice. The high density of heterozygous sites in these hybrids enables precise quantification of effect size and cell-type specificity for hundreds of thousands of variants throughout the mouse genome. We show that chromatin-altering variants delineate characteristic sensitivity profiles for hundreds of TF motifs. We develop a compendium of TF-specific sensitivity profiles accounting for genomic context effects. Finally, we link maps of allelic accessibility to allelic transcript levels in the same samples. This work provides a foundation for quantitative prediction of cell-type specific effects of non-coding variation on TF activity, which will facilitate both fine-mapping and systems-level analyses of common disease-associated variation in human genomes.
功能评估疾病相关序列变异在非编码调控元件是复杂的,因为它们对局部染色质和核环境具有高度的上下文敏感性。个体间 DNA 可及性的等位基因分析表明,只有少数序列变异会影响转录因子 (TF) 的占据,但人类群体中的低序列多样性意味着大多数与疾病相关的变异都没有实验评估。在这里,我们描述了来自 5 个越来越分化的 F1 杂交小鼠的肝脏、肾脏、肺和 B 细胞中等位基因 DNA 可及性的高分辨率体内图谱。这些杂种中的杂合位点密度很高,能够精确量化数十万种变体在整个小鼠基因组中的效应大小和细胞类型特异性。我们表明,改变染色质的变体为数百个 TF 基序描绘了特征性的敏感性特征。我们开发了一个包含基因组上下文效应的 TF 特异性敏感性特征的汇编。最后,我们将等位基因可及性图谱与同一样本中的等位基因转录水平联系起来。这项工作为定量预测非编码变异对 TF 活性的细胞类型特异性影响提供了基础,这将有助于人类基因组中常见疾病相关变异的精细映射和系统水平分析。