Department of Medical and Molecular Genetics, King's College London, London SE1 9RT, UK.
Int J Mol Sci. 2020 Nov 9;21(21):8383. doi: 10.3390/ijms21218383.
Prioritizing tag-SNPs carried on extended risk haplotypes at susceptibility loci for common disease is a challenge. We utilized trans-ancestral exclusion mapping to reduce risk haplotypes at and identified in multiple ancestries from SLE GWAS and ImmunoChip datasets. We characterized functional annotation data across each risk haplotype from publicly available datasets including ENCODE, RoadMap Consortium, PC Hi-C data from 3D genome browser, NESDR NTR conditional eQTL database, GeneCards Genehancers and TF (transcription factor) binding sites from Haploregv4. We refined the 60 kb associated haplotype upstream of to just 12 tag-SNPs tagging a 47.7 kb core risk haplotype. There was preferential enrichment of DNAse I hypersensitivity and H3K27ac modification across the 3' end of the risk haplotype, with four tag-SNPs sharing allele-specific TF binding sites with promoter variants, which are eQTLs for in whole blood. At , we refined a core risk haplotype of 101 kb (27 tag-SNPs) from an initial extended haplotype of 194 kb (282 tag-SNPs), which had widespread DNAse I hypersensitivity, H3K27ac modification and multiple allele-specific TF binding sites. Dimerization of Fox family TFs bound at the 3' and promoter of may stabilize chromatin looping across the locus. We combined trans-ancestral exclusion mapping and epigenetic annotation to identify variants at both and with the highest likelihood of biological relevance. The approach will be of strong interest to other complex trait geneticists seeking to attribute biological relevance to risk alleles on extended risk haplotypes in their disease of interest.
优先考虑在常见疾病易感基因座上的扩展风险单倍型上携带的标签 SNPs 是一个挑战。我们利用跨祖先排除映射来减少在来自 SLE GWAS 和 ImmunoChip 数据集的多个祖源中鉴定出的 和 风险单倍型。我们从公开可用的数据集(包括 ENCODE、RoadMap 联盟、来自 3D 基因组浏览器的 PC Hi-C 数据、NESDR NTR 条件 eQTL 数据库、GeneCards Genehancers 和 Haploregv4 中的 TF(转录因子)结合位点)中对每个风险单倍型的功能注释数据进行了特征描述。我们将 上游与 相关的 60 kb 相关单倍型缩小到仅 12 个标记 SNPs,标记了一个 47.7 kb 的核心风险单倍型。在风险单倍型的 3' 端,DNAse I 超敏性和 H3K27ac 修饰存在优先富集,四个标记 SNPs 与启动子变体共享等位基因特异性 TF 结合位点,这些变体是全血中 的 eQTL。在 ,我们从最初的 194 kb (282 个标记 SNPs)扩展单倍型中精炼出一个 101 kb 的核心风险单倍型(27 个标记 SNPs),该单倍型具有广泛的 DNAse I 超敏性、H3K27ac 修饰和多个等位基因特异性 TF 结合位点。Fox 家族 TF 的二聚体结合在 和 的 3' 和启动子上,可能稳定跨越该基因座的染色质环化。我们将跨祖先排除映射和表观遗传注释相结合,以识别 和 上最有可能具有生物学相关性的变体。这种方法将受到其他复杂性状遗传学家的强烈关注,他们希望将与疾病相关的扩展风险单倍型上的风险等位基因归因于生物学相关性。