Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.
Frontier Institute of Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.
Am J Hum Genet. 2023 Aug 3;110(8):1266-1288. doi: 10.1016/j.ajhg.2023.07.002. Epub 2023 Jul 27.
Most of the single-nucleotide polymorphisms (SNPs) associated with insulin resistance (IR)-relevant phenotypes by genome-wide association studies (GWASs) are located in noncoding regions, complicating their functional interpretation. Here, we utilized an adapted STARR-seq to evaluate the regulatory activities of 5,987 noncoding SNPs associated with IR-relevant phenotypes. We identified 876 SNPs with biased allelic enhancer activity effects (baaSNPs) across 133 loci in three IR-relevant cell lines (HepG2, preadipocyte, and A673), which showed pervasive cell specificity and significant enrichment for cell-specific open chromatin regions or enhancer-indicative markers (H3K4me1, H3K27ac). Further functional characterization suggested several transcription factors (TFs) with preferential allelic binding to baaSNPs. We also incorporated multi-omics data to prioritize 102 candidate regulatory target genes for baaSNPs and revealed prevalent long-range regulatory effects and cell-specific IR-relevant biological functional enrichment on them. Specifically, we experimentally verified the distal regulatory mechanism at IRS1 locus, in which rs952227-A reinforces IRS1 expression by long-range chromatin interaction and preferential binding to the transcription factor HOXC6 to augment the enhancer activity. Finally, based on our STARR-seq screening data, we predicted the enhancer activity of 227,343 noncoding SNPs associated with IR-relevant phenotypes (fasting insulin adjusted for BMI, HDL cholesterol, and triglycerides) from the largest available GWAS summary statistics. We further provided an open resource (http://www.bigc.online/fnSNP-IR) for better understanding genetic regulatory mechanisms of IR-relevant phenotypes.
大多数通过全基因组关联研究(GWAS)与胰岛素抵抗(IR)相关表型相关的单核苷酸多态性(SNP)位于非编码区域,这使得它们的功能解释变得复杂。在这里,我们利用改良的 STARR-seq 来评估与 IR 相关表型相关的 5987 个非编码 SNP 的调节活性。我们在三个与 IR 相关的细胞系(HepG2、前脂肪细胞和 A673)中鉴定出了 133 个位置上具有偏向等位基因增强子活性效应(baaSNP)的 876 个 SNP,这些 SNP 表现出普遍的细胞特异性和对细胞特异性开放染色质区域或增强子指示标记(H3K4me1、H3K27ac)的显著富集。进一步的功能特征表明,几个转录因子(TF)具有对 baaSNP 的优先等位基因结合。我们还整合了多组学数据,对 baaSNP 的 102 个候选调节靶基因进行优先级排序,并揭示了它们普遍存在的长程调节效应和细胞特异性的与 IR 相关的生物学功能富集。具体来说,我们通过实验验证了 IRS1 基因座的远端调控机制,其中 rs952227-A 通过长程染色质相互作用增强 IRS1 的表达,并优先与转录因子 HOXC6 结合,增强增强子活性。最后,基于我们的 STARR-seq 筛选数据,我们从最大的可用 GWAS 汇总统计数据中预测了与 IR 相关表型(BMI、HDL 胆固醇和甘油三酯校正后的空腹胰岛素)相关的 227343 个非编码 SNP 的增强子活性。我们还提供了一个开放资源(http://www.bigc.online/fnSNP-IR),以帮助更好地理解与 IR 相关表型的遗传调控机制。