Institute of Protein Research, Russian Academy of Sciences, Pushchino, Russia.
Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia.
Nat Commun. 2021 May 12;12(1):2751. doi: 10.1038/s41467-021-23007-0.
Sequence variants in gene regulatory regions alter gene expression and contribute to phenotypes of individual cells and the whole organism, including disease susceptibility and progression. Single-nucleotide variants in enhancers or promoters may affect gene transcription by altering transcription factor binding sites. Differential transcription factor binding in heterozygous genomic loci provides a natural source of information on such regulatory variants. We present a novel approach to call the allele-specific transcription factor binding events at single-nucleotide variants in ChIP-Seq data, taking into account the joint contribution of aneuploidy and local copy number variation, that is estimated directly from variant calls. We have conducted a meta-analysis of more than 7 thousand ChIP-Seq experiments and assembled the database of allele-specific binding events listing more than half a million entries at nearly 270 thousand single-nucleotide polymorphisms for several hundred human transcription factors and cell types. These polymorphisms are enriched for associations with phenotypes of medical relevance and often overlap eQTLs, making candidates for causality by linking variants with molecular mechanisms. Specifically, there is a special class of switching sites, where different transcription factors preferably bind alternative alleles, thus revealing allele-specific rewiring of molecular circuitry.
基因调控区域的序列变异会改变基因表达,并导致个体细胞和整个生物体的表型发生变化,包括疾病易感性和疾病进展。增强子或启动子中的单核苷酸变异可能通过改变转录因子结合位点来影响基因转录。杂合基因组位置中不同转录因子的结合提供了有关此类调控变异的自然信息来源。我们提出了一种新的方法,可以在 ChIP-Seq 数据中的单核苷酸变异处调用等位基因特异性转录因子结合事件,同时考虑到非整倍体和局部拷贝数变异的联合贡献,这些贡献可以直接从变异调用中估计。我们对超过 7000 个 ChIP-Seq 实验进行了荟萃分析,并组装了等位基因特异性结合事件数据库,其中列出了近 27 万个单核苷酸多态性的 50 多万个条目,涉及数百个人类转录因子和细胞类型。这些多态性与具有医学相关性的表型相关联,并且经常与 eQTL 重叠,通过将变体与分子机制联系起来,成为因果关系的候选者。具体来说,有一种特殊的开关位点,其中不同的转录因子更喜欢结合替代等位基因,从而揭示了分子电路的等位基因特异性重布线。