Department of Biological Sciences, Indian Institute of Science Education and Research (IISER) - Mohali, Knowledge City, Sector 81, SAS Nagar, 140306, India.
Department of Animal Biology, School of Life Sciences, University of Hyderabad (UoH), Central University, Prof. CN Rao Road, P O, Gachibowli, Hyderabad, Telangana, 500046, India.
BMC Genomics. 2020 Feb 22;21(1):175. doi: 10.1186/s12864-020-6580-6.
Proximity ligation based techniques, like Hi-C, involve restriction digestion followed by ligation of formaldehyde cross-linked chromatin. Distinct chromatin states can impact the restriction digestion, and hence the visibility in the contact maps, of engaged loci. Yet, the extent and the potential impact of digestion bias remain obscure and under-appreciated in the literature.
Through analysis of 45 Hi-C datasets, lamina-associated domains (LADs), inactive X-chromosome in mammals, and polytene bands in fly, we first established that the DNA in condensed chromatin had lesser accessibility to restriction endonucleases used in Hi-C as compared to that in decondensed chromatin. The observed bias was independent of known systematic biases, was not appropriately corrected by existing computational methods, and needed an additional optimization step. We then repurposed this bias to identify novel condensed domains outside LADs, which were bordered by insulators and were dynamically associated with the polycomb mediated epigenetic and transcriptional states during development.
Our observations suggest that the corrected one-dimensional read counts of existing Hi-C datasets can be reliably repurposed to study the gene-regulatory dynamics associated with chromatin condensation and decondensation, and that the existing Hi-C datasets should be interpreted with cautions.
基于连接的邻近技术,如 Hi-C,涉及限制消化,然后连接甲醛交联的染色质。不同的染色质状态可以影响限制消化,从而影响接触图谱中参与的基因座的可见性。然而,在文献中,消化偏差的程度和潜在影响仍然模糊不清,没有得到充分的认识。
通过对 45 个 Hi-C 数据集的分析,我们首先确定了与去凝聚染色质相比,凝聚染色质中的 DNA 对用于 Hi-C 的限制内切酶的可及性较低。观察到的偏差与已知的系统性偏差无关,现有的计算方法没有适当纠正,并且需要额外的优化步骤。然后,我们重新利用这种偏差来识别 LAD 之外的新的凝聚域,这些域由绝缘子包围,并在发育过程中与多梳介导的表观遗传和转录状态动态相关。
我们的观察表明,现有的 Hi-C 数据集的修正一维读取计数可以可靠地重新用于研究与染色质凝聚和解凝聚相关的基因调控动态,并且应该谨慎解释现有的 Hi-C 数据集。