Beijing Institute of Radiation Medicine, Beijing, China.
College of Computer, National University of Defence Technology, Changsha, China.
PLoS Comput Biol. 2020 Feb 21;16(2):e1007287. doi: 10.1371/journal.pcbi.1007287. eCollection 2020 Feb.
Hi-C is commonly used to study three-dimensional genome organization. However, due to the high sequencing cost and technical constraints, the resolution of most Hi-C datasets is coarse, resulting in a loss of information and biological interpretability. Here we develop DeepHiC, a generative adversarial network, to predict high-resolution Hi-C contact maps from low-coverage sequencing data. We demonstrated that DeepHiC is capable of reproducing high-resolution Hi-C data from as few as 1% downsampled reads. Empowered by adversarial training, our method can restore fine-grained details similar to those in high-resolution Hi-C matrices, boosting accuracy in chromatin loops identification and TADs detection, and outperforms the state-of-the-art methods in accuracy of prediction. Finally, application of DeepHiC to Hi-C data on mouse embryonic development can facilitate chromatin loop detection. We develop a web-based tool (DeepHiC, http://sysomics.com/deephic) that allows researchers to enhance their own Hi-C data with just a few clicks.
Hi-C 通常用于研究三维基因组组织。然而,由于测序成本高和技术限制,大多数 Hi-C 数据集的分辨率较低,导致信息和生物学可解释性的损失。在这里,我们开发了 DeepHiC,一种生成对抗网络,从低覆盖测序数据中预测高分辨率的 Hi-C 互作图。我们证明,DeepHiC 能够从低至 1%的下采样读取中重现高分辨率的 Hi-C 数据。通过对抗训练,我们的方法可以恢复类似于高分辨率 Hi-C 矩阵中的细粒度细节,提高染色质环识别和 TAD 检测的准确性,并在预测准确性方面优于最先进的方法。最后,将 DeepHiC 应用于小鼠胚胎发育的 Hi-C 数据可以促进染色质环检测。我们开发了一个基于网络的工具(DeepHiC,http://sysomics.com/deephic),研究人员只需点击几下即可使用该工具增强自己的 Hi-C 数据。