IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):647-656. doi: 10.1109/TCBB.2018.2865349. Epub 2018 Aug 15.
The recently developed Hi-C technology enables a genome-wide view of chromosome spatial organizations, and has shed deep insights into genome structure and genome function. However, multiple sources of uncertainties make downstream data analysis and interpretation challenging. Specifically, statistical models for inferring three-dimensional (3D) chromosomal structure from Hi-C data are far from their maturity. Most existing methods are highly over-parameterized, lacking clear interpretations, and sensitive to outliers. In this study, we propose a parsimonious, easy to interpret, and robust piecewise helical model for the inference of 3D chromosomal structure of individual topologically associated domain from Hi-C data. When applied to a real Hi-C dataset, the piecewise helical model not only achieves much better model fitting than existing models, but also reveals that geometric properties of chromatin spatial organization are closely related to genome function.
最近开发的 Hi-C 技术能够全面观察染色体的空间结构,并深入了解基因组结构和功能。然而,多种不确定因素使得下游数据分析和解释具有挑战性。具体来说,从 Hi-C 数据推断三维(3D)染色体结构的统计模型远未成熟。大多数现有方法高度超参数化,缺乏明确的解释,并且对离群值敏感。在这项研究中,我们提出了一种简洁、易于解释和稳健的分段螺旋模型,用于从 Hi-C 数据推断个体拓扑相关结构域的 3D 染色体结构。当应用于真实的 Hi-C 数据集时,分段螺旋模型不仅比现有模型实现了更好的模型拟合,还揭示了染色质空间组织的几何性质与基因组功能密切相关。