Feng Zhen-Xing, Li Qian-Zhong, Meng Jian-Jun
Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China.
Laboratory of Theoretical Biophysics, School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China; The State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Inner Mongolia University, Hohhot, 010070, China.
J Theor Biol. 2018 May 14;445:136-150. doi: 10.1016/j.jtbi.2018.02.023. Epub 2018 Feb 21.
The enhancer-promoter interactions (EPIs) with strong tissue-specificity play an important role in cis-regulatory mechanism of human cell lines. However, it still remains a challenging work to predict these interactions so far. Due to that these interactions are regulated by the cooperativeness of diverse functional genomic signatures, DNA spatial structure and DNA sequence elements. In this paper, by adding DNA structure properties and transcription factor binding motifs, we presented an improved computational method to predict EPIs in human cell lines. In comparison with the results of other group on the same datasets, our best accuracies by cross-validation test were about 15%-24% higher in the same cell lines, and the accuracies by independent test were about 11%-15% higher in new cell lines. Meanwhile, we found that transcription factor binding motifs and DNA structure properties have important information that would largely determine long range EPIs prediction. From the distribution comparisons, we also found their distinct differences between interacting and non-interacting sets in each cell line. Then, the correlation analysis and network models for relationships among top-ranked functional genomic signatures indicated that diverse genomic signatures would cooperatively establish a complex regulatory network to facilitate long range EPIs. The experimental results provided additional insights about the roles of DNA intrinsic properties and functional genomic signatures in EPIs prediction.
具有强组织特异性的增强子-启动子相互作用(EPI)在人类细胞系的顺式调控机制中发挥着重要作用。然而,迄今为止预测这些相互作用仍然是一项具有挑战性的工作。这是因为这些相互作用受多种功能基因组特征、DNA空间结构和DNA序列元件的协同作用调控。在本文中,通过添加DNA结构特性和转录因子结合基序,我们提出了一种改进的计算方法来预测人类细胞系中的EPI。与其他团队在相同数据集上的结果相比,在相同细胞系中,我们通过交叉验证测试得到的最佳准确率高出约15%-24%,在新细胞系中通过独立测试得到的准确率高出约11%-15%。同时,我们发现转录因子结合基序和DNA结构特性具有重要信息,这些信息在很大程度上决定了远距离EPI的预测。从分布比较中,我们还发现了每个细胞系中相互作用组和非相互作用组之间的明显差异。然后,对排名靠前的功能基因组特征之间的关系进行的相关性分析和网络模型表明,多种基因组特征将协同建立一个复杂的调控网络,以促进远距离EPI。实验结果为DNA内在特性和功能基因组特征在EPI预测中的作用提供了更多见解。