Batmanov Kirill, Wang Junbai
Department of Pathology, Oslo University Hospital-Norwegian Radium Hospital, Montebello, 0310 Oslo,Norway.
Genes (Basel). 2017 Sep 18;8(9):233. doi: 10.3390/genes8090233.
DNA shape readout is an important mechanism of transcription factor target site recognition, in addition to the sequence readout. Several machine learning-based models of transcription factor-DNA interactions, considering DNA shape features, have been developed in recent years. Here, we present a new biophysical model of protein-DNA interactions by integrating the DNA shape properties. It is based on the neighbor dinucleotide dependency model BayesPI2, where new parameters are restricted to a subspace spanned by the dinucleotide form of DNA shape features. This allows a biophysical interpretation of the new parameters as a position-dependent preference towards specific DNA shape features. Using the new model, we explore the variation of DNA shape preferences in several transcription factors across various cancer cell lines and cellular conditions. The results reveal that there are DNA shape variations at FOXA1 (Forkhead Box Protein A1) binding sites in steroid-treated MCF7 cells. The new biophysical model is useful for elucidating the finer details of transcription factor-DNA interaction, as well as for predicting cancer mutation effects in the future.
除了序列识别外,DNA形状识别是转录因子靶位点识别的重要机制。近年来,已经开发了几种基于机器学习的考虑DNA形状特征的转录因子与DNA相互作用模型。在这里,我们通过整合DNA形状特性,提出了一种新的蛋白质与DNA相互作用的生物物理模型。它基于邻二核苷酸依赖性模型BayesPI2,其中新参数被限制在由DNA形状特征的二核苷酸形式所跨越的子空间中。这使得新参数能够作为对特定DNA形状特征的位置依赖性偏好进行生物物理解释。使用新模型,我们探索了几种转录因子在各种癌细胞系和细胞条件下DNA形状偏好的变化。结果表明,在类固醇处理的MCF7细胞中,FOXA1(叉头框蛋白A1)结合位点存在DNA形状变化。这种新的生物物理模型有助于阐明转录因子与DNA相互作用的更精细细节,以及未来预测癌症突变效应。