Pedersen A G, Engelbrecht J
Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark.
Proc Int Conf Intell Syst Mol Biol. 1995;3:292-9.
In this paper we present a novel method for using the learning ability of a neural network as a measure of information in local regions of input data. Using the method to analyze Escherichia coli promoters, we discover all previously described signals, and furthermore find new signals that are regularly spaced along the promoter region. The spacing of all signals correspond to the helical periodicity of DNA, meaning that the signals are all present on the same face of the DNA helix in the promoter region. This is consistent with a model where the RNA polymerase contacts the promoter on one side of the DNA, and suggests that the regions important for promoter recognition may include more positions on the DNA than usually assumed. We furthermore analyze the E. coli promoters by calculating the Kullback Leibler distance, and by constructing sequence logos.
在本文中,我们提出了一种新颖的方法,利用神经网络的学习能力作为输入数据局部区域信息的一种度量。使用该方法分析大肠杆菌启动子,我们发现了所有先前描述的信号,此外还发现了沿启动子区域规则间隔排列的新信号。所有信号的间隔与DNA的螺旋周期性相对应,这意味着这些信号都出现在启动子区域DNA螺旋的同一面上。这与RNA聚合酶在DNA一侧与启动子接触的模型一致,并表明对启动子识别重要的区域可能包括DNA上比通常认为的更多的位置。我们还通过计算库尔贝克-莱布勒散度和构建序列标识来分析大肠杆菌启动子。