Zhang Fan, Kuo Michael D, Brunkhors Adrian
Dept. of Radiol., California Univ., San Diego, CA, USA.
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2025-7. doi: 10.1109/IEMBS.2006.260365.
E. coli promoter recognition is an area of great interest in bioinformatics. In this paper, we describe the implementation of a feed forward neural network to predict the E. coli promoter. According to the sequence conservation, some sequences with 60 bases are selected as positive samples and some corresponding non-promoters from E. coli coding areas are selected as negative samples, and a classifier based on feed forward neural network is trained. Results show that feed forward neural networks can extract the statistical characteristics of promoters more effectively, and that coding with four dimensions for nucleic acid data is superior to two dimensions. Another result demonstrated here is that the number of hidden layers seems to have no significant effect on E. coli promoter prediction precision. The research results in this paper can provide reference for promoter recognition research.
大肠杆菌启动子识别是生物信息学中一个备受关注的领域。在本文中,我们描述了一种前馈神经网络的实现方法,用于预测大肠杆菌启动子。根据序列保守性,选择一些60个碱基的序列作为正样本,并从大肠杆菌编码区选择一些相应的非启动子作为负样本,然后训练一个基于前馈神经网络的分类器。结果表明,前馈神经网络能够更有效地提取启动子的统计特征,并且对核酸数据进行四维编码优于二维编码。本文展示的另一个结果是,隐藏层的数量似乎对大肠杆菌启动子预测精度没有显著影响。本文的研究结果可为启动子识别研究提供参考。