Giuliano F, Arrigo P, Scalia F, Cardo P P, Damiani G
Istituto per i Circuiti Elettronici, Genova, Italy.
Comput Appl Biosci. 1993 Dec;9(6):687-93. doi: 10.1093/bioinformatics/9.6.687.
Computer recognition of short functional sites on DNA, such as promoter regions or intron-exon boundaries, has recently attracted much interest. In this paper we have focused our attention on the automatic recognition of relevant features of human nucleic acid sequences by means of an unsupervised artificial neural network model. Sixty messenger RNA and 31 genomic DNA sequences were analysed. The results showed that in mRNA, the minimal similarity 60 base pattern was guanine- and cytosine-rich and located in most sequences in a range of 250 bases from either the middle point of the signal peptide coding region or from the start of the coding region. On DNA sequences a region defined by a cluster of minimal similarity patterns was present in many of the analysed genes. This zone may be related to alternative splicing and DNA methylation.
计算机对DNA上短功能位点的识别,如启动子区域或内含子-外显子边界,最近引起了广泛关注。在本文中,我们将注意力集中在通过无监督人工神经网络模型自动识别人类核酸序列的相关特征上。分析了60个信使RNA和31个基因组DNA序列。结果表明,在信使RNA中,最小相似性60碱基模式富含鸟嘌呤和胞嘧啶,且在大多数序列中位于距信号肽编码区域中点或编码区域起始点250个碱基范围内。在DNA序列上,许多被分析基因中存在一个由最小相似性模式簇定义的区域。该区域可能与可变剪接和DNA甲基化有关。