Danilova Ludmila V, Lyubetsky Vassily A, Gelfand Mikhail S
Institute for Information Transmission Problems RAS, Moscow, 101447, Russia.
In Silico Biol. 2003;3(1-2):33-47. Epub 2003 Apr 14.
We describe an algorithm (IRSA) for identification of common regulatory signals in samples of unaligned DNA sequences. The algorithm was tested on randomly generated sequences of fixed length with implanted signal of length 15 with 4 mutations, and on natural upstream regions of bacterial genes regulated by PurR, ArgR and CRP. Then it was applied to upstream regions of orthologous genes from Escherichia coli and related genomes. Some new palindromic binding and direct repeats signals were identified. Finally we present a parallel version suitable for computers supporting the MPI protocol. This implementation is not strictly bounded by the number of available processors. The computation speed linearly depends on the number of processors.