Jones R
Thinking Machines Corporation, Cambridge, MA 02142.
Comput Appl Biosci. 1992 Aug;8(4):377-83. doi: 10.1093/bioinformatics/8.4.377.
A method is described for finding all occurrences of a sequence pattern within a database of molecular sequences. Implementing the method on a massively parallel computer allows the user to perform very fast database searches using complex patterns. In particular, the software supports approximate pattern matching with score thresholds for either the entire pattern or specified elements of it. Matches to individual elements can be linked by variable-length gaps within user-specified limits.
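
The kind of search described above can be illustrated with a minimal serial sketch. It is not the paper's parallel implementation, and every name in it (`Element`, `score`, `find_matches`, `min_total`) is hypothetical. It assumes that a pattern is a list of literal elements, that an element's score is simply the number of positions at which it agrees with the sequence, and that each element after the first carries the minimum and maximum gap allowed before it.

```python
from dataclasses import dataclass
from typing import Iterator, List, Tuple


@dataclass
class Element:
    """One pattern element (hypothetical structure, for illustration only)."""
    motif: str        # literal residues to compare against the sequence
    min_score: int    # per-element threshold: matching positions required
    min_gap: int = 0  # fewest residues allowed between previous element and this one
    max_gap: int = 0  # most residues allowed between previous element and this one


def score(motif: str, seq: str, pos: int) -> int:
    """Count positions where motif agrees with seq starting at pos (assumed scoring)."""
    return sum(1 for a, b in zip(motif, seq[pos:pos + len(motif)]) if a == b)


def find_matches(pattern: List[Element], seq: str,
                 min_total: int = 0) -> Iterator[Tuple[int, ...]]:
    """Yield the start position of each element for every match of pattern in seq."""

    def extend(idx: int, start: int, starts: Tuple[int, ...],
               total: int) -> Iterator[Tuple[int, ...]]:
        elem = pattern[idx]
        if start + len(elem.motif) > len(seq):
            return  # element would run off the end of the sequence
        s = score(elem.motif, seq, start)
        if s < elem.min_score:
            return  # per-element threshold not met
        starts = starts + (start,)
        total += s
        if idx == len(pattern) - 1:
            if total >= min_total:  # whole-pattern threshold
                yield starts
            return
        nxt = pattern[idx + 1]
        end = start + len(elem.motif)
        # Try every gap length permitted before the next element.
        for gap in range(nxt.min_gap, nxt.max_gap + 1):
            yield from extend(idx + 1, end + gap, starts, total)

    for start in range(len(seq)):
        yield from extend(0, start, (), 0)


# "ACG" with at most one mismatch, then a gap of 1 to 3 residues, then an exact "TT".
pattern = [Element("ACG", min_score=2), Element("TT", min_score=2, min_gap=1, max_gap=3)]
seq = "ACGATTCCAAGGGTT"
print(list(find_matches(pattern, seq)))               # [(0, 4), (8, 13), (9, 13)]
print(list(find_matches(pattern, seq, min_total=5)))  # [(0, 4)]
```

Raising `min_total` to 5 keeps only the occurrence in which both elements match exactly, which shows how a whole-pattern threshold can be stricter than the per-element thresholds combined. A database search would repeat this scan over every sequence; on a parallel machine the start positions or sequences could be distributed across processors, though how the paper does so is not stated in the abstract.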