Breen E J, Williams K L
School of Biological Sciences, Macquarie University, Sydney, N.S.W. Australia.
Comput Methods Programs Biomed. 1989 Feb;28(2):87-91. doi: 10.1016/0169-2607(89)90164-8.
Open hashing is used to demonstrate the effectiveness of several hashing functions for the uniform distribution of biological records. The three types of database tested were (1) genetic nomenclature, mutation sites and strain names, (2) surnames extracted from literature files and (3) a set of 1000 numeric ASCII strings. Three hash functions (hashpjw, hashcrc and hashquad) showed considerable versatility on all data sets examined, while two others, hashsum and hashsmc, performed poorly on the same databases.
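
The kind of comparison described above can be illustrated with a minimal sketch. It is not the authors' code, and several details are assumptions: hashpjw is taken to be P. J. Weinberger's hash in its commonly published form, hashsum is assumed to be a plain sum of character codes (the abstract does not define it), the table size of 211 is an arbitrary prime, and the numeric strings are assumed to be "0" through "999". The helper names build_table and occupancy are hypothetical.

```python
TABLE_SIZE = 211  # assumed prime table size; not stated in the abstract


def hashpjw(key: str, size: int = TABLE_SIZE) -> int:
    """P. J. Weinberger's hash in its commonly published form (32-bit)."""
    h = 0
    for ch in key:
        # Shift in the next character, emulating 32-bit unsigned arithmetic.
        h = ((h << 4) + ord(ch)) & 0xFFFFFFFF
        g = h & 0xF0000000
        if g:
            # Fold the top four bits back into the lower bits, then clear them.
            h ^= g >> 24
            h ^= g
    return h % size


def hashsum(key: str, size: int = TABLE_SIZE) -> int:
    """Assumed definition: sum of character codes modulo the table size."""
    return sum(ord(ch) for ch in key) % size


def build_table(keys, hash_fn, size: int = TABLE_SIZE):
    """Open hashing (separate chaining): each bucket holds a list of keys."""
    table = [[] for _ in range(size)]
    for key in keys:
        table[hash_fn(key, size)].append(key)
    return table


def occupancy(table):
    """Return (number of non-empty buckets, length of the longest chain)."""
    used = sum(1 for chain in table if chain)
    longest = max(len(chain) for chain in table)
    return used, longest


if __name__ == "__main__":
    # Assumed stand-in for the "1000 numeric ASCII strings" data set.
    keys = [str(i) for i in range(1000)]
    for name, fn in (("hashsum", hashsum), ("hashpjw", hashpjw)):
        used, longest = occupancy(build_table(keys, fn))
        print(f"{name}: {used} buckets used, longest chain {longest}")
```

Under these assumptions, the character sums of short numeric strings fall in a narrow range, so the summing hash leaves most buckets empty and builds long chains, whereas the shift-and-fold hash spreads the same keys over far more buckets. This is consistent with, but does not reproduce, the results reported in the abstract.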