Gaut B S, Weir B S
Department of Statistics, North Carolina State University, Raleigh 27695-8203.
Mol Biol Evol. 1994 Jul;11(4):620-9. doi: 10.1093/oxfordjournals.molbev.a040141.
Likelihood-ratio statistics are proposed to test for heterogeneity in nucleotide substitution rate among regions of a DNA sequence. The tests examine three-sequence phylogenies, and two specific tests are proposed: a test to detect rate heterogeneity among genic regions within a sequence, over all evolutionary lineages; and a test to detect rate heterogeneity among regions in a specific evolutionary lineage. Simulations examine the ability of tests to detect a single region that varies in nucleotide substitution rate relative to the remainder of the sequence. A 50-bp region with a fivefold substitution-rate increase can be detected > or = 90% of the time when it is found in all three lineages of the phylogeny, and a 50-bp region of fivefold rate increase can be detected with approximately 70% power when it is found in only one evolutionary lineage. Simulation also examines the effect of transition- and transversion-rate differences. The tests are applied to published DNA sequences. While the tests are powerful, significant results can be difficult to interpret biologically.