Hughes Austin L, Friedman Robert
Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA.
Immunogenetics. 2008 Sep;60(9):495-506. doi: 10.1007/s00251-008-0304-4. Epub 2008 Jun 26.
Using basic probability theory, we show that there is a substantial likelihood that even in the presence of strong purifying selection, there will be a number of codons in which the number of synonymous nucleotide substitutions per site (d (S)) exceeds the number of non-synonymous nucleotide substitutions per site (d (N)). In an empirical study, we examined the numbers of synonymous (b (S)) and non-synonymous substitutions (b (N)) along branches of the phylogenies of 69 single-copy orthologous genes from seven species of mammals. A pattern of b (N) > b (S) was most commonly seen in the shortest branches of the tree and was associated with a high coefficient of variation in both b (N) and b (S), suggesting that high stochastic error in b (N) and b (S) on short branches, rather than positive Darwinian selection, is the explanation of most cases where b (N) is greater than b (S) on a given branch. The branch-site method of Zhang et al. (Zhang, Nielsen, Yang, Mol Biol Evol, 22:2472-2479, 2005) identified 117 codons on 35 branches as "positively selected," but a majority of these codons lacked synonymous substitutions, while in the others, synonymous and non-synonymous differences per site occurred in approximately equal frequencies. Thus, it was impossible to rule out the hypothesis that chance variation in the pattern of mutation across sites, rather than positive selection, accounted for the observed pattern. Our results showed that b (N)/b (S) was consistently elevated in immune system genes, but neither the search for branches with b (N) > b (S) nor the branch-site method revealed this trend.
运用基本概率论,我们证明,即便存在强大的纯化选择,仍极有可能出现许多密码子,其中每个位点的同义核苷酸替换数(d(S))超过每个位点的非同义核苷酸替换数(d(N))。在一项实证研究中,我们考察了来自七种哺乳动物的69个单拷贝直系同源基因的系统发育树各分支上的同义替换数(b(S))和非同义替换数(b(N))。b(N) > b(S)的模式在树的最短分支中最为常见,并且与b(N)和b(S)的高变异系数相关,这表明短分支上b(N)和b(S)的高随机误差,而非正向达尔文选择,是给定分支上b(N)大于b(S)的大多数情况的解释。张等人(Zhang, Nielsen, Yang, Mol Biol Evol, 22:2472 - 2479, 2005)的分支位点方法在35个分支上鉴定出117个密码子为“正选择”,但这些密码子中的大多数缺乏同义替换,而在其他密码子中,每个位点的同义差异和非同义差异出现的频率大致相等。因此,无法排除位点间突变模式的随机变异而非正选择导致所观察到模式的假设。我们的结果表明,免疫系统基因中的b(N)/b(S)持续升高,但无论是寻找b(N) > b(S)的分支还是分支位点方法都未揭示这一趋势。