Plotkin Joshua B, Dushoff Jonathan, Desai Michael M, Fraser Hunter B
Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
J Mol Evol. 2006 Nov;63(5):635-53. doi: 10.1007/s00239-005-0233-x. Epub 2006 Oct 14.
Selection pressures on proteins are usually measured by comparing homologous nucleotide sequences (Zuckerkandl and Pauling 1965). Recently we introduced a novel method, termed volatility, to estimate selection pressures on proteins on the basis of their synonymous codon usage (Plotkin and Dushoff 2003; Plotkin et al. 2004). Here we provide a theoretical foundation for this approach. Under the Fisher-Wright model, we derive the expected frequencies of synonymous codons as a function of the strength of selection on amino acids, the mutation rate, and the effective population size. We analyze the conditions under which we can expect to draw inferences from biased codon usage, and we estimate the time scales required to establish and maintain such a signal. We find that synonymous codon usage can reliably distinguish between negative selection and neutrality only for organisms, such as some microbes, that experience large effective population sizes or periods of elevated mutation rates. The power of volatility to detect positive selection is also modest (it requires approximately 100 selected sites), but it depends less strongly on population size. We show that phenomena such as transient hyper-mutators can improve the power of volatility to detect selection, even when the neutral site heterozygosity is low. We also discuss several confounding factors, neglected by the Fisher-Wright model, that may limit the applicability of volatility in practice.
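To make the term concrete, the sketch below computes a per-codon volatility score. It assumes the definition given in the cited earlier work (Plotkin et al. 2004): the fraction of a codon's non-stop single-nucleotide neighbors that encode a different amino acid. It also assumes the standard genetic code. The function names `codon_volatility` and `mean_volatility` are illustrative, and the simple per-gene average shown here is a simplification, not the statistical procedure or the population-genetic model analyzed in this paper.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codon_volatility(codon: str) -> float:
    """Fraction of non-stop point-mutation neighbors that change the amino acid.

    Assumed definition (hypothetical helper, not the paper's code).
    """
    codon = codon.upper()
    own_aa = CODON_TABLE[codon]
    if own_aa == "*":
        raise ValueError("volatility is not defined for stop codons here")
    total = 0
    changed = 0
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            neighbor = codon[:pos] + base + codon[pos + 1:]
            neighbor_aa = CODON_TABLE[neighbor]
            if neighbor_aa == "*":
                continue  # stop-codon neighbors are excluded from the count
            total += 1
            if neighbor_aa != own_aa:
                changed += 1
    return changed / total


def mean_volatility(sequence: str) -> float:
    """Average codon volatility over an in-frame coding sequence (stops skipped)."""
    sequence = sequence.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    codons = [sequence[i:i + 3] for i in range(0, len(sequence), 3)]
    scores = [codon_volatility(c) for c in codons if CODON_TABLE[c] != "*"]
    if not scores:
        raise ValueError("sequence contains no sense codons")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Two synonymous arginine codons with different volatility.
    print("CGA", codon_volatility("CGA"))
    print("AGA", codon_volatility("AGA"))
```

Under this definition the arginine codons CGA and AGA are synonymous but differ in volatility, which is the kind of difference in synonymous codon usage that the method relies on.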