Drummond Alexei, Pybus Oliver G, Rambaut Andrew
Department of Zoology, University of Oxford, South Parks Road, Oxford, OX1 3PS, UK.
Adv Parasitol. 2003;54:331-58. doi: 10.1016/s0065-308x(03)54008-8.
The processes of mutation and nucleotide substitution contribute to the observed variability in virulence, transmission and persistence of viral pathogens. Since most viruses evolve many times faster than their human hosts, we are in the unusual position of being able to measure these processes directly by comparing viral genes that have been isolated and sequenced at different points in time. The analysis of such data requires the use of specific statistical methods that take into account the shared ancestry of the sequences and the randomness inherent in the process of nucleotide substitution. In this paper we describe the various statistical methods for estimating evolutionary rates, which can be classified into three general approaches: linear regression, maximum likelihood, and Bayesian inference. We discuss the advantages and shortcomings of each approach and illustrate their use through the analysis of two example viruses; human immunodeficiency virus type 1 and dengue virus serotype 4. Reliable estimates of viral substitution rates have many important applications in population genetics and phylogenetics, including dating evolutionary events and divergence times, estimating demographic parameters such as population size and generation time, and investigating the effect of natural selection on molecular evolution.
突变和核苷酸替换过程导致了病毒病原体在毒力、传播和持久性方面所观察到的变异性。由于大多数病毒的进化速度比人类宿主快许多倍,我们处于一个特殊的位置,能够通过比较在不同时间点分离和测序的病毒基因来直接测量这些过程。对此类数据的分析需要使用特定的统计方法,这些方法要考虑到序列的共同祖先以及核苷酸替换过程中固有的随机性。在本文中,我们描述了估计进化速率的各种统计方法,这些方法可分为三种一般方法:线性回归、最大似然法和贝叶斯推断。我们讨论了每种方法的优缺点,并通过分析两种示例病毒——人类免疫缺陷病毒1型和登革病毒血清型4来说明它们的用途。病毒替换率的可靠估计在群体遗传学和系统发育学中有许多重要应用,包括确定进化事件和分歧时间、估计种群大小和世代时间等人口统计学参数,以及研究自然选择对分子进化的影响。