Brown Garth R, Gill Geoffrey P, Kuntz Robert J, Langley Charles H, Neale David B
Department of Environmental Horticulture, University of California, Davis, CA 95616, USA.
Proc Natl Acad Sci U S A. 2004 Oct 19;101(42):15255-60. doi: 10.1073/pnas.0404231101. Epub 2004 Oct 11.
Outbreeding species with large, stable population sizes, such as widely distributed conifers, are expected to harbor relatively more DNA sequence polymorphism. Under the neutral theory of molecular evolution, the expected heterozygosity is a function of the product 4N(e)mu, where N(e) is the effective population size and mu is the per-generation mutation rate, and the genomic scale of linkage disequilibrium is determined by 4N(e)r, where r is the per-generation recombination rate between adjacent sites. These parameters were estimated in the long-lived, outcrossing gymnosperm loblolly pine (Pinus taeda L.) from a survey of single nucleotide polymorphisms across approximately 18 kb of DNA distributed among 19 loci from a common set of 32 haploid genomes. Estimates of 4N(e)mu at silent and nonsynonymous sites were 0.00658 and 0.00108, respectively, and both were statistically heterogeneous among loci. By Tajima's D statistic, the site frequency spectrum of no locus was observed to deviate from that predicted by neutral theory. Substantial recombination in the history of the sampled alleles was observed and linkage disequilibrium declined within several kilobases. The composite likelihood estimate of 4N(e)r based on all two-site sample configurations equaled 0.00175. When geological dating, an assumed generation time (25 years), and an estimated divergence from Pinus pinaster Ait. are used, the effective population size of loblolly pine should be 5.6 x 10(5). The emerging narrow range of estimated silent site heterozygosities (relative to the vast range of population sizes) for humans, Drosophila, maize, and pine parallels the paradox described earlier for allozyme polymorphism and challenges simple equilibrium models of molecular evolution.
远交物种具有大而稳定的种群规模,比如广泛分布的针叶树,预计会含有相对更多的DNA序列多态性。根据分子进化的中性理论,预期杂合度是乘积4N(e)μ的函数,其中N(e)是有效种群规模,μ是每代的突变率,而连锁不平衡的基因组规模由4N(e)r决定,其中r是相邻位点之间的每代重组率。这些参数是在长寿的异交裸子植物火炬松(Pinus taeda L.)中估计得到的,该估计来自对跨越约18 kb DNA的单核苷酸多态性的调查,这些DNA分布在来自32个单倍体基因组的一组共19个基因座中。沉默位点和非同义位点的4N(e)μ估计值分别为0.00658和0.00108,并且两者在基因座之间在统计学上是异质的。通过Tajima's D统计量,未观察到任何基因座的位点频率谱偏离中性理论预测的谱。在抽样等位基因的历史中观察到了大量重组,并且连锁不平衡在几千碱基内下降。基于所有两位点样本配置的4N(e)r的复合似然估计值等于0.00175。当使用地质年代测定、假定的世代时间(25年)以及与海岸松(Pinus pinaster Ait.)的估计分歧时,火炬松的有效种群规模应为5.6×10⁵。人类、果蝇、玉米和松树估计的沉默位点杂合度出现的狭窄范围(相对于种群规模的巨大范围)与之前描述的关于等位酶多态性的悖论相似,并对简单的分子进化平衡模型提出了挑战。