Li W H, Sadler L A
Center for Demographic and Population Genetics, University of Texas Health Science Center, Houston 77225.
Genetics. 1991 Oct;129(2):513-23. doi: 10.1093/genetics/129.2.513.
The nucleotide diversity (pi) in humans is studied by using published cDNA and genomic sequences that have been carefully checked for sequencing accuracy. This measure of genetic variability is defined as the number of nucleotide differences per site between two randomly chosen sequences from a population. A total of more than 75,000 base pairs from 49 loci are compared. The DNA regions studied are the 5' and 3' untranslated regions and the amino acid coding regions. The coding regions are divided into nondegenerate sites (i.e., sites at which all possible changes are nonsynonymous), twofold degenerate sites (i.e., sites at each of which one of the three possible changes is synonymous) and fourfold degenerate sites (i.e., sites at which all three possible changes are synonymous). The pi values estimated are, respectively, 0.03 and 0.04% for the 5' and 3' UT regions, and 0.03, 0.06 and 0.11% for nondegenerate, twofold degenerate and fourfold degenerate sites. Since the highest pi value is only 0.11%, which is about one order of magnitude lower than those in Drosophila populations, the nucleotide diversity in humans is very low. The low diversity is probably due to a relatively small long-term effective population size rather than any severe bottleneck during human evolution.
通过使用已仔细检查测序准确性的已发表的cDNA和基因组序列来研究人类的核苷酸多样性(π)。这种遗传变异性的度量被定义为从一个群体中随机选择的两个序列之间每个位点的核苷酸差异数。总共比较了来自49个基因座的超过75,000个碱基对。所研究的DNA区域是5'和3'非翻译区以及氨基酸编码区。编码区分为非简并位点(即所有可能变化都是非同义的位点)、双重简并位点(即三个可能变化中的一个是同义的位点)和四重简并位点(即所有三个可能变化都是同义的位点)。对于5'和3'非翻译区,估计的π值分别为0.03%和0.04%,对于非简并、双重简并和四重简并位点,分别为0.03%、0.06%和0.11%。由于最高的π值仅为0.11%,比果蝇群体中的值低约一个数量级,因此人类的核苷酸多样性非常低。这种低多样性可能是由于长期有效种群规模相对较小,而不是人类进化过程中的任何严重瓶颈。