Department of Genetics, Stanford University, Mail Stop 5120, Stanford, California 94305, USA.
Genome Biol. 2010 Jan 28;11(1):R10. doi: 10.1186/gb-2010-11-1-r10.
The transmission pattern of the human X chromosome reduces its population size relative to the autosomes, subjects it to disproportionate influence by female demography, and leaves X-linked mutations exposed to selection in males. As a result, the analysis of X-linked genomic variation can provide insights into the influence of demography and selection on the human genome. Here we characterize the genomic variation represented by 16,297 X-linked SNPs genotyped in the CEPH human genome diversity project samples.
We found that X chromosomes tend to be more differentiated between human populations than autosomes, with several notable exceptions. Comparisons between genetically distant populations also showed an excess of X-linked SNPs with large allele frequency differences. Combining information about these SNPs with results from tests designed to detect selective sweeps, we identified two regions that were clear outliers from the rest of the X chromosome for haplotype structure and allele frequency distribution. We were also able to more precisely define the geographical extent of some previously described X-linked selective sweeps.
The relationship between male and female demographic histories is likely to be complex as evidence supporting different conclusions can be found in the same dataset. Although demography may have contributed to the excess of SNPs with large allele frequency differences observed on the X chromosome, we believe that selection is at least partially responsible. Finally, our results reveal the geographical complexities of selective sweeps on the X chromosome and argue for the use of diverse populations in studies of selection.
人类 X 染色体的传递模式使其相对于常染色体而言种群规模较小,受到女性人口统计学的不成比例影响,并使 X 连锁突变在男性中暴露于选择之下。因此,对 X 连锁基因组变异的分析可以深入了解人口统计学和选择对人类基因组的影响。在这里,我们描述了在 CEPH 人类基因组多样性计划样本中对 16,297 个 X 连锁 SNP 进行基因分型所代表的基因组变异。
我们发现,与常染色体相比,X 染色体在人类群体之间的分化程度更高,但也有几个明显的例外。遗传距离较远的人群之间的比较也显示出 X 连锁 SNP 具有较大等位基因频率差异的过多。将这些 SNP 的信息与旨在检测选择清扫的测试结果结合起来,我们确定了两个区域,它们在单倍型结构和等位基因频率分布方面明显偏离了 X 染色体的其余部分。我们还能够更精确地定义一些先前描述的 X 连锁选择清扫的地理范围。
男性和女性人口统计学历史之间的关系可能很复杂,因为同一数据集中可以找到支持不同结论的证据。尽管人口统计学可能导致 X 染色体上观察到的具有较大等位基因频率差异的 SNP 过多,但我们认为选择至少部分负责。最后,我们的结果揭示了 X 染色体上选择清扫的地理复杂性,并主张在选择研究中使用多样化的人群。