Department of Biology, Stanford University, Stanford, California 94305, USA.
G3 (Bethesda). 2013 May 20;3(5):891-907. doi: 10.1534/g3.113.005728.
Over the past two decades, microsatellite genotypes have provided the data for landmark studies of human population-genetic variation. However, the various microsatellite data sets have been prepared with different procedures and sets of markers, so that it has been difficult to synthesize available data for a comprehensive analysis. Here, we combine eight human population-genetic data sets at the 645 microsatellite loci they share in common, accounting for procedural differences in the production of the different data sets, to assemble a single data set containing 5795 individuals from 267 worldwide populations. We perform a systematic analysis of genetic relatedness, detecting 240 intra-population and 92 inter-population pairs of previously unidentified close relatives and proposing standardized subsets of unrelated individuals for use in future studies. We then augment the human data with a data set of 84 chimpanzees at the 246 loci they share in common with the human samples. Multidimensional scaling and neighbor-joining analyses of these data sets offer new insights into the structure of human populations and enable a comparison of genetic variation patterns in chimpanzees with those in humans. Our combined data sets are the largest of their kind reported to date and provide a resource for use in human population-genetic studies.
在过去的二十年中,微卫星基因型为人类群体遗传变异的标志性研究提供了数据。然而,各种微卫星数据集是用不同的程序和标记集准备的,因此很难综合现有数据进行全面分析。在这里,我们结合了八个人类群体遗传数据集,这些数据集在 645 个共同的微卫星基因座上共享,考虑到不同数据集产生过程中的差异,组装了一个包含来自全球 267 个群体的 5795 个人的单一数据集。我们对遗传相关性进行了系统分析,检测到 240 对群体内和 92 对群体间以前未识别的近亲,并提出了标准化的无关个体子集,用于未来的研究。然后,我们用一个与人类样本共享 246 个基因座的 84 只黑猩猩数据集来扩充人类数据。对这些数据集的多维尺度分析和邻接聚类分析提供了人类群体结构的新见解,并使我们能够比较黑猩猩和人类的遗传变异模式。我们的组合数据集是迄今为止报道的最大数据集之一,为人类群体遗传研究提供了资源。