Department of Ecology, Montana State University, Bozeman, MT, USA.
Heredity (Edinb). 2011 Apr;106(4):625-32. doi: 10.1038/hdy.2010.95. Epub 2010 Aug 4.
One of the primary goals of population genetics is to succinctly describe genetic relationships among populations, and the computer program STRUCTURE is one of the most frequently used tools for doing so. The mathematical model used by STRUCTURE was designed to sort individuals into Hardy-Weinberg populations, but the program is also frequently used to group individuals from a large number of populations into a small number of clusters that are supposed to represent the main genetic divisions within species. In this study, I used computer simulations to examine how well STRUCTURE accomplishes this latter task. Simulations of populations that had a simple hierarchical history of fragmentation showed that when there were relatively long divergence times within evolutionary lineages, the clusters created by STRUCTURE were frequently not consistent with the evolutionary history of the populations. These difficulties can be attributed to forcing STRUCTURE to place individuals into too few clusters. Simulations also showed that the clusters produced by STRUCTURE can be strongly influenced by variation in sample size. In some circumstances, STRUCTURE simply put all of the individuals from the largest sample in the same cluster. A reanalysis of human population structure suggests that the problems I identified with STRUCTURE in simulations may have obscured relationships among human populations-particularly genetic similarity between Europeans and some African populations.
群体遗传学的主要目标之一是简洁地描述群体之间的遗传关系,而 STRUCTURE 程序是最常用来实现这一目标的工具之一。STRUCTURE 使用的数学模型旨在将个体分类为哈迪-温伯格群体,但该程序也常用于将来自大量群体的个体分为少数几个聚类,这些聚类应该代表物种内的主要遗传分支。在这项研究中,我使用计算机模拟来检验 STRUCTURE 在完成后一个任务方面的表现。对具有简单分裂历史的群体的模拟表明,当进化谱系内的分歧时间相对较长时,STRUCTURE 创建的聚类通常与群体的进化历史不一致。这些困难可以归因于迫使 STRUCTURE 将个体分配到太少的聚类中。模拟还表明,STRUCTURE 产生的聚类会受到样本量变化的强烈影响。在某些情况下,STRUCTURE 只是将最大样本中的所有个体归入同一个聚类中。对人类群体结构的重新分析表明,我在模拟中发现的 STRUCTURE 存在的问题可能掩盖了人类群体之间的关系——特别是欧洲人和一些非洲人群体之间的遗传相似性。