Zhang Feng, Wang Yuping, Deng Hong-Wen
Institute of Molecular Genetics, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, China.
PLoS One. 2008;3(10):e3392. doi: 10.1371/journal.pone.0003392. Epub 2008 Oct 14.
Population stratification can cause spurious associations in population-based association studies. Several statistical methods have been proposed to reduce the impact of population stratification on population-based association studies. We simulated a set of stratified populations based on the real haplotype data from the HapMap ENCODE project, and compared the relative power, type I error rates, accuracy and positive prediction value of four prevailing population-based association study methods: traditional case-control tests, structured association (SA), genomic control (GC) and principal components analysis (PCA) under various population stratification levels. Additionally, we evaluated the effects of sample sizes and frequencies of disease susceptible allele on the performance of the four analytical methods in the presence of population stratification. We found that the performance of PCA was very stable under various scenarios. Our comparison results suggest that SA and PCA have comparable performance, if sufficient ancestral informative markers are used in SA analysis. GC appeared to be strongly conservative in significantly stratified populations. It may be better to apply GC in the stratified populations with low stratification level. Our study intends to provide a practical guideline for researchers to select proper study methods and make appropriate inference of the results in population-based association studies.
群体分层会在基于人群的关联研究中导致虚假关联。已经提出了几种统计方法来减少群体分层对基于人群的关联研究的影响。我们基于HapMap ENCODE项目的真实单倍型数据模拟了一组分层群体,并比较了四种主流的基于人群的关联研究方法在不同群体分层水平下的相对功效、I型错误率、准确性和阳性预测值:传统病例对照检验、结构化关联(SA)、基因组控制(GC)和主成分分析(PCA)。此外,我们评估了样本量和疾病易感等位基因频率对存在群体分层时四种分析方法性能的影响。我们发现PCA在各种情况下的性能都非常稳定。我们的比较结果表明,如果在SA分析中使用足够的祖先信息标记,SA和PCA具有可比的性能。在分层显著的群体中,GC似乎非常保守。在分层水平较低的分层群体中应用GC可能更好。我们的研究旨在为研究人员在基于人群的关联研究中选择合适的研究方法并对结果进行适当推断提供实用指南。