A comparison of association methods correcting for population stratification in case-control studies.

Author information

Wu Chengqing, DeWan Andrew, Hoh Josephine, Wang Zuoheng

Affiliation

Department of Epidemiology and Public Health, Yale University, New Haven, CT 06510, USA.

Publication information

Ann Hum Genet. 2011 May;75(3):418-27. doi: 10.1111/j.1469-1809.2010.00639.x. Epub 2011 Jan 31.

Abstract

Population stratification is an important issue in case-control studies of disease-marker association. Failure to properly account for population structure can lead to spurious association or reduced power. In this article, we compare the performance of six methods correcting for population stratification in case-control association studies. These methods include genomic control (GC), EIGENSTRAT, principal component-based logistic regression (PCA-L), LAPSTRUCT, ROADTRIPS, and EMMAX. We also include the uncorrected Armitage test for comparison. In the simulation studies, we consider a wide range of population structure models for unrelated samples, including admixture. Our simulation results suggest that PCA-L and LAPSTRUCT perform well over all the scenarios studied, whereas GC, ROADTRIPS, and EMMAX fail to correct for population structure at single nucleotide polymorphisms (SNPs) that show strong differentiation across ancestral populations. The Armitage test does not adjust for confounding due to stratification and thus has an inflated type I error rate. Among all correction methods, EMMAX has the greatest power, based on the population structure settings considered for samples with unrelated individuals. The three methods, EIGENSTRAT, PCA-L, and LAPSTRUCT, are comparable, and outperform both GC and ROADTRIPS in almost all situations.
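As a rough illustration of the two simplest approaches named in the abstract, the sketch below computes an uncorrected Armitage trend statistic and then applies a genomic control (GC) rescaling. It is not the authors' code. The function names are hypothetical, genotypes are assumed to be coded as 0/1/2 allele counts and phenotypes as 0/1 case status, and the trend statistic is written in its standard form as N times the squared genotype-phenotype correlation.

```python
import statistics

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def armitage_trend_stat(genotypes, phenotypes):
    """Armitage trend statistic for one SNP, computed as N * r^2.

    genotypes: allele counts coded 0/1/2 (assumed coding)
    phenotypes: case status coded 0/1 (assumed coding)
    """
    n = len(genotypes)
    mean_g = sum(genotypes) / n
    mean_y = sum(phenotypes) / n
    sgy = sum((g - mean_g) * (y - mean_y) for g, y in zip(genotypes, phenotypes))
    sgg = sum((g - mean_g) ** 2 for g in genotypes)
    syy = sum((y - mean_y) ** 2 for y in phenotypes)
    if sgg == 0 or syy == 0:
        # Monomorphic SNP or a single phenotype class: no information.
        return 0.0
    return n * sgy * sgy / (sgg * syy)


def genomic_control(stats):
    """Rescale trend statistics by the inflation factor lambda.

    Lambda is the median observed statistic divided by the median of a
    chi-square(1) distribution, floored at 1 so that statistics are
    never inflated by the correction.
    """
    lam = max(1.0, statistics.median(stats) / CHI2_1DF_MEDIAN)
    return lam, [s / lam for s in stats]


if __name__ == "__main__":
    # Toy data for a single SNP in six individuals (illustrative only).
    stat = armitage_trend_stat([0, 1, 2, 0, 1, 2], [0, 0, 1, 0, 1, 1])
    print(stat)
```

Because GC divides every statistic by the same factor, it cannot account for SNPs whose allele frequencies differ unusually strongly between ancestral populations, which is consistent with the abstract's finding that GC fails at strongly differentiated SNPs.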
