A comparison of association methods correcting for population stratification in case-control studies.

Authors

Wu Chengqing, DeWan Andrew, Hoh Josephine, Wang Zuoheng

Affiliation

Department of Epidemiology and Public Health, Yale University, New Haven, CT 06510, USA.

Publication information

Ann Hum Genet. 2011 May;75(3):418-27. doi: 10.1111/j.1469-1809.2010.00639.x. Epub 2011 Jan 31.

DOI: 10.1111/j.1469-1809.2010.00639.x
PMID: 21281271
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3215268/
Abstract

Population stratification is an important issue in case-control studies of disease-marker association. Failure to properly account for population structure can lead to spurious association or reduced power. In this article, we compare the performance of six methods correcting for population stratification in case-control association studies. These methods include genomic control (GC), EIGENSTRAT, principal component-based logistic regression (PCA-L), LAPSTRUCT, ROADTRIPS, and EMMAX. We also include the uncorrected Armitage test for comparison. In the simulation studies, we consider a wide range of population structure models for unrelated samples, including admixture. Our simulation results suggest that PCA-L and LAPSTRUCT perform well over all the scenarios studied, whereas GC, ROADTRIPS, and EMMAX fail to correct for population structure at single nucleotide polymorphisms (SNPs) that show strong differentiation across ancestral populations. The Armitage test does not adjust for confounding due to stratification and thus has an inflated type I error rate. Among all correction methods, EMMAX has the greatest power, based on the population structure settings considered for samples with unrelated individuals. The three methods EIGENSTRAT, PCA-L, and LAPSTRUCT are comparable, and outperform both GC and ROADTRIPS in almost all situations.
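To make the confounding described in the abstract concrete, the following is a minimal sketch, not the authors' simulation code. It assumes two discrete subpopulations under a Balding-Nichols-style allele-frequency model with hypothetical parameters (`fst=0.01`, 500 individuals per subpopulation, differing case fractions). It computes the uncorrected Armitage trend statistic as N·r², a genomic-control rescaling by the median-based inflation factor, and an EIGENSTRAT-like statistic that residualizes genotype and phenotype on the top principal component. All function names are illustrative.

```python
import numpy as np

CHI2_1DF_MEDIAN = 0.4549364  # median of a chi-square distribution with 1 df


def armitage_trend(genotypes, phenotype):
    """Uncorrected Armitage trend statistic per SNP (additive 0/1/2 coding), as N * r^2."""
    g = genotypes - genotypes.mean(axis=0)
    y = phenotype - phenotype.mean()
    r2 = (g.T @ y) ** 2 / ((g ** 2).sum(axis=0) * (y @ y))
    return len(y) * r2


def inflation_factor(stats):
    """Genomic-control lambda: observed median statistic over the null median."""
    return float(np.median(stats) / CHI2_1DF_MEDIAN)


def genomic_control(stats):
    """Rescale all statistics by one global factor (never inflating them)."""
    return stats / max(1.0, inflation_factor(stats))


def pc_adjusted_trend(genotypes, phenotype, n_pcs=1):
    """EIGENSTRAT-like statistic: remove the top PCs from genotype and phenotype first."""
    g = genotypes - genotypes.mean(axis=0)
    y = phenotype - phenotype.mean()
    z = g / g.std(axis=0)                      # standardized genotypes for the PCA
    u, _, _ = np.linalg.svd(z, full_matrices=False)
    pcs = u[:, :n_pcs]                         # orthonormal sample-level axes
    g_res = g - pcs @ (pcs.T @ g)
    y_res = y - pcs @ (pcs.T @ y)
    r2 = (g_res.T @ y_res) ** 2 / ((g_res ** 2).sum(axis=0) * (y_res @ y_res))
    return (len(y) - n_pcs - 1) * r2


def simulate_null(n_per_pop=500, n_snps=2000, fst=0.01, cases_per_pop=(350, 150), seed=0):
    """Null SNPs in two subpopulations whose case fractions differ (assumed setup)."""
    rng = np.random.default_rng(seed)
    p = rng.uniform(0.1, 0.9, n_snps)          # ancestral allele frequencies
    scale = (1 - fst) / fst
    freqs = rng.beta(p * scale, (1 - p) * scale, size=(2, n_snps))
    pop = np.repeat([0, 1], n_per_pop)
    genotypes = rng.binomial(2, freqs[pop]).astype(float)
    phenotype = np.concatenate(
        [np.r_[np.ones(k), np.zeros(n_per_pop - k)] for k in cases_per_pop]
    )
    keep = genotypes.std(axis=0) > 0           # drop monomorphic SNPs, if any
    return genotypes[:, keep], phenotype


if __name__ == "__main__":
    g, y = simulate_null()
    raw = armitage_trend(g, y)
    print("lambda, uncorrected:", inflation_factor(raw))
    print("lambda, GC-rescaled:", inflation_factor(genomic_control(raw)))
    print("lambda, PC-adjusted:", inflation_factor(pc_adjusted_trend(g, y)))
```

Because genomic control divides every SNP's statistic by the same factor, it cannot single out SNPs that are unusually differentiated between ancestral populations, which is consistent with the limitation the abstract reports for GC. The PC adjustment instead removes a per-SNP ancestry component. PCA-L, LAPSTRUCT, ROADTRIPS, and EMMAX are not reproduced here.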

