• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 GWAS 汇总统计数据的遗传关联研究中的群体分层控制。

Control for population stratification in genetic association studies based on GWAS summary statistics.

机构信息

Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, USA.

出版信息

Genet Epidemiol. 2022 Dec;46(8):604-614. doi: 10.1002/gepi.22493. Epub 2022 Jun 29.

DOI:10.1002/gepi.22493
PMID:35766057
Abstract

Over the past years, genome-wide association studies (GWAS) have generated a wealth of new information. Summary data from many GWAS are now publicly available, promoting the development of many statistical methods for association studies based on GWAS summary statistics, which avoids the increasing challenges associated with individual-level genotype and phenotype data sharing. However, for population-based association studies such as GWAS, it has been long recognized that population stratification can seriously confound association results. For large GWAS, it is very likely that there exist population stratification and cryptic relatedness, which will result in inflated Type I error in association testing. Although many methods have been developed to control for population stratification, only two of these approaches can be used to control population stratification without individual-level data: one is based on genomic control (GC) and the other one is based on linkage disequilibrium score regression (LDSC). However, the performance of these two approaches is currently unknown. In this study, we use extensive simulation studies including populations with subpopulations, spatially structured populations, and populations with cryptic relatedness to compare the performance of these two approaches to control for population stratification using only GWAS summary statistics without individual-level data. Data sets from the genetic analysis workshop 19 and UK Biobank are also used to evaluate these two approaches. We demonstrate that the intercept of LDSC can be used as a more accurate correction factor than GC. The results from this study will provide very useful information for researchers using GWAS summary statistics while trying to control for population stratification.

摘要

在过去的几年中,全基因组关联研究(GWAS)已经产生了大量的新信息。现在,许多 GWAS 的汇总数据都是公开的,这促进了许多基于 GWAS 汇总统计数据的关联研究统计方法的发展,这些方法避免了与个体水平基因型和表型数据共享相关的日益增加的挑战。然而,对于基于人群的关联研究(如 GWAS),人们早就认识到群体分层会严重混淆关联结果。对于大型 GWAS,很可能存在群体分层和隐蔽的亲缘关系,这将导致关联测试中的Ⅰ型错误膨胀。尽管已经开发了许多方法来控制群体分层,但只有两种方法可以在没有个体水平数据的情况下用于控制群体分层:一种方法基于基因组控制(GC),另一种方法基于连锁不平衡得分回归(LDSC)。然而,目前还不知道这两种方法的性能如何。在这项研究中,我们使用了广泛的模拟研究,包括具有亚群的人群、空间结构的人群和具有隐蔽亲缘关系的人群,来比较这两种方法在使用仅基于 GWAS 汇总统计数据而不使用个体水平数据的情况下控制群体分层的性能。遗传分析工作坊 19 和英国生物银行的数据也被用于评估这两种方法。我们证明 LDSC 的截距可以用作比 GC 更准确的校正因子。这项研究的结果将为使用 GWAS 汇总统计数据并试图控制群体分层的研究人员提供非常有用的信息。

相似文献

1
Control for population stratification in genetic association studies based on GWAS summary statistics.基于 GWAS 汇总统计数据的遗传关联研究中的群体分层控制。
Genet Epidemiol. 2022 Dec;46(8):604-614. doi: 10.1002/gepi.22493. Epub 2022 Jun 29.
2
LD Score regression distinguishes confounding from polygenicity in genome-wide association studies.LD评分回归在全基因组关联研究中区分混杂因素与多基因性。
Nat Genet. 2015 Mar;47(3):291-5. doi: 10.1038/ng.3211. Epub 2015 Feb 2.
3
LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis.LD Hub:一个集中式数据库和网络界面,用于执行连锁不平衡(LD)评分回归,最大限度地发挥汇总水平全基因组关联研究(GWAS)数据在单核苷酸多态性(SNP)遗传力和遗传相关性分析方面的潜力。
Bioinformatics. 2017 Jan 15;33(2):272-279. doi: 10.1093/bioinformatics/btw613. Epub 2016 Sep 22.
4
PhenoSpD: an integrated toolkit for phenotypic correlation estimation and multiple testing correction using GWAS summary statistics.PhenoSpD:一个整合的工具包,用于使用 GWAS 汇总统计数据进行表型相关性估计和多重检验校正。
Gigascience. 2018 Aug 1;7(8):giy090. doi: 10.1093/gigascience/giy090.
5
Leveraging LD eigenvalue regression to improve the estimation of SNP heritability and confounding inflation.利用 LD 特征值回归提高 SNP 遗传力和混杂膨胀的估计。
Am J Hum Genet. 2022 May 5;109(5):802-811. doi: 10.1016/j.ajhg.2022.03.013. Epub 2022 Apr 13.
6
Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics.利用 GWAS 汇总统计数据估计复杂性状遗传相关性的方法比较。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa442.
7
Genome-wide association study meta-analysis identifies three novel loci for circulating anti-Müllerian hormone levels in women.全基因组关联研究荟萃分析确定了女性循环抗苗勒管激素水平的三个新基因座。
Hum Reprod. 2022 May 3;37(5):1069-1082. doi: 10.1093/humrep/deac028.
8
Gene-based association tests in family samples using GWAS summary statistics.基于基因的家系样本关联分析,使用 GWAS 汇总统计数据。
Genet Epidemiol. 2024 Apr;48(3):103-113. doi: 10.1002/gepi.22548. Epub 2024 Feb 5.
9
PRED-LD: efficient imputation of GWAS summary statistics.PRED-LD:全基因组关联研究汇总统计数据的高效估算
BMC Bioinformatics. 2025 Apr 16;26(1):107. doi: 10.1186/s12859-025-06119-y.
10
Prospects of Fine-Mapping Trait-Associated Genomic Regions by Using Summary Statistics from Genome-wide Association Studies.利用全基因组关联研究的汇总统计信息对性状相关基因组区域进行精细定位的前景
Am J Hum Genet. 2017 Oct 5;101(4):539-551. doi: 10.1016/j.ajhg.2017.08.012. Epub 2017 Sep 21.

引用本文的文献

1
A meta-analysis of genome-wide association studies revealed significant QTL and candidate genes for loin muscle area in three breeding pigs.一项全基因组关联研究的荟萃分析揭示了三个品种种猪背最长肌面积的显著数量性状位点和候选基因。
Sci Rep. 2025 May 28;15(1):18758. doi: 10.1038/s41598-025-00819-4.