• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因关联荟萃分析容易受到研究间隐秘相关性造成的混杂影响。

Genetic association meta-analysis is susceptible to confounding by between-study cryptic relatedness.

作者信息

Tu Tiffany, Ochoa Alejandro

机构信息

Program of Computational Biology and Bioinformatics, Duke University, Durham, NC.

Department of Biostatistics and Bioinformatics, Duke University, Durham, NC.

出版信息

bioRxiv. 2025 May 12:2025.05.10.653279. doi: 10.1101/2025.05.10.653279.

DOI:10.1101/2025.05.10.653279
PMID:40463146
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12132175/
Abstract

Meta-analysis of Genome-Wide Association Studies (GWAS) has important advantages, but it assumes that studies are independent, which does not hold when there is relatedness between studies. As a motivating example, recent work suggested applying sex-stratified meta-analysis to correct for participation bias, without considering that men and women from the same population will be highly related. Our theory demonstrates how cryptic relatedness results in correlated test statistics between studies, inflating meta-analysis. We characterize the effects of different between-study relatedness scenarios, particularly population structure and recent family relatedness, on meta-analysis type I error control and power. We simulated data with (1) no family relatedness between subpopulations, (2) family relatedness within subpopulations, (3) family relatedness across subpopulations, and (4) single population with family relatedness. We run joint and meta-analyses on simulations using both binary and quantitative traits. In scenarios with family relatedness, sex-stratified meta-analysis exhibits severe inflation and lower AUC compared to joint and subpopulation meta-analyses. Remarkably, genomic control succeeds in correcting inflation in these cases, but does not alter calibrated power. Analysis of real datasets confirms severe inflation for sex-stratified meta-analysis in family studies, but a negligible effect for population studies with up to 10,000 individuals. Our theoretical framework demonstrates that the inflation factor increases as the sample size increases. We recommend against meta-analyzing studies that share the same populations, which increases the risk of inflation due to cryptic relatedness between studies.

摘要

全基因组关联研究(GWAS)的荟萃分析具有重要优势,但它假定各研究是独立的,而当研究之间存在相关性时这一假定并不成立。作为一个具有启发性的例子,近期的研究表明应用按性别分层的荟萃分析来校正参与偏倚,但未考虑来自同一人群的男性和女性会高度相关。我们的理论证明了隐性相关性如何导致研究之间的检验统计量相关,从而使荟萃分析结果膨胀。我们描述了不同的研究间相关性情形,特别是群体结构和近期家族相关性,对荟萃分析I型错误控制和效能的影响。我们模拟了以下几种数据情况:(1)亚群体之间无家族相关性;(2)亚群体内部存在家族相关性;(3)亚群体之间存在家族相关性;(4)单个群体存在家族相关性。我们对使用二元性状和定量性状的模拟数据进行了联合分析和荟萃分析。在存在家族相关性的情形下,与联合分析和亚群体荟萃分析相比,按性别分层的荟萃分析表现出严重的结果膨胀和更低的曲线下面积(AUC)。值得注意的是,基因组控制在这些情况下成功校正了结果膨胀,但未改变校准后的效能。对真实数据集的分析证实,在家族研究中按性别分层的荟萃分析存在严重的结果膨胀,但对于个体数量多达10,000的群体研究,其影响可忽略不计。我们的理论框架表明,膨胀因子会随着样本量的增加而增大。我们建议不要对来自同一人群的研究进行荟萃分析,因为这会增加由于研究之间的隐性相关性而导致结果膨胀的风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/3cb0b08f3196/nihpp-2025.05.10.653279v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/96df150a883e/nihpp-2025.05.10.653279v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/16fa0f82f8f2/nihpp-2025.05.10.653279v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/26fc68ed6b72/nihpp-2025.05.10.653279v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/a3bb0fc646d2/nihpp-2025.05.10.653279v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/a785c565b5a1/nihpp-2025.05.10.653279v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/0c3d2bcedb2d/nihpp-2025.05.10.653279v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/3cb0b08f3196/nihpp-2025.05.10.653279v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/96df150a883e/nihpp-2025.05.10.653279v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/16fa0f82f8f2/nihpp-2025.05.10.653279v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/26fc68ed6b72/nihpp-2025.05.10.653279v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/a3bb0fc646d2/nihpp-2025.05.10.653279v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/a785c565b5a1/nihpp-2025.05.10.653279v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/0c3d2bcedb2d/nihpp-2025.05.10.653279v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/459b/12132175/3cb0b08f3196/nihpp-2025.05.10.653279v1-f0007.jpg

相似文献

1
Genetic association meta-analysis is susceptible to confounding by between-study cryptic relatedness.基因关联荟萃分析容易受到研究间隐秘相关性造成的混杂影响。
bioRxiv. 2025 May 12:2025.05.10.653279. doi: 10.1101/2025.05.10.653279.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
4
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
5
Sex as a prognostic factor for mortality in adults with acute symptomatic pulmonary embolism.性别作为急性症状性肺栓塞成年患者死亡率的一个预后因素。
Cochrane Database Syst Rev. 2025 Mar 20;3(3):CD013835. doi: 10.1002/14651858.CD013835.pub2.
6
Electronic cigarettes for smoking cessation.电子烟戒烟。
Cochrane Database Syst Rev. 2024 Jan 8;1(1):CD010216. doi: 10.1002/14651858.CD010216.pub8.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
8
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
9
Pharmacological treatments in panic disorder in adults: a network meta-analysis.成人惊恐障碍的药物治疗:网络荟萃分析。
Cochrane Database Syst Rev. 2023 Nov 28;11(11):CD012729. doi: 10.1002/14651858.CD012729.pub3.
10
Electronic cigarettes for smoking cessation.用于戒烟的电子烟。
Cochrane Database Syst Rev. 2025 Jan 29;1(1):CD010216. doi: 10.1002/14651858.CD010216.pub9.

本文引用的文献

1
Evaluating multi-ancestry genome-wide association methods: Statistical power, population structure, and practical implications.评估多祖先全基因组关联方法:统计功效、群体结构及实际意义。
Am J Hum Genet. 2025 Aug 28. doi: 10.1016/j.ajhg.2025.08.006.
2
Secure and federated genome-wide association studies for biobank-scale datasets.针对生物样本库规模数据集的安全且联合的全基因组关联研究。
Nat Genet. 2025 Apr;57(4):809-814. doi: 10.1038/s41588-025-02109-1. Epub 2025 Feb 24.
3
Multi-ancestry genome-wide association analyses: a comparison of meta- and mega-analyses in the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study.
多血统全基因组关联分析:高血糖与不良妊娠结局(HAPO)研究中荟萃分析与大型分析的比较
BMC Genomics. 2025 Jan 23;26(1):65. doi: 10.1186/s12864-025-11229-1.
4
Secure discovery of genetic relatives across large-scale and distributed genomic data sets.在大规模和分布式基因组数据集上安全发现遗传亲属。
Genome Res. 2024 Oct 11;34(9):1312-1323. doi: 10.1101/gr.279057.124.
5
Efficacy of federated learning on genomic data: a study on the UK Biobank and the 1000 Genomes Project.联邦学习在基因组数据上的功效:对英国生物银行和千人基因组计划的一项研究。
Front Big Data. 2024 Feb 29;7:1266031. doi: 10.3389/fdata.2024.1266031. eCollection 2024.
6
Searching across-cohort relatives in 54,092 GWAS samples via encrypted genotype regression.通过加密基因型回归,在 54092 个 GWAS 样本中搜索跨队列亲属。
PLoS Genet. 2024 Jan 11;20(1):e1011037. doi: 10.1371/journal.pgen.1011037. eCollection 2024 Jan.
7
Limitations of principal components in quantitative genetic association models for human studies.主成分在人类研究定量遗传关联模型中的局限性。
Elife. 2023 May 4;12:e79238. doi: 10.7554/eLife.79238.
8
Privacy-aware estimation of relatedness in admixed populations.混合人群中具有隐私意识的亲缘关系估计。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac473.
9
A theory-based practical solution to correct for sex-differential participation bias.一种基于理论的实用解决方案,可纠正性别差异参与偏差。
Genome Biol. 2022 Jun 27;23(1):138. doi: 10.1186/s13059-022-02703-0.
10
sPLINK: a hybrid federated tool as a robust alternative to meta-analysis in genome-wide association studies.sPLINK:一种混合联邦工具,是全基因组关联研究中替代荟萃分析的强大选择。
Genome Biol. 2022 Jan 24;23(1):32. doi: 10.1186/s13059-021-02562-1.