• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估多血统全基因组关联方法:统计功效、群体结构及实际意义

Evaluating Multi-Ancestry Genome-Wide Association Methods: Statistical Power, Population Structure, and Practical Implications.

作者信息

Dias Julie-Alexia, Chen Tony, Xing Hua, Wang Xiaoyu, Rodriguez Alex A, Madduri Ravi K, Kraft Peter, Zhang Haoyu

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.

Division of Cancer Epidemiology & Genetics, National Cancer Institute, National Instituters of Health, Bethesda, MD, USA.

出版信息

medRxiv. 2025 Mar 12:2025.03.11.25323772. doi: 10.1101/2025.03.11.25323772.

DOI:10.1101/2025.03.11.25323772
PMID:40162269
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11952640/
Abstract

The increasing availability of diverse biobanks has enabled multi-ancestry genome-wide association studies (GWAS), enhancing the discovery of genetic variants across traits and diseases. However, the choice of an optimal method remains debated due to challenges in statistical power differences across ancestral groups and approaches to account for population structure. Two primary strategies exist: (1) Pooled analysis, which combines individuals from all genetic backgrounds into a single dataset while adjusting for population stratification using principal components, increasing the sample size and statistical power but requiring careful control of population stratification. (2) Meta-analysis, which performs ancestry-group-specific GWAS and subsequently combines summary statistics, potentially capturing fine-scale population structure, but facing limitations in handling admixed individuals. Using large-scale simulations with varying sample sizes and ancestry compositions, we compare these methods alongside real data analyses of eight continuous and five binary traits from the UK Biobank ( ) and All of Us Research Program ( ). Our results demonstrate that pooled analysis generally exhibits better statistical power while effectively adjusting for population stratification. We further present a theoretical framework linking power differences to allele frequency variations across populations. These findings, validated across both biobanks, highlight pooled analysis as a robust and scalable strategy for multi-ancestry GWAS, improving genetic discovery while maintaining rigorous population structure control.

摘要

越来越多不同的生物样本库使得多祖先全基因组关联研究(GWAS)成为可能,增强了对跨性状和疾病的遗传变异的发现。然而,由于不同祖先群体在统计效力差异以及考虑群体结构的方法方面存在挑战,最优方法的选择仍存在争议。存在两种主要策略:(1)合并分析,即将来自所有遗传背景的个体合并到一个数据集中,同时使用主成分调整群体分层,增加样本量和统计效力,但需要仔细控制群体分层。(2)荟萃分析,即进行特定祖先群体的GWAS,随后合并汇总统计数据,可能捕捉到精细的群体结构,但在处理混合个体方面存在局限性。通过使用具有不同样本量和祖先组成的大规模模拟,我们将这些方法与英国生物样本库( )和“我们所有人”研究计划( )中八个连续性状和五个二元性状的实际数据分析进行了比较。我们的结果表明,合并分析通常具有更好的统计效力,同时能有效调整群体分层。我们进一步提出了一个理论框架,将效力差异与不同群体间的等位基因频率变化联系起来。这些在两个生物样本库中均得到验证的发现,突出了合并分析作为多祖先GWAS的一种稳健且可扩展的策略,在保持严格的群体结构控制的同时改善了遗传发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/15a0d44ec0a9/nihpp-2025.03.11.25323772v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/66ae2eb69904/nihpp-2025.03.11.25323772v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/361b04b8a2e0/nihpp-2025.03.11.25323772v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/536b8a83c145/nihpp-2025.03.11.25323772v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/15a0d44ec0a9/nihpp-2025.03.11.25323772v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/66ae2eb69904/nihpp-2025.03.11.25323772v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/361b04b8a2e0/nihpp-2025.03.11.25323772v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/536b8a83c145/nihpp-2025.03.11.25323772v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c26/11952640/15a0d44ec0a9/nihpp-2025.03.11.25323772v1-f0004.jpg

相似文献

1
Evaluating Multi-Ancestry Genome-Wide Association Methods: Statistical Power, Population Structure, and Practical Implications.评估多血统全基因组关联方法:统计功效、群体结构及实际意义
medRxiv. 2025 Mar 12:2025.03.11.25323772. doi: 10.1101/2025.03.11.25323772.
2
Evaluating multi-ancestry genome-wide association methods: Statistical power, population structure, and practical implications.评估多祖先全基因组关联方法:统计功效、群体结构及实际意义。
Am J Hum Genet. 2025 Aug 28. doi: 10.1016/j.ajhg.2025.08.006.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Extending Genome-Wide Association Studies to admixed cohorts with high degrees of relatedness.将全基因组关联研究扩展至具有高度亲缘关系的混合队列。
medRxiv. 2025 Jun 9:2025.05.27.25328444. doi: 10.1101/2025.05.27.25328444.
5
All of Us diversity and scale improve polygenic prediction contextually with greatest improvements for under-represented populations.“我们所有人”项目的多样性和规模在情境中改善了多基因预测,对代表性不足的人群改善最大。
bioRxiv. 2024 Aug 6:2024.08.06.606846. doi: 10.1101/2024.08.06.606846.
6
Genetic variants influencing liver fat in normal-weight individuals of European ancestry.影响欧洲血统正常体重个体肝脏脂肪的基因变异。
JHEP Rep. 2025 May 14;7(8):101453. doi: 10.1016/j.jhepr.2025.101453. eCollection 2025 Aug.
7
Fine-mapping in admixed populations using CARMA-X, with applications to Latin American studies.使用CARMA-X在混合人群中进行精细定位及其在拉丁美洲研究中的应用。
Am J Hum Genet. 2025 May 1;112(5):1215-1232. doi: 10.1016/j.ajhg.2025.02.020. Epub 2025 Mar 26.
8
Leveraging local ancestry and cross-ancestry genetic architecture to improve genetic prediction of complex traits in admixed populations.利用本地祖先和跨祖先遗传结构改善混合人群复杂性状的遗传预测。
Am J Hum Genet. 2025 Jul 3. doi: 10.1016/j.ajhg.2025.06.010.
9
Identification and Validation of Novel Combinatorial Genetic Risk Factors for Endometriosis across Multiple UK and US Patient Cohorts.英国和美国多个患者队列中子宫内膜异位症新型组合遗传风险因素的识别与验证
medRxiv. 2025 Aug 15:2025.08.13.25333595. doi: 10.1101/2025.08.13.25333595.
10
A genome-wide association study of anti-Müllerian hormone (AMH) levels in Samoan women.一项关于萨摩亚女性抗苗勒管激素(AMH)水平的全基因组关联研究。
medRxiv. 2024 Dec 8:2024.12.05.24318457. doi: 10.1101/2024.12.05.24318457.

本文引用的文献

1
Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.多样性与规模:退伍军人事务部百万退伍军人计划中2068个性状的遗传结构
Science. 2024 Jul 19;385(6706):eadj1182. doi: 10.1126/science.adj1182.
2
Sources of gene expression variation in a globally diverse human cohort.全球多样化人类群体中基因表达变异的来源。
Nature. 2024 Aug;632(8023):122-130. doi: 10.1038/s41586-024-07708-2. Epub 2024 Jul 17.
3
Multi-ancestry genome-wide association study of kidney cancer identifies 63 susceptibility regions.
多民族全基因组关联研究鉴定出 63 个肾癌易感性区域。
Nat Genet. 2024 May;56(5):809-818. doi: 10.1038/s41588-024-01725-7. Epub 2024 Apr 26.
4
An ensemble penalized regression method for multi-ancestry polygenic risk prediction.一种用于多祖裔多基因风险预测的集成惩罚回归方法。
Nat Commun. 2024 Apr 15;15(1):3238. doi: 10.1038/s41467-024-47357-7.
5
MUSSEL: Enhanced Bayesian polygenic risk prediction leveraging information across multiple ancestry groups.基于多祖先群体信息的贝类增强贝叶斯多基因风险预测
Cell Genom. 2024 Apr 10;4(4):100539. doi: 10.1016/j.xgen.2024.100539.
6
Admix-kit: an integrated toolkit and pipeline for genetic analyses of admixed populations.Admix-kit:用于混合人群遗传分析的集成工具包和管道。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae148.
7
Genomic data in the All of Us Research Program.全美国研究计划中的基因组数据。
Nature. 2024 Mar;627(8003):340-346. doi: 10.1038/s41586-023-06957-x. Epub 2024 Feb 19.
8
MESuSiE enables scalable and powerful multi-ancestry fine-mapping of causal variants in genome-wide association studies.MESuSiE 可实现全基因组关联研究中因果变异的可扩展和强大的多祖系精细映射。
Nat Genet. 2024 Jan;56(1):170-179. doi: 10.1038/s41588-023-01604-7. Epub 2024 Jan 2.
9
Multi-ancestry genome-wide association meta-analysis of Parkinson's disease.多族裔帕金森病全基因组关联荟萃分析。
Nat Genet. 2024 Jan;56(1):27-36. doi: 10.1038/s41588-023-01584-8. Epub 2023 Dec 28.
10
BridgePRS leverages shared genetic effects across ancestries to increase polygenic risk score portability.BridgePRS 利用跨种族的共享遗传效应来提高多基因风险评分的可转移性。
Nat Genet. 2024 Jan;56(1):180-186. doi: 10.1038/s41588-023-01583-9. Epub 2023 Dec 20.