A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts.

Authors

Pärna Katri, Nolte Ilja M, Snieder Harold, Fischer Krista, Marnetto Davide, Pagani Luca

Affiliations

Institute of Genomics, University of Tartu, Tartu, Estonia.

Department of Epidemiology, University of Groningen, Groningen, Netherlands.

Publication information

Front Genet. 2022 Jul 18;13:899523. doi: 10.3389/fgene.2022.899523. eCollection 2022.

DOI:10.3389/fgene.2022.899523

PMID:35923706

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9340200/

Abstract

One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS.
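
The second question in the abstract, whether the validation model should be adjusted for PCs, can be illustrated with a small simulation. The sketch below is not the authors' pipeline: the data are synthetic, the effect sizes are arbitrary, the names (`r_squared`, `incremental_r2`) are hypothetical, and incremental R² is assumed here as the measure of PRS performance. It builds a trait that correlates with the first PC and a PRS that still carries some of that structure, then compares the variance attributed to the PRS with and without PC covariates in the validation model.

```python
import numpy as np

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit of y on X (intercept added)."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1.0 - resid.var() / y.var()

def incremental_r2(prs, y, covariates=None):
    """Gain in R^2 from adding the PRS to a baseline model of covariates."""
    if covariates is None:
        return r_squared(prs.reshape(-1, 1), y)
    base = r_squared(covariates, y)
    full = r_squared(np.column_stack([covariates, prs]), y)
    return full - base

# Synthetic target cohort (all values are illustrative assumptions).
rng = np.random.default_rng(0)
n = 5000
pcs = rng.normal(size=(n, 4))   # stand-in for genetic principal components
genetic = rng.normal(size=n)    # true polygenic component of the trait

# Trait that is genuinely correlated with population structure via PC1.
trait = 0.5 * genetic + 0.4 * pcs[:, 0] + rng.normal(size=n)

# PRS that retains residual structure despite PC correction in the GWAS.
prs = genetic + 0.5 * pcs[:, 0] + rng.normal(scale=0.5, size=n)

unadjusted = incremental_r2(prs, trait)
adjusted = incremental_r2(prs, trait, covariates=pcs)
print(f"PRS R^2 without PC adjustment:   {unadjusted:.3f}")
print(f"PRS incremental R^2 beyond PCs:  {adjusted:.3f}")
```

In this toy setting the unadjusted value is larger because part of the apparent PRS signal is population structure shared by the score and the trait. Adjusting the validation model for PCs removes that share, which is the residual confounding the abstract describes.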
