• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于主成分分析的方法,用于解决欧洲队列中多基因风险评分的可转移性问题。

A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts.

作者信息

Pärna Katri, Nolte Ilja M, Snieder Harold, Fischer Krista, Marnetto Davide, Pagani Luca

机构信息

Institute of Genomics, University of Tartu, Tartu, Estonia.

Department of Epidemiology, University of Groningen, Groningen, Netherlands.

出版信息

Front Genet. 2022 Jul 18;13:899523. doi: 10.3389/fgene.2022.899523. eCollection 2022.

DOI:10.3389/fgene.2022.899523
PMID:35923706
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9340200/
Abstract

One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS.

摘要

全基因组关联研究(GWAS)中的一个重要混杂因素是群体遗传结构,如果不加以适当考虑,可能会产生虚假关联。这最终可能导致多基因风险评分(PRS)预测出现偏差,尤其是当应用于另一群体时。为了探讨这个问题,我们重点研究了主成分分析(PCA),并询问基于来自外部参考群体的主成分的群体遗传学策略是否有助于缓解PRS可转移性问题。在整个研究中,我们使用了身高和体重指数这两个复杂的模型性状,以及来自英国生物银行和爱沙尼亚生物银行的样本。我们旨在研究:1)在发现队列中使用参考群体(1000基因组计划)计算调整后的主成分是否能提高在来自另一群体的目标集中所得的PRS性能;2)是否根本需要针对主成分调整验证模型。我们的结果表明,任何其他主成分集的表现都比在与发现数据集相同群体的样本上计算的主成分集更差。此外,我们表明,GWAS中的主成分校正无法防止PRS中残留的群体结构信息,对于非结构化性状也是如此。因此,我们证实,当所研究的性状与群体遗传结构存在实际关联时,在验证模型中进行主成分校正有助于在评估PRS预测价值时考虑残留的混杂效应。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/cf2587b8f788/fgene-13-899523-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/e6a8bc29d366/fgene-13-899523-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/df86e58ecfd2/fgene-13-899523-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/cf2587b8f788/fgene-13-899523-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/e6a8bc29d366/fgene-13-899523-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/df86e58ecfd2/fgene-13-899523-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac1a/9340200/cf2587b8f788/fgene-13-899523-g003.jpg

相似文献

1
A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts.一种基于主成分分析的方法,用于解决欧洲队列中多基因风险评分的可转移性问题。
Front Genet. 2022 Jul 18;13:899523. doi: 10.3389/fgene.2022.899523. eCollection 2022.
2
Polygenic prediction of major depressive disorder and related traits in African ancestries UK Biobank participants.非洲裔英国生物银行参与者中重度抑郁症及相关性状的多基因预测
Mol Psychiatry. 2025 Jan;30(1):151-157. doi: 10.1038/s41380-024-02662-x. Epub 2024 Jul 16.
3
Polygenic prediction of major depressive disorder and related traits in African ancestries UK Biobank participants.非洲裔英国生物银行参与者中重度抑郁症及相关特征的多基因预测
medRxiv. 2023 Dec 28:2023.12.24.23300412. doi: 10.1101/2023.12.24.23300412.
4
Improved genetic prediction of the risk of knee osteoarthritis using the risk factor-based polygenic score.基于风险因素的多基因风险评分可提高膝关节骨关节炎风险的遗传预测。
Arthritis Res Ther. 2023 Jun 12;25(1):103. doi: 10.1186/s13075-023-03082-y.
5
The construction of cross-population polygenic risk scores using transfer learning.使用迁移学习构建跨人群多基因风险评分。
Am J Hum Genet. 2022 Nov 3;109(11):1998-2008. doi: 10.1016/j.ajhg.2022.09.010. Epub 2022 Oct 13.
6
Improving polygenic prediction in ancestrally diverse populations.提高在祖源多样化人群中的多基因预测能力。
Nat Genet. 2022 May;54(5):573-580. doi: 10.1038/s41588-022-01054-7. Epub 2022 May 5.
7
Transferability of Alzheimer Disease Polygenic Risk Score Across Populations and Its Association With Alzheimer Disease-Related Phenotypes.阿尔茨海默病多基因风险评分在人群间的可转移性及其与阿尔茨海默病相关表型的关系。
JAMA Netw Open. 2022 Dec 1;5(12):e2247162. doi: 10.1001/jamanetworkopen.2022.47162.
8
Multiancestral polygenic risk score for pediatric asthma.多祖先多基因风险评分与儿童哮喘。
J Allergy Clin Immunol. 2022 Nov;150(5):1086-1096. doi: 10.1016/j.jaci.2022.03.035. Epub 2022 May 18.
9
A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits.利用多基因性状的遗传相关性进行跨人群性状预测的统一框架。
Am J Hum Genet. 2021 Apr 1;108(4):632-655. doi: 10.1016/j.ajhg.2021.03.002. Epub 2021 Mar 25.
10
Integrating multiple traits for improving polygenic risk prediction in disease and pharmacogenomics GWAS.综合多种特征以提高疾病和药物基因组学 GWAS 中的多基因风险预测。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad181.

引用本文的文献

1
Ancestral genetic components are consistently associated with the complex trait landscape in European biobanks.祖先遗传成分在欧洲生物库中与复杂性状图谱始终保持一致。
Eur J Hum Genet. 2024 Nov;32(11):1492-1499. doi: 10.1038/s41431-024-01678-9. Epub 2024 Aug 10.
2
Influences of genetically predicted and attained education on geographic mobility and their association with mortality.遗传预测教育和实际获得的教育对地理流动性的影响及其与死亡率的关系。
Soc Sci Med. 2023 May;324:115882. doi: 10.1016/j.socscimed.2023.115882. Epub 2023 Mar 31.

本文引用的文献

1
Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.245 个多基因评分在英国生物样本库中得出并应用于来自同一队列的 9 个祖先群体时的可转移性。
Am J Hum Genet. 2022 Jan 6;109(1):12-23. doi: 10.1016/j.ajhg.2021.11.008.
2
Demographic history mediates the effect of stratification on polygenic scores.人口统计学历史调节了分层对多基因评分的影响。
Elife. 2020 Nov 17;9:e61548. doi: 10.7554/eLife.61548.
3
Dutch population structure across space, time and GWAS design.荷兰人口结构在空间、时间和全基因组关联研究设计方面的情况。
Nat Commun. 2020 Sep 11;11(1):4556. doi: 10.1038/s41467-020-18418-4.
4
Polygenic Scores for Height in Admixed Populations.混合人群中身高的多基因评分
G3 (Bethesda). 2020 Nov 5;10(11):4027-4036. doi: 10.1534/g3.120.401658.
5
Differences in local population history at the finest level: the case of the Estonian population.最细微层面的局部人口历史差异:以爱沙尼亚人口为例。
Eur J Hum Genet. 2020 Nov;28(11):1580-1591. doi: 10.1038/s41431-020-0699-4. Epub 2020 Jul 25.
6
Tutorial: a guide to performing polygenic risk score analyses.教程:多基因风险评分分析操作指南。
Nat Protoc. 2020 Sep;15(9):2759-2772. doi: 10.1038/s41596-020-0353-1. Epub 2020 Jul 24.
7
Validating the doubly weighted genetic risk score for the prediction of type 2 diabetes in the Lifelines and Estonian Biobank cohorts.验证用于预测 2 型糖尿病的双重加权遗传风险评分在 Lifelines 和爱沙尼亚生物库队列中的表现。
Genet Epidemiol. 2020 Sep;44(6):589-600. doi: 10.1002/gepi.22327. Epub 2020 Jun 14.
8
Ancestry deconvolution and partial polygenic score can improve susceptibility predictions in recently admixed individuals.祖源去卷积和部分多基因评分可提高近期混合人群的易感性预测。
Nat Commun. 2020 Apr 2;11(1):1628. doi: 10.1038/s41467-020-15464-w.
9
Dimensionality reduction reveals fine-scale structure in the Japanese population with consequences for polygenic risk prediction.降维揭示了日本人群中的精细结构,对多基因风险预测有影响。
Nat Commun. 2020 Mar 26;11(1):1569. doi: 10.1038/s41467-020-15194-z.
10
Variable prediction accuracy of polygenic scores within an ancestry group.群体内多基因评分的预测准确性存在差异。
Elife. 2020 Jan 30;9:e48376. doi: 10.7554/eLife.48376.