超越预测性R：分位数回归和非等效性检验揭示了性状与多基因评分之间的复杂关系。

Beyond predictive R: Quantile regression and non-equivalence tests reveal complex relationships of traits and polygenic scores.

作者信息

Mefford Joel, Smullen Molly, Zhang Felix, Sadowski Michal, Border Richard, Dahl Andy, Flint Jonathan, Zaitlen Noah

机构信息

Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, Los Angeles, CA, USA.

Chan Medical School, University of Massachusetts, Worcester, MA, USA.

出版信息

Am J Hum Genet. 2025 Jun 5;112(6):1363-1375. doi: 10.1016/j.ajhg.2025.04.013.

DOI:10.1016/j.ajhg.2025.04.013

PMID:40480198

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12256909/

Abstract

Polygenic scores (PGSs) are genetic predictions of trait values or disease risk that are increasingly finding applications in clinical predictive models and basic genetics research. However, the predictive value of a PGS can vary within similar population groups, depending on characteristics such as the environmental exposures, sex, age, or socioeconomic status of the individuals. To maximize the value of a PGS, approaches to screen trait-PGS pairs for evidence of such heterogeneity without having to specify the relevant exposure or individual characteristics would be useful. Here, in analyses from the UK Biobank, we show that a PGS's predictive accuracy depends on the quantile of the phenotypic distribution to which the PGS is being applied. We quantify differences in predictive value across the phenotypic range using quantile regression linear models to estimate quantile-specific effect sizes for linear models of phenotype values as a function of PGS. Of 25 continuous traits, only three have no quantile-specific effect sizes that varied by at least 1.2-fold from the ordinary least squares estimate. Through simulation, we demonstrate that this heterogeneity of PGS predictive value can arise from gene-by-environment interactions. Our approach can be used to flag traits where the use of PGSs warrants extra caution, and perhaps stratification variables should be sought and used because PGSs perform substantially differently in portions of the sampled population than expected from quoted predictive R or incremental R values that represent average performance across a dataset.

摘要

多基因分数（PGS）是对性状值或疾病风险的遗传预测，越来越多地应用于临床预测模型和基础遗传学研究。然而，PGS的预测价值在相似人群组中可能会有所不同，这取决于个体的环境暴露、性别、年龄或社会经济地位等特征。为了最大化PGS的价值，筛选性状-PGS对以寻找这种异质性证据而无需指定相关暴露或个体特征的方法将很有用。在此，在英国生物银行的分析中，我们表明PGS的预测准确性取决于应用PGS的表型分布分位数。我们使用分位数回归线性模型来估计表型值线性模型的分位数特定效应大小，以此作为PGS的函数，从而量化整个表型范围内预测价值的差异。在25个连续性状中，只有三个没有分位数特定效应大小，其与普通最小二乘估计的差异至少为1.2倍。通过模拟，我们证明PGS预测价值的这种异质性可能源于基因-环境相互作用。我们的方法可用于标记那些使用PGS需要格外谨慎的性状，也许应该寻找并使用分层变量，因为PGS在抽样人群的某些部分中的表现与代表数据集平均表现的引用预测R或增量R值所预期的有很大不同。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3c4/12256909/dda1121fd64f/gr1.jpg

相似文献

Beyond predictive R: Quantile regression and non-equivalence tests reveal complex relationships of traits and polygenic scores.超越预测性R：分位数回归和非等效性检验揭示了性状与多基因评分之间的复杂关系。

Am J Hum Genet. 2025 Jun 5;112(6):1363-1375. doi: 10.1016/j.ajhg.2025.04.013.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。

Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Bias and Precision of Parameter Estimates from Models Using Polygenic Scores to Estimate Environmental and Genetic Parental Influences.基于多基因评分模型估计环境和遗传父母影响的参数估计的偏差和精度。

Behav Genet. 2021 May;51(3):279-288. doi: 10.1007/s10519-020-10033-9. Epub 2020 Dec 10.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

A Litmus Test for Confounding in Polygenic Scores.多基因评分中混杂因素的石蕊试验

bioRxiv. 2025 Feb 4:2025.02.01.635985. doi: 10.1101/2025.02.01.635985.

Thromboprophylaxis during pregnancy and the puerperium: a systematic review and economic evaluation to estimate the value of future research.妊娠期和产褥期的血栓预防：一项系统评价和经济评估，以估算未来研究的价值。

Health Technol Assess. 2024 Mar;28(9):1-176. doi: 10.3310/DFWT3873.

Testing for differences in polygenic scores in the presence of confounding.在存在混杂因素的情况下对多基因分数差异进行检测。

Genetics. 2025 Jun 4;230(2). doi: 10.1093/genetics/iyaf071.

引用本文的文献

Three Open Questions in Polygenic Score Portability.多基因评分可移植性的三个开放性问题。

bioRxiv. 2024 Aug 21:2024.08.20.608703. doi: 10.1101/2024.08.20.608703.

本文引用的文献

Characterizing the genetic architecture of drug response using gene-context interaction methods.使用基因背景相互作用方法表征药物反应的遗传结构。

Cell Genom. 2024 Dec 11;4(12):100722. doi: 10.1016/j.xgen.2024.100722. Epub 2024 Dec 4.

A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release.英国生物银行多基因风险评分（PRS）发布的性能和特征的系统评价。

PLoS One. 2024 Sep 18;19(9):e0307270. doi: 10.1371/journal.pone.0307270. eCollection 2024.

Genotype error due to low-coverage sequencing induces uncertainty in polygenic scoring.由于低覆盖度测序导致的基因型错误会给多基因评分带来不确定性。

Am J Hum Genet. 2023 Aug 3;110(8):1319-1329. doi: 10.1016/j.ajhg.2023.06.015. Epub 2023 Jul 24.

Polygenic scoring accuracy varies across the genetic ancestry continuum.多基因评分准确性在遗传祖先连续体上有所差异。

Nature. 2023 Jun;618(7966):774-781. doi: 10.1038/s41586-023-06079-4. Epub 2023 May 17.

Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals.在 300 万人的全基因组关联分析中，对家庭内和家庭间的受教育程度进行多基因预测。

Nat Genet. 2022 Apr;54(4):437-449. doi: 10.1038/s41588-022-01016-z. Epub 2022 Mar 31.

Polygenic scores in biomedical research.多基因评分在生物医学研究中的应用。

Nat Rev Genet. 2022 Sep;23(9):524-532. doi: 10.1038/s41576-022-00470-z. Epub 2022 Mar 30.

Calculating Polygenic Risk Scores (PRS) in UK Biobank: A Practical Guide for Epidemiologists.在英国生物银行中计算多基因风险评分（PRS）：流行病学家实用指南

Front Genet. 2022 Feb 18;13:818574. doi: 10.3389/fgene.2022.818574. eCollection 2022.

A polygenic-score-based approach for identification of gene-drug interactions stratifying breast cancer risk.基于多基因评分的方法鉴定基因-药物相互作用分层乳腺癌风险。

Am J Hum Genet. 2021 Sep 2;108(9):1752-1764. doi: 10.1016/j.ajhg.2021.07.008. Epub 2021 Aug 6.

Educational attainment polygenic score predicts inhibitory control and academic skills in early and middle childhood.教育程度多基因评分可预测儿童早期和中期的抑制控制和学业技能。

Genes Brain Behav. 2021 Sep;20(7):e12762. doi: 10.1111/gbb.12762. Epub 2021 Aug 3.

Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.利用个体水平的遗传数据和 GWAS 汇总统计数据可以提高多基因预测。

Am J Hum Genet. 2021 Jun 3;108(6):1001-1011. doi: 10.1016/j.ajhg.2021.04.014. Epub 2021 May 7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

超越预测性R：分位数回归和非等效性检验揭示了性状与多基因评分之间的复杂关系。

Beyond predictive R: Quantile regression and non-equivalence tests reveal complex relationships of traits and polygenic scores.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献