Department of Genetics and Genomic Sciences, Icahn School of Medicine, Mount Sinai, New York City, New York, United States of America.
The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America.
PLoS Genet. 2023 Feb 7;19(2):e1010624. doi: 10.1371/journal.pgen.1010624. eCollection 2023 Feb.
Polygenic risk scores (PRSs) have been among the leading advances in biomedicine in recent years. As a proxy of genetic liability, PRSs are utilised across multiple fields and applications. While numerous statistical and machine learning methods have been developed to optimise their predictive accuracy, these typically distil genetic liability to a single number based on aggregation of an individual's genome-wide risk alleles. This results in a key loss of information about an individual's genetic profile, which could be critical given the functional sub-structure of the genome and the heterogeneity of complex disease. In this manuscript, we introduce a 'pathway polygenic' paradigm of disease risk, in which multiple genetic liabilities underlie complex diseases, rather than a single genome-wide liability. We describe a method and accompanying software, PRSet, for computing and analysing pathway-based PRSs, in which polygenic scores are calculated across genomic pathways for each individual. We evaluate the potential of pathway PRSs in two distinct ways, creating two major sections: (1) In the first section, we benchmark PRSet as a pathway enrichment tool, evaluating its capacity to capture GWAS signal in pathways. We find that for target sample sizes of >10,000 individuals, pathway PRSs have similar power for evaluating pathway enrichment as leading methods MAGMA and LD score regression, with the distinct advantage of providing individual-level estimates of genetic liability for each pathway -opening up a range of pathway-based PRS applications, (2) In the second section, we evaluate the performance of pathway PRSs for disease stratification. We show that using a supervised disease stratification approach, pathway PRSs (computed by PRSet) outperform two standard genome-wide PRSs (computed by C+T and lassosum) for classifying disease subtypes in 20 of 21 scenarios tested. As the definition and functional annotation of pathways becomes increasingly refined, we expect pathway PRSs to offer key insights into the heterogeneity of complex disease and treatment response, to generate biologically tractable therapeutic targets from polygenic signal, and, ultimately, to provide a powerful path to precision medicine.
多基因风险评分(PRSs)是近年来生物医学领域的主要进展之一。作为遗传易感性的代理,PRSs 在多个领域和应用中得到了广泛应用。虽然已经开发了许多统计和机器学习方法来优化其预测准确性,但这些方法通常会根据个体全基因组风险等位基因的聚集将遗传易感性简化为一个单一的数字。这导致个体遗传谱的关键信息丢失,考虑到基因组的功能亚结构和复杂疾病的异质性,这可能是至关重要的。在本文中,我们引入了一种疾病风险的“途径多基因”范式,其中多个遗传易感性导致复杂疾病,而不是单一的全基因组易感性。我们描述了一种计算和分析基于途径的 PRS 的方法和配套软件 PRSet,其中为每个个体计算跨基因组途径的多基因评分。我们通过两种不同的方式评估途径 PRS 的潜力,分为两个主要部分:(1)在第一部分中,我们将 PRSet 作为一种途径富集工具进行基准测试,评估其捕获 GWAS 信号在途径中的能力。我们发现,对于目标样本量大于 10000 个个体,途径 PRS 在评估途径富集方面具有与领先方法 MAGMA 和 LD 得分回归相似的能力,具有提供每个途径遗传易感性个体水平估计的独特优势-开辟了一系列基于途径的 PRS 应用途径,(2)在第二部分中,我们评估了途径 PRS 用于疾病分层的性能。我们表明,使用有监督的疾病分层方法,途径 PRS(由 PRSet 计算)在 21 种测试场景中的 20 种情况下优于两种标准的全基因组 PRS(由 C+T 和 lassosum 计算)用于疾病亚型的分类。随着途径的定义和功能注释变得越来越精细,我们预计途径 PRS 将为复杂疾病和治疗反应的异质性提供关键见解,从多基因信号中生成可操作的治疗靶点,并最终为精准医学提供强大的途径。