School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, H91TK33, Ireland.
SFI Centre for Research Training in Genomics Data Science, National University of Ireland Galway, Galway, H91TK33, Ireland.
Sci Rep. 2021 Oct 1;11(1):19571. doi: 10.1038/s41598-021-99031-3.
Ongoing increases in the size of human genotype and phenotype collections offer the promise of improved understanding of the genetics of complex diseases. In addition to the biological insights that can be gained from the nature of the variants that contribute to the genetic component of complex trait variability, these data bring forward the prospect of predicting complex traits and the risk of complex genetic diseases from genotype data. Here we show that advances in phenotype prediction can be applied to improve the power of genome-wide association studies. We demonstrate a simple and efficient method to model genetic background effects using polygenic scores derived from SNPs that are not on the same chromosome as the target SNP. Using simulated and real data we found that this can result in a substantial increase in the number of variants passing genome-wide significance thresholds. This increase in power to detect trait-associated variants also translates into an increase in the accuracy with which the resulting polygenic score predicts the phenotype from genotype data. Our results suggest that advances in methods for phenotype prediction can be exploited to improve the control of background genetic effects, leading to more accurate GWAS results and further improvements in phenotype prediction.
人类基因型和表型集合的不断扩大,为深入了解复杂疾病的遗传学提供了可能。除了可以从导致复杂性状变异性的遗传成分的变异的本质中获得生物学见解外,这些数据还提出了从基因型数据预测复杂性状和复杂遗传疾病风险的前景。在这里,我们表明表型预测的进展可用于提高全基因组关联研究的功效。我们展示了一种简单有效的方法,该方法使用与目标 SNP 不在同一染色体上的 SNP 衍生的多基因评分来模拟遗传背景效应。使用模拟和真实数据,我们发现这可以导致通过全基因组显著性阈值的变体数量大大增加。检测与性状相关的变体的功效增加,也转化为生成的多基因评分从基因型数据预测表型的准确性提高。我们的研究结果表明,表型预测方法的进展可用于改善背景遗传效应的控制,从而提高 GWAS 结果的准确性,并进一步提高表型预测的准确性。