Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.
UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.
Sci Rep. 2021 Nov 2;11(1):21495. doi: 10.1038/s41598-021-00427-y.
Complementary to the genome, the concept of exposome has been proposed to capture the totality of human environmental exposures. While there has been some recent progress on the construction of the exposome, few tools exist that can integrate the genome and exposome for complex trait analyses. Here we propose a linear mixed model approach to bridge this gap, which jointly models the random effects of the two omics layers on phenotypes of complex traits. We illustrate our approach using traits from the UK Biobank (e.g., BMI and height for N ~ 35,000) with a small fraction of the exposome that comprises 28 lifestyle factors. The joint model of the genome and exposome explains substantially more phenotypic variance and significantly improves phenotypic prediction accuracy, compared to the model based on the genome alone. The additional phenotypic variance captured by the exposome includes its additive effects as well as non-additive effects such as genome-exposome (gxe) and exposome-exposome (exe) interactions. For example, 19% of variation in BMI is explained by additive effects of the genome, while additional 7.2% by additive effects of the exposome, 1.9% by exe interactions and 4.5% by gxe interactions. Correspondingly, the prediction accuracy for BMI, computed using Pearson's correlation between the observed and predicted phenotypes, improves from 0.15 (based on the genome alone) to 0.35 (based on the genome and exposome). We also show, using established theories, that integrating genomic and exposomic data can be an effective way of attaining a clinically meaningful level of prediction accuracy for disease traits. In conclusion, the genomic and exposomic effects can contribute to phenotypic variation via their latent relationships, i.e. genome-exposome correlation, and gxe and exe interactions, and modelling these effects has a potential to improve phenotypic prediction accuracy and thus holds a great promise for future clinical practice.
除了基因组,人们还提出了暴露组学的概念,以捕捉人类环境暴露的全部信息。虽然在构建暴露组学方面已经取得了一些进展,但能够将基因组和暴露组整合用于复杂特征分析的工具却很少。在这里,我们提出了一种线性混合模型方法来弥补这一空白,该方法联合建模了这两个组学层对复杂特征表型的随机效应。我们使用英国生物库(例如,N~35000 人的 BMI 和身高)中的特征来说明我们的方法,其中一小部分暴露组包含 28 种生活方式因素。与仅基于基因组的模型相比,基因组和暴露组的联合模型解释了更多的表型方差,并显著提高了表型预测准确性。暴露组捕捉到的额外表型方差包括其加性效应以及非加性效应,例如基因组-暴露组(gxe)和暴露组-暴露组(exe)相互作用。例如,BMI 的 19%的变异是由基因组的加性效应解释的,而额外的 7.2%是由暴露组的加性效应解释的,1.9%是由 exe 相互作用解释的,4.5%是由 gxe 相互作用解释的。相应地,使用 Pearson 相关系数计算的 BMI 预测准确性从 0.15(仅基于基因组)提高到 0.35(基于基因组和暴露组)。我们还使用已建立的理论表明,整合基因组和暴露组数据可以是一种有效的方法,可以为疾病特征达到临床有意义的预测准确性水平。总之,基因组和暴露组效应可以通过它们的潜在关系,即基因组-暴露组相关性、gxe 和 exe 相互作用,对表型变异产生影响,并且对这些效应进行建模有可能提高表型预测准确性,因此为未来的临床实践提供了巨大的潜力。