Ren Jie, Zhou Fei, Li Xiaoxi, Chen Qi, Zhang Hongmei, Ma Shuangge, Jiang Yu, Wu Cen
Department of Statistics, Kansas State University, Manhattan, Kansas.
Department of Pharmacology, Toxicology and Therapeutics, University of Kansas Medical Center, Kansas City, Kansas.
Stat Med. 2020 Feb 28;39(5):617-638. doi: 10.1002/sim.8434. Epub 2019 Dec 21.
Many complex diseases are known to be affected by the interactions between genetic variants and environmental exposures beyond the main genetic and environmental effects. Study of gene-environment (G×E) interactions is important for elucidating the disease etiology. Existing Bayesian methods for G×E interaction studies are challenged by the high-dimensional nature of the study and the complexity of environmental influences. Many studies have shown the advantages of penalization methods in detecting G×E interactions in "large p, small n" settings. However, Bayesian variable selection, which can provide fresh insight into G×E study, has not been widely examined. We propose a novel and powerful semiparametric Bayesian variable selection model that can investigate linear and nonlinear G×E interactions simultaneously. Furthermore, the proposed method can conduct structural identification by distinguishing nonlinear interactions from main-effects-only case within the Bayesian framework. Spike-and-slab priors are incorporated on both individual and group levels to identify the sparse main and interaction effects. The proposed method conducts Bayesian variable selection more efficiently than existing methods. Simulation shows that the proposed model outperforms competing alternatives in terms of both identification and prediction. The proposed Bayesian method leads to the identification of main and interaction effects with important implications in a high-throughput profiling study with high-dimensional SNP data.
许多复杂疾病已知会受到基因变异与环境暴露之间相互作用的影响,而不仅仅是主要的基因和环境效应。基因-环境(G×E)相互作用的研究对于阐明疾病病因很重要。现有的用于G×E相互作用研究的贝叶斯方法受到研究的高维性质和环境影响复杂性的挑战。许多研究表明,惩罚方法在“大p,小n”情况下检测G×E相互作用方面具有优势。然而,能够为G×E研究提供新见解的贝叶斯变量选择尚未得到广泛研究。我们提出了一种新颖且强大的半参数贝叶斯变量选择模型,该模型可以同时研究线性和非线性G×E相互作用。此外,所提出的方法可以在贝叶斯框架内通过区分非线性相互作用与仅为主效应的情况来进行结构识别。在个体和组水平上都纳入了尖峰和平板先验,以识别稀疏的主效应和相互作用效应。所提出的方法比现有方法更有效地进行贝叶斯变量选择。模拟表明,所提出的模型在识别和预测方面均优于竞争方法。所提出的贝叶斯方法在具有高维SNP数据的高通量分析研究中能够识别出具有重要意义的主效应和相互作用效应。