Division of Biostatistics, Department of Preventive Medicine, University of Southern California (USC), 2001 N. Soto Street, Los Angeles, CA, USA.
Int J Genomics. 2013;2013:406217. doi: 10.1155/2013/406217. Epub 2013 Dec 31.
A variety of methods have been proposed for studying the association of multiple genes thought to be involved in a common pathway for a particular disease. Here, we present an extension of a Bayesian hierarchical modeling strategy that allows for multiple SNPs within each gene, with external prior information at either the SNP or gene level. The model involves variable selection at the SNP level through latent indicator variables and Bayesian shrinkage at the gene level towards a prior mean vector and covariance matrix that depend on external information. The entire model is fitted using Markov chain Monte Carlo methods. Simulation studies show that the approach is capable of recovering many of the truly causal SNPs and genes, depending upon their frequency and size of their effects. The method is applied to data on 504 SNPs in 38 candidate genes involved in DNA damage response in the WECARE study of second breast cancers in relation to radiotherapy exposure.
已经提出了多种方法来研究被认为与特定疾病的共同途径有关的多个基因的关联。在这里,我们提出了一种贝叶斯分层建模策略的扩展,该策略允许每个基因内存在多个 SNP,并在 SNP 或基因水平上具有外部先验信息。该模型通过潜在指示变量在 SNP 水平上进行变量选择,并在基因水平上对先验均值向量和协方差矩阵进行贝叶斯收缩,该先验均值向量和协方差矩阵取决于外部信息。整个模型使用马尔可夫链蒙特卡罗方法进行拟合。模拟研究表明,该方法能够根据其频率和效应大小恢复许多真正的因果 SNP 和基因。该方法应用于 WECARE 研究中与放射治疗暴露有关的第二次乳腺癌中涉及 DNA 损伤反应的 38 个候选基因中的 504 个 SNP 的数据。