Oh Cheongeun, Ye Kenny Q, He Qimei, Mendell Nancy R
Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT, USA.
BMC Genet. 2003 Dec 31;4 Suppl 1(Suppl 1):S69. doi: 10.1186/1471-2156-4-S1-S69.
We applied stochastic search variable selection (SSVS), a Bayesian model selection method, to the simulated data of Genetic Analysis Workshop 13. We used SSVS with the revisited Haseman-Elston method to find the markers linked to the loci determining change in cholesterol over time. To study gene-gene interaction (epistasis) and gene-environment interaction, we adopted prior structures, which incorporate the relationship among the predictors. This allows SSVS to search in the model space more efficiently and avoid the less likely models.
In applying SSVS, instead of looking at the posterior distribution of each of the candidate models, which is sensitive to the setting of the prior, we ranked the candidate variables (markers) according to their marginal posterior probability, which was shown to be more robust to the prior. Compared with traditional methods that consider one marker at a time, our method considers all markers simultaneously and obtains more favorable results.
We showed that SSVS is a powerful method for identifying linked markers using the Haseman-Elston method, even for weak effects. SSVS is very effective because it does a smart search over the entire model space.
我们将随机搜索变量选择法(SSVS),一种贝叶斯模型选择方法,应用于遗传分析研讨会13的模拟数据。我们使用SSVS结合重新审视的哈斯曼-埃尔斯顿方法来寻找与决定胆固醇随时间变化的位点连锁的标记。为了研究基因-基因相互作用(上位性)和基因-环境相互作用,我们采用了先验结构,该结构纳入了预测变量之间的关系。这使得SSVS能够在模型空间中更有效地搜索,并避免不太可能的模型。
在应用SSVS时,我们不是查看每个候选模型的后验分布(它对先验设置敏感),而是根据其边际后验概率对候选变量(标记)进行排序,结果表明该概率对先验更具稳健性。与一次考虑一个标记的传统方法相比,我们的方法同时考虑所有标记并获得了更优的结果。
我们表明,即使对于弱效应,SSVS也是一种使用哈斯曼-埃尔斯顿方法识别连锁标记的强大方法。SSVS非常有效,因为它在整个模型空间中进行智能搜索。