Chasman Daniel I
Center for Cardiovascular Disease Prevention, Division of Preventive Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA.
Genet Epidemiol. 2008 Nov;32(7):658-68. doi: 10.1002/gepi.20334.
In genomewide genetic association studies, prior biological knowledge may help distinguish variation that is truly associated with a quantitative trait from the vast majority of unassociated variation that may be significant in hypothesis testing due to chance. However, formal methods for integrating prior biological knowledge into association studies have only been proposed recently, and their potential utility has not been thoroughly evaluated. Herein, gene set methods from genomewide analysis of gene expression data are adapted for application to genomewide genetic analysis of quantitative traits. The proposed gene set method was tested in simulations with gene sets that included up to 500 total variants, among which up to 20 collectively explained 5% of the variance. In a population of 1,000 individuals, the gene set method was largely more efficient at detecting truly associated variants in these gene sets than a comparably calibrated conventional approach relying on P-values alone. While extremely strong associations remain best identified by conventional methods, the gene set approach may provide a complementary mode of analysis for revealing the full spectrum of genes that influence a quantitative trait.
在全基因组关联研究中,先前的生物学知识可能有助于从大量因偶然因素在假设检验中具有显著性的非关联变异中,区分出真正与数量性状相关的变异。然而,将先前生物学知识整合到关联研究中的正式方法直到最近才被提出,其潜在效用尚未得到充分评估。在此,将来自基因表达数据全基因组分析的基因集方法应用于数量性状的全基因组遗传分析。所提出的基因集方法在模拟中进行了测试,基因集包含多达500个总变异,其中多达20个共同解释了5%的方差。在一个1000人的群体中,与仅依赖P值的同等校准的传统方法相比,基因集方法在检测这些基因集中真正相关的变异方面效率更高。虽然极强的关联仍最好通过传统方法识别,但基因集方法可能为揭示影响数量性状的基因全貌提供一种补充分析模式。