Li Chun, Li Mingyao, Lange Ethan M, Watanabe Richard M
Department of Biostatistics, Center for Human Genetics Research, Vanderbilt University School of Medicine, Nashville, TN 37232, USA.
Hum Hered. 2008;65(3):129-41. doi: 10.1159/000109730. Epub 2007 Oct 12.
Genome-wide association studies (GWAS) are now feasible for studying the genetics underlying complex diseases. For many diseases, a list of candidate genes or regions exists and incorporation of such information into data analyses can potentially improve the power to detect disease variants. Traditional approaches for assessing the overall statistical significance of GWAS results ignore such information by inherently treating all markers equally.
We propose the prioritized subset analysis (PSA), in which a prioritized subset of markers is pre-selected from candidate regions, and the false discovery rate (FDR) procedure is carried out in the prioritized subset and its complementary subset, respectively.
The PSA is more powerful than the whole-genome single-step FDR adjustment for a range of alternative models. The degree of power improvement depends on the fraction of associated SNPs in the prioritized subset and their nominal power, with higher fraction of associated SNPs and higher nominal power leading to more power improvement. The power improvement can be substantial; for disease loci not included in the prioritized subset, the power loss is almost negligible.
The PSA has the flexibility of allowing investigators to combine prior information from a variety of sources, and will be a useful tool for GWAS.
全基因组关联研究(GWAS)目前对于研究复杂疾病的遗传基础是可行的。对于许多疾病,存在一系列候选基因或区域,将此类信息纳入数据分析可能会提高检测疾病变异的能力。评估GWAS结果总体统计显著性的传统方法通过同等对待所有标记而忽略了此类信息。
我们提出了优先子集分析(PSA),其中从候选区域预先选择一个标记的优先子集,并分别在优先子集中及其互补子集中进行错误发现率(FDR)程序。
对于一系列替代模型,PSA比全基因组单步FDR调整更具效力。效力提高的程度取决于优先子集中相关单核苷酸多态性(SNP)的比例及其名义效力,相关SNP比例越高且名义效力越高,效力提高就越大。效力提高可能相当显著;对于未包含在优先子集中的疾病位点,效力损失几乎可以忽略不计。
PSA具有允许研究人员整合来自各种来源的先验信息的灵活性,将成为GWAS的一个有用工具。