Dai Yilin, Guo Ling, Dong Jianping, Jiang Renfang
Department of Mathematical Sciences, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA.
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S114. doi: 10.1186/1753-6561-5-S9-S114.
Genome-wide association studies have been used successfully to detect associations between common genetic variants and complex diseases, but common single-nucleotide polymorphisms (SNPs) detected by these studies explain only 5-10% of disease heritability. Alternatively, the common disease/rare variants hypothesis suggests that complex diseases are often caused by multiple rare variants with moderate to high effects. Under this hypothesis, the analysis of the cumulative effect of rare variants may thus help us discover the missing genetic variations. Collapsing all rare variants across a functional region is currently a popular method to find rare variants that may have a causal effect on certain diseases. However, the power of tests based on collapsing methods is often impaired by misclassification of functional variants. We develop a data-adaptive forward selection procedure that selectively chooses only variants that improve the association signal between functional regions and the disease risk. We apply our strategy to the Genetic Analysis Workshop 17 unrelated individuals data with quantitative traits. The type I error rate and the power of different collapsing functions are evaluated. The substantially higher power of the proposed strategy was demonstrated. The new method provides a useful strategy for the association study of sequencing data by taking advantage of the selection of rare variants.
全基因组关联研究已成功用于检测常见基因变异与复杂疾病之间的关联,但这些研究检测到的常见单核苷酸多态性(SNP)仅解释了疾病遗传力的5%-10%。另外,常见疾病/罕见变异假说表明,复杂疾病通常由多个具有中等到高度效应的罕见变异引起。在此假说下,分析罕见变异的累积效应可能有助于我们发现缺失的基因变异。目前,将功能区域内的所有罕见变异合并是一种寻找可能对某些疾病有因果效应的罕见变异的常用方法。然而,基于合并方法的检验效能常常因功能变异的错误分类而受损。我们开发了一种数据自适应向前选择程序,该程序仅选择性地选择那些能增强功能区域与疾病风险之间关联信号的变异。我们将我们的策略应用于遗传分析研讨会17的具有数量性状的非亲属个体数据。评估了不同合并函数的I型错误率和检验效能。结果表明,所提出策略的检验效能显著更高。该新方法通过利用罕见变异的选择,为测序数据的关联研究提供了一种有用的策略。