Lin Keng-Han, Zöllner Sebastian
Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, United States of America.
Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, United States of America.
Genet Epidemiol. 2015 Jul;39(5):325-33. doi: 10.1002/gepi.21903. Epub 2015 May 13.
Advances in DNA sequencing technology facilitate investigating the impact of rare variants on complex diseases. However, using a conventional case-control design, large samples are needed to capture enough rare variants to achieve sufficient power for testing the association between suspected loci and complex diseases. In such large samples, population stratification may easily cause spurious signals. One approach to overcome stratification is to use a family-based design. For rare variants, this strategy is especially appropriate, as power can be increased considerably by analyzing cases with affected relatives. We propose a novel framework for association testing in affected sibpairs by comparing the allele count of rare variants on chromosome regions shared identical by descent to the allele count of rare variants on nonshared chromosome regions, referred to as test for rare variant association with family-based internal control (TRAFIC). This design is generally robust to population stratification as cases and controls are matched within each sibpair. We evaluate the power analytically using general model for effect size of rare variants. For the same number of genotyped people, TRAFIC shows superior power over the conventional case-control study for variants with summed risk allele frequency f < 0.05; this power advantage is even more substantial when considering allelic heterogeneity. For complex models of gene-gene interaction, this power advantage depends on the direction of interaction and overall heritability. In sum, we introduce a new method for analyzing rare variants in affected sibpairs that is robust to population stratification, and provide freely available software.
DNA测序技术的进步有助于研究罕见变异对复杂疾病的影响。然而,采用传统的病例对照设计,需要大量样本才能捕获足够多的罕见变异,以获得足够的检验效能来检测可疑基因座与复杂疾病之间的关联。在如此大的样本中,群体分层很容易导致假信号。克服分层的一种方法是采用基于家系的设计。对于罕见变异,这种策略尤其合适,因为通过分析有患病亲属的病例可以显著提高检验效能。我们提出了一种用于受累同胞对关联检验的新框架,即将通过血缘共享的染色体区域上罕见变异的等位基因计数与非共享染色体区域上罕见变异的等位基因计数进行比较,称为基于家系内部控制的罕见变异关联检验(TRAFIC)。由于病例和对照在每个同胞对内是匹配的,所以这种设计通常对群体分层具有稳健性。我们使用罕见变异效应大小的一般模型进行分析评估检验效能。对于相同数量的基因分型个体,对于汇总风险等位基因频率f < 0.05的变异,TRAFIC比传统病例对照研究显示出更高的检验效能;当考虑等位基因异质性时,这种效能优势更为显著。对于基因-基因相互作用的复杂模型,这种效能优势取决于相互作用的方向和总体遗传度。总之,我们介绍了一种用于分析受累同胞对中罕见变异的新方法,该方法对群体分层具有稳健性,并提供了免费可用的软件。