Drichel Dmitriy, Herold Christine, Lacour André, Ramirez Alfredo, Jessen Frank, Maier Wolfgang, Noethen Markus M, Leber Markus, Vaitsiakhovich Tatsiana, Becker Tim
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany.
Hum Hered. 2014;78(3-4):164-78. doi: 10.1159/000368676. Epub 2014 Dec 10.
Important methodological advancements in rare variant association testing have been made recently, among them collapsing tests, kernel methods and the variable threshold (VT) technique. Typically, rare variants from a region of interest are tested for association as a group ('bin'). Rare variant studies are already routinely performed as whole-exome sequencing studies. As an alternative approach, we propose a pipeline for rare variant analysis of imputed data and develop respective quality control criteria. We provide suggestions for the choice and construction of analysis bins in whole-genome application and support the analysis with implementations of standard burden tests (COLL, CMAT) in our INTERSNP-RARE software. In addition, three rare variant regression tests (REG, FRACREG and COLLREG) are implemented. All tests are accompanied with the VT approach which optimizes the definition of 'rareness'. We integrate kernel tests as implemented in SKAT/SKAT-O into the suggested strategies. Then, we apply our analysis scheme to a genome-wide association study of Alzheimer's disease. Further, we show that our pipeline leads to valid significance testing procedures with controlled type I error rates. Strong association signals surrounding the known APOE locus demonstrate statistical power. In addition, we highlight several suggestive rare variant association findings for follow-up studies, including genomic regions overlapping MCPH1, MED18 and NOTCH3. In summary, we describe and support a straightforward and cost-efficient rare variant analysis pipeline for imputed data and demonstrate its feasibility and validity. The strategy can complement rare variant studies with next generation sequencing data.
最近,罕见变异关联测试在方法学上取得了重要进展,其中包括合并检验、核方法和可变阈值(VT)技术。通常,来自感兴趣区域的罕见变异作为一个组(“bin”)进行关联测试。罕见变异研究已经作为全外显子测序研究常规开展。作为一种替代方法,我们提出了一种用于推断数据的罕见变异分析流程,并制定了相应的质量控制标准。我们为全基因组应用中分析bin的选择和构建提供了建议,并在我们的INTERSNP-RARE软件中通过标准负担检验(COLL、CMAT)的实现来支持分析。此外,还实现了三种罕见变异回归检验(REG、FRACREG和COLLREG)。所有检验都伴随着优化“稀有性”定义的VT方法。我们将SKAT/SKAT-O中实现的核检验整合到建议的策略中。然后,我们将我们的分析方案应用于阿尔茨海默病的全基因组关联研究。此外,我们表明我们的流程能够产生具有可控I型错误率的有效显著性检验程序。已知载脂蛋白E(APOE)基因座周围的强关联信号证明了统计功效。此外,我们强调了几个可供后续研究的提示性罕见变异关联发现,包括与小头畸形相关基因1(MCPH1)、中介体亚基18(MED18)和Notch受体3(NOTCH3)重叠的基因组区域。总之,我们描述并支持一种用于推断数据的简单且经济高效的罕见变异分析流程,并证明了其可行性和有效性。该策略可以补充下一代测序数据的罕见变异研究。