Capanu Marinela, Ionita-Laza Iuliana
Memorial Sloan-Kettering Cancer Center New York, NY, USA.
Department of Biostatistics, Columbia University New York, NY, USA.
Front Genet. 2015 May 8;6:17. doi: 10.3389/fgene.2015.00176. eCollection 2015.
Identifying the small number of rare causal variants contributing to disease has been a major focus of investigation in recent years, but represents a formidable statistical challenge due to the rare frequencies with which these variants are observed. In this commentary we draw attention to a formal statistical framework, namely hierarchical modeling, to combine functional genomic annotations with sequencing data with the objective of enhancing our ability to identify rare causal variants. Using simulations we show that in all configurations studied, the hierarchical modeling approach has superior discriminatory ability compared to a recently proposed aggregate measure of deleteriousness, the Combined Annotation-Dependent Depletion (CADD) score, supporting our premise that aggregate functional genomic measures can more accurately identify causal variants when used in conjunction with sequencing data through a hierarchical modeling approach.
识别导致疾病的少数罕见因果变异一直是近年来研究的主要焦点,但由于这些变异出现的频率很低,这是一个巨大的统计挑战。在本评论中,我们提请注意一个正式的统计框架,即层次模型,将功能基因组注释与测序数据相结合,目的是增强我们识别罕见因果变异的能力。通过模拟我们表明,在所有研究的配置中,与最近提出的一种有害性综合度量方法——联合注释依赖损耗(CADD)评分相比,层次模型方法具有更强的鉴别能力,这支持了我们的前提,即通过层次模型方法将综合功能基因组度量与测序数据结合使用时,可以更准确地识别因果变异。