Lady Davis Institute, Jewish General Hospital, Montreal, Canada.
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada.
Genet Epidemiol. 2020 Nov;44(8):825-840. doi: 10.1002/gepi.22344. Epub 2020 Aug 11.
It is challenging to estimate the phenotypic impact of the structural genome changes known as copy-number variations (CNVs), since there are many unique CNVs which are nonrecurrent, and most are too rare to be studied individually. In recent work, we found that CNV-aggregated genomic annotations, that is, specifically the intolerance to mutation as measured by the pLI score (probability of being loss-of-function intolerant), can be strong predictors of intellectual quotient (IQ) loss. However, this aggregation method only estimates the individual CNV effects indirectly. Here, we propose the use of hierarchical Bayesian models to directly estimate individual effects of rare CNVs on measures of intelligence. Annotation information on the impact of major mutations in genomic regions is extracted from genomic databases and used to define prior information for the approach we call HBIQ. We applied HBIQ to the analysis of CNV deletions and duplications from three datasets and identified several genomic regions containing CNVs demonstrating significant deleterious effects on IQ, some of which validate previously known associations. We also show that several CNVs were identified as deleterious by HBIQ even if they have a zero pLI score, and the converse is also true. Furthermore, we show that our new model yields higher out-of-sample concordance (78%) for predicting the consequences of carrying known recurrent CNVs compared with our previous approach.
评估结构基因组变化(称为拷贝数变异,CNVs)的表型影响具有挑战性,因为存在许多非重复的独特 CNVs,并且大多数都太罕见而无法单独研究。在最近的工作中,我们发现 CNV 聚合基因组注释,即特别通过 pLI 分数(功能丧失不耐受的概率)测量的突变不耐受性,可以成为智商(IQ)损失的强有力预测因子。然而,这种聚合方法只能间接地估计单个 CNV 的影响。在这里,我们提出使用分层贝叶斯模型来直接估计罕见 CNV 对智力测量的个体影响。从基因组数据库中提取基因组区域中主要突变影响的注释信息,并将其用于定义我们称之为 HBIQ 的方法的先验信息。我们将 HBIQ 应用于来自三个数据集的 CNV 缺失和重复分析,并确定了几个包含 CNV 的基因组区域,这些区域对 IQ 具有显著的有害影响,其中一些验证了先前已知的关联。我们还表明,即使 CNV 的 pLI 分数为零,HBIQ 也会将其识别为有害,反之亦然。此外,我们表明,与我们之前的方法相比,我们的新模型在预测已知重复 CNV 的携带后果方面具有更高的样本外一致性(78%)。