Virginia Institute for Psychiatric and Behavioral Genetics, Department of Psychiatry, Virginia Commonwealth University, Richmond, VA, USA.
The Department of Human Genetics, University of Chicago, IL 60637, USA; Grossman Institute for Neuroscience, Quantitative Biology and Human Behavior, University of Chicago, Chicago, IL 60637, USA.
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab067.
Rare variant-based analyses are beginning to identify risk genes for neuropsychiatric disorders and other diseases. However, the identified genes only account for a fraction of predicted causal genes. Recent studies have shown that rare damaging variants are significantly enriched in specific gene-sets. Methods which are able to jointly model rare variants and gene-sets to identify enriched gene-sets and use these enriched gene-sets to prioritize additional risk genes could improve understanding of the genetic architecture of diseases.
We propose DECO (Integrated analysis of de novo mutations, rare case/control variants and omics information via gene-sets), an integrated method for rare-variant and gene-set analysis. The method can (i) test the enrichment of gene-sets directly within the statistical model, and (ii) use enriched gene-sets to rank existing genes and prioritize additional risk genes for tested disorders. In simulations, DECO performs better than a homologous method that uses only variant data. To demonstrate the application of the proposed protocol, we have applied this approach to rare-variant datasets of schizophrenia. Compared with a method which only uses variant information, DECO is able to prioritize additional risk genes.
DECO can be used to analyze rare-variants and biological pathways or cell types for any disease. The package is available on Github https://github.com/hoangtn/DECO.
基于罕见变异的分析开始鉴定出神经精神疾病和其他疾病的风险基因。然而,已鉴定的基因仅占预测因果基因的一小部分。最近的研究表明,罕见的有害变异在特定基因集中显著富集。能够联合建模罕见变异和基因集以识别富集的基因集并使用这些富集的基因集来优先考虑其他风险基因的方法可以提高对疾病遗传结构的理解。
我们提出了 DECO(通过基因集整合分析新生突变、罕见病例/对照变异和组学信息),这是一种用于罕见变异和基因集分析的综合方法。该方法可以 (i) 在统计模型中直接测试基因集的富集,以及 (ii) 使用富集的基因集对现有基因进行排序,并优先考虑针对测试疾病的其他风险基因。在模拟中,DECO 的表现优于仅使用变异数据的同源方法。为了证明所提出方案的应用,我们已经将该方法应用于精神分裂症的罕见变异数据集。与仅使用变异信息的方法相比,DECO 能够优先考虑其他风险基因。
DECO 可用于分析任何疾病的罕见变异和生物途径或细胞类型。该软件包可在 Github 上获得 https://github.com/hoangtn/DECO。