Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA.
Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Psychiatry, Harvard Medical School, Boston, MA 02115, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
J Genet Genomics. 2021 Mar 20;48(3):173-183. doi: 10.1016/j.jgg.2021.01.007. Epub 2021 Feb 26.
Pathway analysis, also known as gene-set enrichment analysis, is a multilocus analytic strategy that integrates a priori, biological knowledge into the statistical analysis of high-throughput genetics data. Originally developed for the studies of gene expression data, it has become a powerful analytic procedure for in-depth mining of genome-wide genetic variation data. Astonishing discoveries were made in the past years, uncovering genes and biological mechanisms underlying common and complex disorders. However, as massive amounts of diverse functional genomics data accrue, there is a pressing need for newer generations of pathway analysis methods that can utilize multiple layers of high-throughput genomics data. In this review, we provide an intellectual foundation of this powerful analytic strategy, as well as an update of the state-of-the-art in recent method developments. The goal of this review is threefold: (1) introduce the motivation and basic steps of pathway analysis for genome-wide genetic variation data; (2) review the merits and the shortcomings of classic and newly emerging integrative pathway analysis tools; and (3) discuss remaining challenges and future directions for further method developments.
通路分析(Pathway analysis),也称为基因集富集分析(gene-set enrichment analysis),是一种多基因分析策略,它将先验的、生物学方面的知识整合到高通量遗传学数据的统计分析中。该方法最初是为研究基因表达数据而开发的,现已成为深入挖掘全基因组遗传变异数据的强大分析程序。在过去的几年中,该方法取得了惊人的发现,揭示了常见和复杂疾病的基因和生物学机制。然而,随着大量不同功能基因组学数据的积累,需要新一代的通路分析方法来利用多层次的高通量基因组学数据。在这篇综述中,我们提供了这种强大分析策略的知识基础,并更新了最新方法发展的现状。这篇综述的目的有三:(1)介绍用于全基因组遗传变异数据的通路分析的动机和基本步骤;(2)综述经典和新兴的综合通路分析工具的优点和缺点;(3)讨论进一步方法发展的剩余挑战和未来方向。