School of Biological Science, The University of Edinburgh, Edinburgh, EH9 3BF, UK.
BMC Bioinformatics. 2020 Oct 22;21(1):476. doi: 10.1186/s12859-020-03801-1.
Gene and protein interaction experiments provide unique opportunities to study the molecular wiring of a cell. Integrating high-throughput functional genomics data with this information can help identifying networks associated with complex diseases and phenotypes.
Here we introduce an integrated statistical framework to test network properties of single and multiple genesets under different interaction models. We implemented this framework as an open-source software, called Python Geneset Network Analysis (PyGNA). Our software is designed for easy integration into existing analysis pipelines and to generate high quality figures and reports. We also developed PyGNA to take advantage of multi-core systems to generate calibrated null distributions on large datasets. We then present the results of extensive benchmarking of the tests implemented in PyGNA and a use case inspired by RNA sequencing data analysis, showing how PyGNA can be easily integrated to study biological networks. PyGNA is available at http://github.com/stracquadaniolab/pygna and can be easily installed using the PyPi or Anaconda package managers, and Docker.
We present a tool for network-aware geneset analysis. PyGNA can either be readily used and easily integrated into existing high-performance data analysis pipelines or as a Python package to implement new tests and analyses. With the increasing availability of population-scale omic data, PyGNA provides a viable approach for large scale geneset network analysis.
基因和蛋白质相互作用实验为研究细胞的分子连接提供了独特的机会。将高通量功能基因组学数据与这些信息整合,可以帮助识别与复杂疾病和表型相关的网络。
在这里,我们引入了一个综合的统计框架,用于在不同的相互作用模型下测试单个和多个基因集的网络特性。我们将这个框架实现为一个开源软件,称为 Python 基因集网络分析(PyGNA)。我们的软件旨在易于集成到现有的分析管道中,并生成高质量的图形和报告。我们还开发了 PyGNA 来利用多核系统在大型数据集上生成校准的零分布。然后,我们展示了在 PyGNA 中实现的测试的广泛基准测试的结果,并提供了一个受 RNA 测序数据分析启发的用例,展示了如何轻松地集成 PyGNA 来研究生物网络。PyGNA 可在 http://github.com/stracquadaniolab/pygna 上获得,并且可以使用 PyPi 或 Anaconda 包管理器以及 Docker 轻松安装。
我们提出了一种用于网络感知基因集分析的工具。PyGNA 可以直接使用,也可以轻松集成到现有的高性能数据分析管道中,或者作为一个 Python 包来实现新的测试和分析。随着人群规模的组学数据的可用性不断增加,PyGNA 为大规模基因集网络分析提供了一种可行的方法。