Wu Guanming, Haw Robin
Informatics and Biocomputing Program, Ontario Institute for Cancer Research, 661 University Avenue, Toronto, ON, Canada, M5G 0A3.
Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA.
Methods Mol Biol. 2017;1558:235-253. doi: 10.1007/978-1-4939-6783-4_11.
Network-based approaches project seemingly unrelated genes or proteins onto a large-scale network context, therefore providing a holistic visualization and analysis platform for genomic data generated from high-throughput experiments, reducing the dimensionality of data via using network modules and increasing the statistic analysis power. Based on the Reactome database, the most popular and comprehensive open-source biological pathway knowledgebase, we have developed a highly reliable protein functional interaction network covering around 60 % of total human genes and an app called ReactomeFIViz for Cytoscape, the most popular biological network visualization and analysis platform. In this chapter, we describe the detailed procedures on how this functional interaction network is constructed by integrating multiple external data sources, extracting functional interactions from human curated pathway databases, building a machine learning classifier called a Naïve Bayesian Classifier, predicting interactions based on the trained Naïve Bayesian Classifier, and finally constructing the functional interaction database. We also provide an example on how to use ReactomeFIViz for performing network-based data analysis for a list of genes.
基于网络的方法将看似不相关的基因或蛋白质投射到大规模网络环境中,从而为高通量实验产生的基因组数据提供一个整体的可视化和分析平台,通过使用网络模块降低数据维度并提高统计分析能力。基于最流行且全面的开源生物通路知识库Reactome数据库,我们开发了一个高度可靠的蛋白质功能相互作用网络,覆盖约60%的人类基因总数,并为最流行的生物网络可视化和分析平台Cytoscape开发了一个名为ReactomeFIViz的应用程序。在本章中,我们描述了通过整合多个外部数据源构建这个功能相互作用网络、从人工策划的通路数据库中提取功能相互作用、构建一个名为朴素贝叶斯分类器的机器学习分类器、基于训练好的朴素贝叶斯分类器预测相互作用以及最终构建功能相互作用数据库的详细步骤。我们还提供了一个关于如何使用ReactomeFIViz对基因列表进行基于网络的数据分析的示例。