GeNICE：一种通过聚类、穷举搜索和多变量分析进行基因网络推断的新型框架。

GeNICE: A Novel Framework for Gene Network Inference by Clustering, Exhaustive Search, and Multivariate Analysis.

作者信息

De Souza Jacomini Ricardo, Martins David Correa, Da Silva Felipe Leno, Costa Anna Helena Reali

机构信息

1 Escola Politécnica da Universidade de São Paulo , São Paulo, Brazil .

2 Universidade Federal do ABC , Santo André, Brazil .

出版信息

J Comput Biol. 2017 Aug;24(8):809-830. doi: 10.1089/cmb.2017.0022. Epub 2017 Jun 21.

DOI:10.1089/cmb.2017.0022

PMID:28636461

Abstract

Gene network (GN) inference from temporal gene expression data is a crucial and challenging problem in systems biology. Expression data sets usually consist of dozens of temporal samples, while networks consist of thousands of genes, thus rendering many inference methods unfeasible in practice. To improve the scalability of GN inference methods, we propose a novel framework called GeNICE, based on probabilistic GNs; the main novelty is the introduction of a clustering procedure to group genes with related expression profiles and to provide an approximate solution with reduced computational complexity. We use the defined clusters to perform an exhaustive search to retrieve the best predictor gene subsets for each target gene, according to multivariate criterion functions. GeNICE greatly reduces the search space because predictor candidates are restricted to one gene per cluster. Finally, a multivariate analysis is performed for each defined predictor subset to retrieve minimal subsets and to simplify the network. In our experiments with in silico generated data sets, GeNICE achieved substantial computational time reduction when compared to solutions without the clustering step, while preserving the gene expression prediction accuracy even when the number of clusters is small (about 50) relative to the number of genes (order of thousands). For a Plasmodium falciparum microarray data set, the prediction accuracy achieved by GeNICE was roughly 97%, while the respective topologies involving glycolytic and apicoplast seed genes had a very large intramodularity, very small interconnection between modules, and some module hub genes, reflecting small-world and scale-free topological properties, as expected.

摘要

从时间基因表达数据推断基因网络（GN）是系统生物学中一个关键且具有挑战性的问题。表达数据集通常由数十个时间样本组成，而网络由数千个基因组成，这使得许多推断方法在实际中不可行。为了提高GN推断方法的可扩展性，我们基于概率基因网络提出了一种名为GeNICE的新颖框架；主要新颖之处在于引入了一种聚类程序，对具有相关表达谱的基因进行分组，并提供具有降低计算复杂度的近似解。我们使用定义的聚类进行穷举搜索，根据多变量准则函数为每个目标基因检索最佳预测基因子集。由于预测候选基因被限制为每个聚类一个基因，GeNICE大大减少了搜索空间。最后，对每个定义的预测子集进行多变量分析，以检索最小子集并简化网络。在我们对计算机生成的数据集进行的实验中，与没有聚类步骤的解决方案相比，GeNICE显著减少了计算时间，即使聚类数量相对于基因数量（数千个量级）较少（约50个）时，也能保持基因表达预测准确性。对于恶性疟原虫微阵列数据集，GeNICE实现的预测准确率约为97%，而涉及糖酵解和顶质体种子基因的相应拓扑结构具有非常大的模块内聚性、模块之间非常小的互连性以及一些模块中心基因，如预期的那样反映了小世界和无标度拓扑特性。

相似文献

GeNICE: A Novel Framework for Gene Network Inference by Clustering, Exhaustive Search, and Multivariate Analysis.GeNICE：一种通过聚类、穷举搜索和多变量分析进行基因网络推断的新型框架。

J Comput Biol. 2017 Aug;24(8):809-830. doi: 10.1089/cmb.2017.0022. Epub 2017 Jun 21.

Gene expression complex networks: synthesis, identification, and analysis.基因表达复杂网络：合成、识别与分析。

J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6.

Disease specific modules and hub genes for intervention strategies: A co-expression network based approach for Plasmodium falciparum clinical isolates.用于干预策略的疾病特异性模块和枢纽基因：一种基于共表达网络的恶性疟原虫临床分离株研究方法。

Infect Genet Evol. 2015 Oct;35:96-108. doi: 10.1016/j.meegid.2015.08.007. Epub 2015 Aug 4.

A novel mutual information-based Boolean network inference method from time-series gene expression data.一种基于互信息的从时间序列基因表达数据推断布尔网络的新方法。

PLoS One. 2017 Feb 8;12(2):e0171097. doi: 10.1371/journal.pone.0171097. eCollection 2017.

A Novel Model Integration Network Inference Algorithm with Clustering and Hub Genes Finding.一种具有聚类和枢纽基因发现功能的新型模型整合网络推断算法。

Mol Inform. 2020 May;39(5):e1900075. doi: 10.1002/minf.201900075. Epub 2020 Jan 28.

Enhancing Gene Co-Expression Network Inference for the Malaria Parasite .增强疟原虫基因共表达网络推断

Genes (Basel). 2024 May 25;15(6):685. doi: 10.3390/genes15060685.

Network-based gene prediction for Plasmodium falciparum malaria towards genetics-based drug discovery.基于网络的恶性疟原虫疟疾基因预测用于基于遗传学的药物发现。

BMC Genomics. 2015;16 Suppl 7(Suppl 7):S9. doi: 10.1186/1471-2164-16-S7-S9. Epub 2015 Jun 11.

Reverse engineering module networks by PSO-RNN hybrid modeling.通过粒子群优化-递归神经网络混合建模对模块网络进行逆向工程。

BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S15. doi: 10.1186/1471-2164-10-S1-S15.

Analysis of nucleosome positioning landscapes enables gene discovery in the human malaria parasite Plasmodium falciparum.核小体定位图谱分析有助于在人类疟原虫恶性疟原虫中发现基因。

BMC Genomics. 2015 Nov 25;16:1005. doi: 10.1186/s12864-015-2214-9.

Improving gene regulatory network inference using network topology information.利用网络拓扑信息改进基因调控网络推断

Mol Biosyst. 2015 Sep;11(9):2449-63. doi: 10.1039/c5mb00122f. Epub 2015 Jul 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

GeNICE：一种通过聚类、穷举搜索和多变量分析进行基因网络推断的新型框架。

GeNICE: A Novel Framework for Gene Network Inference by Clustering, Exhaustive Search, and Multivariate Analysis.

作者信息

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献