Inference of Genetic Networks From Time-Series and Static Gene Expression Data: Combining a Random-Forest-Based Inference Method With Feature Selection Methods.

Authors

Kimura Shuhei, Fukutomi Ryo, Tokuhisa Masato, Okada Mariko

Affiliations

Faculty of Engineering, Tottori University, Tottori, Japan.

Graduate School of Sustainability Science, Tottori University, Tottori, Japan.

Publication information

Front Genet. 2020 Dec 15;11:595912. doi: 10.3389/fgene.2020.595912. eCollection 2020.

DOI:10.3389/fgene.2020.595912

PMID:33384716

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7770182/

Abstract

Several researchers have focused on random-forest-based inference methods because of their excellent performance. Some of these inference methods also have a useful ability to analyze both time-series and static gene expression data. However, they are only of use in ranking all of the candidate regulations by assigning them confidence values. None have been capable of detecting the regulations that actually affect a gene of interest. In this study, we propose a method to remove unpromising candidate regulations by combining the random-forest-based inference method with a series of feature selection methods. In addition to detecting unpromising regulations, our proposed method uses outputs from the feature selection methods to adjust the confidence values of all of the candidate regulations that have been computed by the random-forest-based inference method. Numerical experiments showed that the combined application with the feature selection methods improved the performance of the random-forest-based inference method on 99 of the 100 trials performed on the artificial problems. However, the improvement tends to be small, since our combined method succeeded in removing only 19% of the candidate regulations at most. The combined application with the feature selection methods moreover makes the computational cost higher. While a bigger improvement at a lower computational cost would be ideal, we see no impediments to our investigation, given that our aim is to extract as much useful information as possible from a limited amount of gene expression data.
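
The confidence-adjustment idea described in the abstract can be illustrated with a minimal sketch. The abstract does not name the feature selection methods or give the adjustment rule, so everything here is an assumption made for illustration: the function names `adjust_confidences` and `rank_regulations`, the `penalty` parameter, the gene labels, and the confidence values are all hypothetical. The sketch takes confidence values as if they came from a random-forest-based method, scales down the candidates that a feature selection step flagged as unpromising, and re-ranks the candidate regulations.

```python
from typing import Dict, List, Set, Tuple

# A candidate regulation is a (regulator, target) pair. Hypothetical representation.
Regulation = Tuple[str, str]


def adjust_confidences(
    confidences: Dict[Regulation, float],
    rejected: Set[Regulation],
    penalty: float = 0.0,
) -> Dict[Regulation, float]:
    """Scale the confidence of rejected candidates by `penalty`.

    With penalty=0.0, rejected candidates drop to zero confidence.
    This rule is an assumption, not the rule used in the paper.
    """
    return {
        reg: (conf * penalty if reg in rejected else conf)
        for reg, conf in confidences.items()
    }


def rank_regulations(confidences: Dict[Regulation, float]) -> List[Regulation]:
    """Return candidate regulations ordered from highest to lowest confidence."""
    return sorted(confidences, key=lambda reg: confidences[reg], reverse=True)


# Made-up confidence values for three candidate regulators of gene "g3".
confidences = {("g1", "g3"): 0.42, ("g2", "g3"): 0.35, ("g4", "g3"): 0.23}

# Candidates that a (hypothetical) feature selection step judged unpromising.
rejected = {("g2", "g3")}

adjusted = adjust_confidences(confidences, rejected)
ranking = rank_regulations(adjusted)
print(ranking)
```

In this toy example the rejected candidate moves to the bottom of the ranking while the relative order of the remaining candidates is unchanged. A softer `penalty` between 0 and 1 would demote flagged candidates without discarding them outright.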

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/444a/7770182/71048105187a/fgene-11-595912-g0001.jpg
