Inference of gene regulatory networks incorporating multi-source biological knowledge via a state space model with L1 regularization.

Author information

Hasegawa Takanori, Yamaguchi Rui, Nagasaki Masao, Miyano Satoru, Imoto Seiya

Affiliations

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan.

Human Genome Center, The Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan.

Publication information

PLoS One. 2014 Aug 27;9(8):e105942. doi: 10.1371/journal.pone.0105942. eCollection 2014.

DOI:10.1371/journal.pone.0105942

PMID:25162401

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4146587/

Abstract

Comprehensive understanding of gene regulatory networks (GRNs) is a major challenge in systems biology. Two main approaches are currently used for GRN analysis of time-course observation data: an ordinary differential equation (ODE)-based approach and a statistical model-based approach. The ODE-based approach can generate complex GRN dynamics from biologically validated nonlinear models. However, because of computational difficulties, it cannot simultaneously estimate system dynamics and regulatory relationships for ten or more genes. The statistical model-based approach uses highly abstract models to describe biological systems simply and to infer relationships among several hundred genes from the data. However, the high level of abstraction generates false regulations that are not biologically permitted. Thus, for several tens of genes whose relationships are partially known, there is an urgent need for a method that infers regulatory relationships from a model with low abstraction and emulates the dynamics of ODE-based models while incorporating prior knowledge. To this end, we propose a method for inferring GRNs using a state space representation of a vector auto-regressive (VAR) model with L1 regularization. The method estimates the dynamic behavior of genes through linear time-series modeling constructed from an ODE-based model, and infers the regulatory structure among several tens of genes so as to maximize prediction ability for the observational data. Furthermore, it can incorporate various types of existing biological knowledge, e.g., drug kinetics and literature-recorded pathways. The effectiveness of the proposed method is shown through simulation studies comparing it with several previous methods.

As an application example, we evaluated mRNA expression profiles over time upon corticosteroid stimulation in rats, incorporating corticosteroid kinetics/dynamics, literature-recorded pathways and transcription factor (TF) information.
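
To make the core idea of an L1-regularized VAR model concrete, the following is a minimal sketch and not the authors' procedure. It assumes a first-order VAR in which gene expression is observed directly (no separate observation model, no Kalman filtering, no prior knowledge), and fits the coefficient matrix by proximal gradient descent (ISTA). The function names, the penalty weight `lam`, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np


def soft_threshold(x, tau):
    """Elementwise soft-thresholding: the proximal operator of tau * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def objective(A, X, lam):
    """Penalized least squares: (1/2n) * ||X[1:] - X[:-1] A^T||_F^2 + lam * ||A||_1."""
    past, future = X[:-1], X[1:]
    residual = past @ A.T - future
    return 0.5 * np.sum(residual ** 2) / past.shape[0] + lam * np.sum(np.abs(A))


def fit_sparse_var1(X, lam, n_iter=2000):
    """Estimate A in x_t ~ A x_{t-1} with an L1 penalty, using ISTA.

    X   : array of shape (T, p), one row per time point (assumed input layout).
    lam : L1 penalty weight (hypothetical tuning parameter).
    """
    past, future = X[:-1], X[1:]
    n, p = past.shape
    # Step size 1/L, where L is the Lipschitz constant of the smooth term's gradient.
    step = n / np.linalg.norm(past, 2) ** 2
    A = np.zeros((p, p))
    for _ in range(n_iter):
        residual = past @ A.T - future      # shape (T-1, p)
        grad = residual.T @ past / n        # gradient of the least-squares term w.r.t. A
        A = soft_threshold(A - step * grad, step * lam)
    return A


# Simulated example: 5 genes with self-regulation and two cross-regulations.
rng = np.random.default_rng(0)
p, T = 5, 200
A_true = 0.5 * np.eye(p)
A_true[1, 0] = 0.4     # gene 0 activates gene 1
A_true[3, 2] = -0.4    # gene 2 represses gene 3
X = np.zeros((T, p))
X[0] = rng.normal(size=p)
for t in range(1, T):
    X[t] = A_true @ X[t - 1] + rng.normal(size=p)

A_hat = fit_sparse_var1(X, lam=0.05)
inferred_edges = np.abs(A_hat) > 1e-8   # nonzero entries read as candidate regulations
```

In this sketch a nonzero entry `A_hat[i, j]` is read as a candidate regulation of gene `i` by gene `j`. Larger values of `lam` yield sparser networks, so in practice the penalty would be chosen by a criterion such as prediction error on held-out data.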
