IEEE/ACM Trans Comput Biol Bioinform. 2018 Jul-Aug;15(4):1093-1106. doi: 10.1109/TCBB.2015.2509992. Epub 2015 Dec 17.
The reconstruction of gene regulatory networks from gene expression data has been the subject of intense research activity. A variety of models and methods have been developed to address different aspects of this important problem. However, these techniques are narrowly focused on particular biological and experimental platforms, and require experimental data that are typically unavailable and difficult to ascertain. The more recent availability of higher-throughput sequencing platforms, combined with more precise modes of genetic perturbation, presents an opportunity to formulate more robust and comprehensive approaches to gene network inference. Here, we propose a step-wise framework for identifying gene-gene regulatory interactions that expand from a known point of genetic or chemical perturbation using time series gene expression data. This novel approach sequentially identifies non-steady state genes post-perturbation and incorporates them into a growing series of low-complexity optimization problems. The governing ordinary differential equations of this model are rooted in the biophysics of stochastic molecular events that underlie gene regulation, delineating roles for both protein and RNA-mediated gene regulation. We show the successful application of our core algorithms for network inference using simulated and real datasets.
从基因表达数据中重建基因调控网络一直是研究的热点。已经开发了各种模型和方法来解决这个重要问题的不同方面。然而,这些技术仅限于特定的生物和实验平台,并且需要通常不可用且难以确定的实验数据。最近高通量测序平台的出现,以及更精确的遗传干扰模式,为制定更强大和全面的基因网络推断方法提供了机会。在这里,我们提出了一个逐步框架,用于使用时间序列基因表达数据从已知的遗传或化学干扰点识别基因-基因调控相互作用。这种新方法依次识别扰动后的非稳态基因,并将其纳入一系列不断增长的低复杂度优化问题中。该模型的常微分方程源于随机分子事件的生物物理学,这些事件是基因调控的基础,描绘了蛋白质和 RNA 介导的基因调控的作用。我们展示了我们的核心网络推断算法在模拟和真实数据集上的成功应用。