IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):322-335. doi: 10.1109/TCBB.2017.2764908. Epub 2017 Oct 23.
To survive environmental conditions, cells transcribe their response activities into encoded mRNA sequences in order to produce certain amounts of protein concentrations. The external conditions are mapped into the cell through the activation of special proteins called transcription factors (TFs). Due to the difficult task to measure experimentally TF behaviors, and the challenges to capture their quick-time dynamics, different types of models based on differential equations have been proposed. However, those approaches usually incur in costly procedures, and they present problems to describe sudden changes in TF regulators. In this paper, we present a switched dynamical latent force model for reverse-engineering transcriptional regulation in gene expression data which allows the exact inference over latent TF activities driving some observed gene expressions through a linear differential equation. To deal with discontinuities in the dynamics, we introduce an approach that switches between different TF activities and different dynamical systems. This creates a versatile representation of transcription networks that can capture discrete changes and non-linearities. We evaluate our model on both simulated data and real data (e.g., microaerobic shift in E. coli, yeast respiration), concluding that our framework allows for the fitting of the expression data while being able to infer continuous-time TF profiles.
为了在环境条件下生存,细胞将其反应活动转录为编码的 mRNA 序列,以产生一定量的蛋白质浓度。外部条件通过激活称为转录因子 (TF) 的特殊蛋白质映射到细胞中。由于难以通过实验测量 TF 行为,并且难以捕捉其快速动态,因此已经提出了基于微分方程的不同类型的模型。然而,这些方法通常涉及昂贵的程序,并且在描述 TF 调节剂的突然变化方面存在问题。在本文中,我们提出了一种用于反向工程基因表达数据中转录调控的开关动态潜在力模型,该模型允许通过线性微分方程对驱动某些观察到的基因表达的潜在 TF 活动进行精确推断。为了处理动力学中的不连续性,我们引入了一种在不同的 TF 活动和不同的动力系统之间切换的方法。这为转录网络创建了一个通用的表示形式,可以捕获离散变化和非线性。我们在模拟数据和真实数据(例如,大肠杆菌的微需氧转移、酵母呼吸)上评估了我们的模型,得出的结论是,我们的框架允许拟合表达数据,同时能够推断连续时间 TF 分布。