Haye Alexandre, Albert Jaroslav, Rooman Marianne
BioModeling, BioInformatics & BioProcesses Department, Université Libre de Bruxelles, Bruxelles, Belgium.
PLoS One. 2014 Mar 3;9(3):e90285. doi: 10.1371/journal.pone.0090285. eCollection 2014.
The development of accurate and reliable dynamical modeling procedures that describe the time evolution of gene expression levels is a prerequisite to understanding and controlling the transcription process. We focused on data from DNA microarray time series for 20 Drosophila genes involved in muscle development during the embryonic stage. Genes with similar expression profiles were clustered on the basis of a translation-invariant and scale-invariant distance measure. The time evolution of these clusters was modeled using coupled differential equations. Three model structures involving a transcription term and a degradation term were tested. The parameters were identified in successive steps: network construction, parameter optimization, and parameter reduction. The solutions were evaluated on the basis of the data reproduction and the number of parameters, as well as on two biology-based requirements: the robustness with respect to parameter variations and the values of the expression levels not being unrealistically large upon extrapolation in time. Various solutions were obtained that satisfied all our evaluation criteria. The regulatory networks inferred from these solutions were compared with experimental data. The best solution has half of the experimental connections, which compares favorably with previous approaches. Biasing the network toward the experimental connections led to the identification of a model that is only slightly less good on the basis of the evaluation criteria. The non-uniqueness of the solutions and the variable agreement with experimental connections were discussed in the context of the different hypotheses underlying this type of approach.
开发准确可靠的动力学建模程序来描述基因表达水平的时间演变是理解和控制转录过程的先决条件。我们专注于来自DNA微阵列时间序列的数据,这些数据涉及20个在胚胎阶段参与肌肉发育的果蝇基因。具有相似表达谱的基因基于平移不变和尺度不变的距离度量进行聚类。使用耦合微分方程对这些聚类的时间演变进行建模。测试了三种涉及转录项和降解项的模型结构。参数通过连续步骤确定:网络构建、参数优化和参数约简。根据数据再现、参数数量以及基于生物学的两个要求对解进行评估:对参数变化的鲁棒性以及在时间外推时表达水平的值不会过大而不切实际。获得了满足我们所有评估标准的各种解。将从这些解推断出的调控网络与实验数据进行比较。最佳解具有一半的实验连接,与先前的方法相比具有优势。使网络偏向实验连接导致识别出一个基于评估标准仅略逊一筹的模型。在这种方法背后的不同假设的背景下讨论了解的非唯一性以及与实验连接的可变一致性。