Department of Computer Science, Yale University, New Haven, Connecticut, United States of America.
PLoS One. 2010 Jan 26;5(1):e8121. doi: 10.1371/journal.pone.0008121.
We performed computational reconstruction of the in silico gene regulatory networks in the DREAM3 Challenges. Our task was to learn the networks from two types of data, namely gene expression profiles in deletion strains (the 'deletion data') and time series trajectories of gene expression after some initial perturbation (the 'perturbation data'). In the course of developing the prediction method, we observed that the two types of data contained different and complementary information about the underlying network. In particular, deletion data allow for the detection of direct regulatory activities with strong responses upon the deletion of the regulator while perturbation data provide richer information for the identification of weaker and more complex types of regulation. We applied different techniques to learn the regulation from the two types of data. For deletion data, we learned a noise model to distinguish real signals from random fluctuations using an iterative method. For perturbation data, we used differential equations to model the change of expression levels of a gene along the trajectories due to the regulation of other genes. We tried different models, and combined their predictions. The final predictions were obtained by merging the results from the two types of data. A comparison with the actual regulatory networks suggests that our approach is effective for networks with a range of different sizes. The success of the approach demonstrates the importance of integrating heterogeneous data in network reconstruction.
我们对 DREAM3 挑战赛中的计算机基因调控网络进行了计算重建。我们的任务是从两种类型的数据中学习网络,即删除菌株中的基因表达谱(“删除数据”)和初始扰动后基因表达的时间序列轨迹(“扰动数据”)。在开发预测方法的过程中,我们观察到两种类型的数据包含了有关基础网络的不同且互补的信息。特别是,删除数据允许检测到在删除调节剂时具有强烈反应的直接调控活动,而扰动数据为识别较弱和更复杂类型的调控提供了更丰富的信息。我们应用不同的技术从两种类型的数据中学习调控。对于删除数据,我们使用迭代方法学习噪声模型来区分真实信号和随机波动。对于扰动数据,我们使用微分方程来模拟由于其他基因的调控而导致基因表达水平在轨迹上的变化。我们尝试了不同的模型,并结合了它们的预测。最终的预测是通过合并两种类型的数据的结果获得的。与实际调控网络的比较表明,我们的方法对于具有不同大小范围的网络是有效的。该方法的成功证明了在网络重建中整合异构数据的重要性。