Zak Daniel E, Gonye Gregory E, Schwaber James S, Doyle Francis J
Department of Chemical Engineering, University of Delaware, Newark, Delaware 19716, USA.
Genome Res. 2003 Nov;13(11):2396-405. doi: 10.1101/gr.1198103.
Gene expression profiles are an increasingly common data source that can yield insights into the functions of cells at a system-wide level. The present work considers the limitations in information content of gene expression data for reverse engineering regulatory networks. An in silico genetic regulatory network was constructed for this purpose. Using the in silico network, a formal identifiability analysis was performed that considered the accuracy with which the parameters in the network could be estimated using gene expression data and prior structural knowledge (which transcription factors regulate which genes) as a function of the input perturbation and stochastic gene expression. The analysis yielded experimentally relevant results. It was observed that, in addition to prior structural knowledge, prior knowledge of kinetic parameters, particularly mRNA degradation rate constants, was necessary for the network to be identifiable. Also, with the exception of cases where the noise due to stochastic gene expression was high, complex perturbations were more favorable for identifying the network than simple ones. Although the results may be specific to the network considered, the present study provides a framework for posing similar questions in other systems.
基因表达谱是一种越来越常见的数据源,它能够在全系统水平上深入了解细胞的功能。目前的工作考虑了在反向工程调控网络时基因表达数据信息内容方面的局限性。为此构建了一个计算机模拟的遗传调控网络。利用这个计算机模拟网络,进行了一项形式可识别性分析,该分析考虑了根据输入扰动和随机基因表达,使用基因表达数据和先验结构知识(哪些转录因子调控哪些基因)来估计网络中参数的准确性。该分析得出了与实验相关的结果。研究发现,除了先验结构知识外,动力学参数的先验知识,特别是mRNA降解速率常数,对于网络的可识别性也是必要的。此外,除了随机基因表达导致的噪声较高的情况外,复杂扰动比简单扰动更有利于识别网络。尽管结果可能特定于所考虑的网络,但本研究为在其他系统中提出类似问题提供了一个框架。