Center for Engineering in Medicine, Massachusetts General Hospital, Harvard Medical School, Shriners Hospital for Children, 51 Blossom Street, Boston, MA-02114, USA.
Bioinformatics. 2010 May 15;26(10):1332-9. doi: 10.1093/bioinformatics/btq139. Epub 2010 Mar 31.
Primary purpose of modeling gene regulatory networks for developmental process is to reveal pathways governing the cellular differentiation to specific phenotypes. Knowledge of differentiation network will enable generation of desired cell fates by careful alteration of the governing network by adequate manipulation of cellular environment.
We have developed a novel integer programming-based approach to reconstruct the underlying regulatory architecture of differentiating embryonic stem cells from discrete temporal gene expression data. The network reconstruction problem is formulated using inherent features of biological networks: (i) that of cascade architecture which enables treatment of the entire complex network as a set of interconnected modules and (ii) that of sparsity of interconnection between the transcription factors. The developed framework is applied to the system of embryonic stem cells differentiating towards pancreatic lineage. Experimentally determined expression profile dynamics of relevant transcription factors serve as the input to the network identification algorithm. The developed formulation accurately captures many of the known regulatory modes involved in pancreatic differentiation. The predictive capacity of the model is tested by simulating an in silico potential pathway of subsequent differentiation. The predicted pathway is experimentally verified by concurrent differentiation experiments. Experimental results agree well with model predictions, thereby illustrating the predictive accuracy of the proposed algorithm.
Supplementary data are available at Bioinformatics online.
为发育过程建立基因调控网络模型的主要目的是揭示控制细胞向特定表型分化的途径。了解分化网络将使我们能够通过仔细改变调控网络,通过对细胞环境的适当操作,产生所需的细胞命运。
我们开发了一种新的基于整数规划的方法,从离散的时间基因表达数据中重建分化胚胎干细胞的潜在调控结构。网络重建问题是使用生物网络的固有特征来制定的:(i)级联架构,使整个复杂网络能够作为一组相互连接的模块进行处理;(ii)转录因子之间的连接稀疏性。所开发的框架应用于向胰腺谱系分化的胚胎干细胞系统。相关转录因子的实验确定的表达谱动力学作为网络识别算法的输入。所开发的公式准确地捕捉到了许多已知的参与胰腺分化的调控模式。通过模拟随后分化的潜在途径的模拟来测试模型的预测能力。通过同时进行的分化实验验证了预测途径。实验结果与模型预测吻合良好,从而说明了所提出算法的预测准确性。
补充数据可在“Bioinformatics”在线获取。