Eduati Federica, Di Camillo Barbara, Karbiener Michael, Scheideler Marcel, Corà Davide, Caselle Michele, Toffolo Gianna
Department of Information Engineering, University of Padova, Padova, Italy.
J Comput Biol. 2012 Feb;19(2):188-99. doi: 10.1089/cmb.2011.0274.
Given the important role of microRNAs (miRNAs) in genome-wide regulation of gene expression, increasing interest is devoted to mixed transcriptional and post-transcriptional regulatory networks analyzing the combinatorial effect of transcription factors (TFs) and miRNAs on target genes. In particular, miRNAs are known to be involved in feed-forward loops (FFLs), where a TF regulates a miRNA and they both regulate a target gene. Different algorithms have been proposed to identify miRNA targets, based on pairing between the 5' region of the miRNA and the 3'UTR of the target gene, and correlation between miRNA host genes and target mRNA expression data. Here we propose a quantitative approach integrating an existing method for mixed FFL identification based on sequence analysis with differential equation modeling approach that permits us to select active FFLs based on their dynamics. Different models are assessed based on their ability to properly reproduce miRNA and mRNA expression data in terms of identification criteria, namely: goodness of fit, precision of the estimates, and comparison with submodels. In comparison with standard approaches based on correlation, our method improves in specificity. As a case study, we applied our method to adipogenic differentiation gene expression data providing potential novel players in this regulatory network. Supplementary Material for this article is available at www.liebertonline.com/cmb.
鉴于微小RNA(miRNA)在全基因组基因表达调控中的重要作用,人们越来越关注分析转录因子(TF)和miRNA对靶基因的组合效应的混合转录和转录后调控网络。特别是,已知miRNA参与前馈环(FFL),其中一个TF调控一个miRNA,并且它们两者都调控一个靶基因。基于miRNA的5'区域与靶基因的3'UTR之间的配对以及miRNA宿主基因与靶mRNA表达数据之间的相关性,已经提出了不同的算法来识别miRNA靶标。在这里,我们提出了一种定量方法,将基于序列分析的现有混合FFL识别方法与微分方程建模方法相结合,使我们能够根据其动力学选择活跃的FFL。根据不同模型在识别标准方面正确再现miRNA和mRNA表达数据的能力来评估不同模型,即:拟合优度、估计精度以及与子模型的比较。与基于相关性的标准方法相比,我们的方法在特异性方面有所提高。作为一个案例研究,我们将我们的方法应用于脂肪生成分化基因表达数据,揭示了该调控网络中潜在的新参与者。本文的补充材料可在www.liebertonline.com/cmb上获取。