Fan Danhua, Bitterman Peter B, Larsson Ola
Department of Medicine, University of Minnesota, Minneapolis, Minnesota 55455, USA.
RNA. 2009 Aug;15(8):1469-82. doi: 10.1261/rna.1617009. Epub 2009 Jun 24.
Regulatory elements in mRNA play an often pivotal role in post-transcriptional regulation of gene expression. However, a systematic approach to efficiently identify putative regulatory elements from sets of post-transcriptionally coregulated genes is lacking, hampering studies of coregulation mechanisms. Although there are several analytical methods that can be used to detect conserved mRNA regulatory elements in a set of transcripts, there has been no systematic study of how well any of these methods perform individually or as a group. We therefore compared how well three algorithms, each based on a different principle (enumeration, optimization, or structure/sequence profiles), can identify elements in unaligned untranslated sequence regions. Two algorithms were originally designed to detect transcription factor binding sites, Weeder and BioProspector; and one was designed to detect RNA elements conserved in structure, RNAProfile. Three types of elements were examined: (1) elements conserved in both primary sequence and secondary structure; (2) elements conserved only in primary sequence; and (3) microRNA targets. Our results indicate that all methods can uniquely identify certain known RNA elements, and therefore, integrating the output from all algorithms leads to the most complete identification of elements. We therefore developed an approach to integrate results and guide selection of candidate elements from several algorithms presented as a web service (https://dbw.msi.umn.edu:8443/recit). These findings together with the approach for integration can be used to identify candidate elements from genome-wide post-transcriptional profiling data sets.
信使核糖核酸(mRNA)中的调控元件在基因表达的转录后调控中常常发挥关键作用。然而,目前缺乏一种系统的方法来有效地从转录后共调控基因集中识别假定的调控元件,这阻碍了对共调控机制的研究。尽管有几种分析方法可用于检测一组转录本中保守的mRNA调控元件,但尚未对这些方法单独或作为一个整体的性能进行系统研究。因此,我们比较了三种基于不同原理(枚举、优化或结构/序列图谱)的算法在未比对的非翻译序列区域中识别元件的能力。两种算法最初设计用于检测转录因子结合位点,即Weeder和BioProspector;另一种设计用于检测结构上保守的RNA元件,即RNAProfile。我们研究了三种类型的元件:(1)在一级序列和二级结构中均保守的元件;(2)仅在一级序列中保守的元件;(3)微小RNA靶标。我们的结果表明,所有方法都能独特地识别某些已知的RNA元件,因此,整合所有算法的输出可实现对元件的最全面识别。因此,我们开发了一种方法来整合结果,并从作为网络服务呈现的几种算法中指导候选元件的选择(https://dbw.msi.umn.edu:8443/recit)。这些发现以及整合方法可用于从全基因组转录后分析数据集中识别候选元件。