转录本子集中的调控元件识别：当前计算方法的比较与整合

Regulatory element identification in subsets of transcripts: comparison and integration of current computational methods.

作者信息

Fan Danhua, Bitterman Peter B, Larsson Ola

机构信息

Department of Medicine, University of Minnesota, Minneapolis, Minnesota 55455, USA.

出版信息

RNA. 2009 Aug;15(8):1469-82. doi: 10.1261/rna.1617009. Epub 2009 Jun 24.

DOI:10.1261/rna.1617009

PMID:19553345

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2714745/

Abstract

Regulatory elements in mRNA play an often pivotal role in post-transcriptional regulation of gene expression. However, a systematic approach to efficiently identify putative regulatory elements from sets of post-transcriptionally coregulated genes is lacking, hampering studies of coregulation mechanisms. Although there are several analytical methods that can be used to detect conserved mRNA regulatory elements in a set of transcripts, there has been no systematic study of how well any of these methods perform individually or as a group. We therefore compared how well three algorithms, each based on a different principle (enumeration, optimization, or structure/sequence profiles), can identify elements in unaligned untranslated sequence regions. Two algorithms were originally designed to detect transcription factor binding sites, Weeder and BioProspector; and one was designed to detect RNA elements conserved in structure, RNAProfile. Three types of elements were examined: (1) elements conserved in both primary sequence and secondary structure; (2) elements conserved only in primary sequence; and (3) microRNA targets. Our results indicate that all methods can uniquely identify certain known RNA elements, and therefore, integrating the output from all algorithms leads to the most complete identification of elements. We therefore developed an approach to integrate results and guide selection of candidate elements from several algorithms presented as a web service (https://dbw.msi.umn.edu:8443/recit). These findings together with the approach for integration can be used to identify candidate elements from genome-wide post-transcriptional profiling data sets.

摘要

信使核糖核酸（mRNA）中的调控元件在基因表达的转录后调控中常常发挥关键作用。然而，目前缺乏一种系统的方法来有效地从转录后共调控基因集中识别假定的调控元件，这阻碍了对共调控机制的研究。尽管有几种分析方法可用于检测一组转录本中保守的mRNA调控元件，但尚未对这些方法单独或作为一个整体的性能进行系统研究。因此，我们比较了三种基于不同原理（枚举、优化或结构/序列图谱）的算法在未比对的非翻译序列区域中识别元件的能力。两种算法最初设计用于检测转录因子结合位点，即Weeder和BioProspector；另一种设计用于检测结构上保守的RNA元件，即RNAProfile。我们研究了三种类型的元件：（1）在一级序列和二级结构中均保守的元件；（2）仅在一级序列中保守的元件；（3）微小RNA靶标。我们的结果表明，所有方法都能独特地识别某些已知的RNA元件，因此，整合所有算法的输出可实现对元件的最全面识别。因此，我们开发了一种方法来整合结果，并从作为网络服务呈现的几种算法中指导候选元件的选择（https://dbw.msi.umn.edu:8443/recit）。这些发现以及整合方法可用于从全基因组转录后分析数据集中识别候选元件。

相似文献

Regulatory element identification in subsets of transcripts: comparison and integration of current computational methods.

RNA. 2009 Aug;15(8):1469-82. doi: 10.1261/rna.1617009. Epub 2009 Jun 24.

Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data.

BMC Bioinformatics. 2006 Jul 4;7:330. doi: 10.1186/1471-2105-7-330.

MoD Tools: regulatory motif discovery in nucleotide sequences from co-regulated or homologous genes.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W566-70. doi: 10.1093/nar/gkl285.

CompMoby: comparative MobyDick for detection of cis-regulatory motifs.

BMC Bioinformatics. 2008 Oct 27;9:455. doi: 10.1186/1471-2105-9-455.

Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

BMC Genomics. 2017 Mar 27;18(1):259. doi: 10.1186/s12864-017-3645-2.

Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification.

RNA. 2004 Sep;10(9):1309-22. doi: 10.1261/rna.5206304.

RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences.

Nucleic Acids Res. 2004 Jun 15;32(10):3258-69. doi: 10.1093/nar/gkh650. Print 2004.

Ab initio identification of putative human transcription factor binding sites by comparative genomics.

BMC Bioinformatics. 2005 May 2;6:110. doi: 10.1186/1471-2105-6-110.

Web-based tools for studying RNA structure and function.

Methods Mol Biol. 2011;703:67-86. doi: 10.1007/978-1-59745-248-9_6.

Comparative promoter region analysis powered by CORG.

BMC Genomics. 2005 Feb 21;6:24. doi: 10.1186/1471-2164-6-24.

引用本文的文献

Anota2seq Analysis for Transcriptome-Wide Studies of mRNA Translation.

Methods Mol Biol. 2022;2418:243-268. doi: 10.1007/978-1-0716-1920-9_15.

DynaMIT: the dynamic motif integration toolkit.

Nucleic Acids Res. 2016 Jan 8;44(1):e2. doi: 10.1093/nar/gkv807. Epub 2015 Aug 7.

Evolutionary Dynamics of GLD-1-mRNA complexes in Caenorhabditis nematodes.

Genome Biol Evol. 2014 Dec 9;7(1):314-35. doi: 10.1093/gbe/evu272.

Toward a genome-wide landscape of translational control.

Cold Spring Harb Perspect Biol. 2013 Jan 1;5(1):a012302. doi: 10.1101/cshperspect.a012302.

A miRNA-regulatory network explains how dysregulated miRNAs perturb oncogenic processes across diverse cancers.

Genome Res. 2012 Nov;22(11):2302-14. doi: 10.1101/gr.133991.111. Epub 2012 Jun 28.

Known and novel post-transcriptional regulatory sequences are conserved across plant families.

RNA. 2012 Mar;18(3):368-84. doi: 10.1261/rna.031179.111. Epub 2012 Jan 11.

miRvestigator: web application to identify miRNAs responsible for co-regulated gene expression patterns discovered through transcriptome profiling.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W125-31. doi: 10.1093/nar/gkr374. Epub 2011 May 20.

Identification of differential translation in genome wide studies.

Proc Natl Acad Sci U S A. 2010 Dec 14;107(50):21487-92. doi: 10.1073/pnas.1006821107. Epub 2010 Nov 29.

本文引用的文献

Transcription factor and microRNA motif discovery: the Amadeus platform and a compendium of metazoan target sets.

Genome Res. 2008 Jul;18(7):1180-9. doi: 10.1101/gr.076117.108. Epub 2008 Apr 14.

Conserved GU-rich elements mediate mRNA decay by binding to CUG-binding protein 1.

Mol Cell. 2008 Feb 1;29(2):263-70. doi: 10.1016/j.molcel.2007.11.024.

A fast structural multiple alignment method for long RNA sequences.

BMC Bioinformatics. 2008 Jan 23;9:33. doi: 10.1186/1471-2105-9-33.

MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing.

Bioinformatics. 2007 Dec 15;23(24):3304-11. doi: 10.1093/bioinformatics/btm525. Epub 2007 Nov 15.

Finding a common motif of RNA sequences using genetic programming: the GeRNAMo system.

IEEE/ACM Trans Comput Biol Bioinform. 2007 Oct-Dec;4(4):596-610. doi: 10.1109/tcbb.2007.1045.

Eukaryotic translation initiation factor 4E induced progression of primary human mammary epithelial cells along the cancer pathway is associated with targeted translational deregulation of oncogenic drivers and inhibitors.

Cancer Res. 2007 Jul 15;67(14):6814-24. doi: 10.1158/0008-5472.CAN-07-0752.

RNA regulons: coordination of post-transcriptional events.

Nat Rev Genet. 2007 Jul;8(7):533-43. doi: 10.1038/nrg2111.

Tuberous sclerosis complex proteins 1 and 2 control serum-dependent translation in a TOP-dependent and -independent manner.

Mol Cell Biol. 2007 Aug;27(16):5746-64. doi: 10.1128/MCB.02136-06. Epub 2007 Jun 11.

RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.

Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.

Epigenetic activation of a subset of mRNAs by eIF4E explains its effects on cell proliferation.

PLoS One. 2007 Feb 21;2(2):e242. doi: 10.1371/journal.pone.0000242.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

转录本子集中的调控元件识别：当前计算方法的比较与整合

Regulatory element identification in subsets of transcripts: comparison and integration of current computational methods.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献