Suppr超能文献

在未比对的RNA序列中发现常见的茎环基序。

Discovering common stem-loop motifs in unaligned RNA sequences.

作者信息

Gorodkin J, Stricklin S L, Stormo G D

机构信息

Department of Genetics and Ecology, The Institute of Biological Sciences, University of Aarhus, Building 540, Ny Munkegade, DK-8000 Aarhus C, Denmark.

出版信息

Nucleic Acids Res. 2001 May 15;29(10):2135-44. doi: 10.1093/nar/29.10.2135.

Abstract

Post-transcriptional regulation of gene expression is often accomplished by proteins binding to specific sequence motifs in mRNA molecules, to affect their translation or stability. The motifs are often composed of a combination of sequence and structural constraints such that the overall structure is preserved even though much of the primary sequence is variable. While several methods exist to discover transcriptional regulatory sites in the DNA sequences of coregulated genes, the RNA motif discovery problem is much more difficult because of covariation in the positions. We describe the combined use of two approaches for RNA structure prediction, FOLDALIGN and COVE, that together can discover and model stem-loop RNA motifs in unaligned sequences, such as UTRs from post-transcriptionally coregulated genes. We evaluate the method on two datasets, one a section of rRNA genes with randomly truncated ends so that a global alignment is not possible, and the other a hyper-variable collection of IRE-like elements that were inserted into randomized UTR sequences. In both cases the combined method identified the motifs correctly, and in the rRNA example we show that it is capable of determining the structure, which includes bulge and internal loops as well as a variable length hairpin loop. Those automated results are quantitatively evaluated and found to agree closely with structures contained in curated databases, with correlation coefficients up to 0.9. A basic server, Stem-Loop Align SearcH (SLASH), which will perform stem-loop searches in unaligned RNA sequences, is available at http://www.bioinf.au.dk/slash/.

摘要

基因表达的转录后调控通常是通过蛋白质与mRNA分子中的特定序列基序结合来实现的,从而影响其翻译或稳定性。这些基序通常由序列和结构限制的组合构成,这样即使大部分一级序列是可变的,整体结构仍能得以保留。虽然存在多种方法可用于发现共调控基因DNA序列中的转录调控位点,但由于位置上的共变,RNA基序发现问题要困难得多。我们描述了两种RNA结构预测方法FOLDALIGN和COVE的联合使用,它们共同能够在未比对的序列中发现并模拟茎环RNA基序,比如来自转录后共调控基因的非编码区。我们在两个数据集上评估了该方法,一个是rRNA基因的一部分,其末端随机截断,因此无法进行全局比对,另一个是插入到随机非编码区序列中的类似IRE元件的高变集合。在这两种情况下,联合方法都能正确识别基序,在rRNA的例子中,我们表明它能够确定结构,该结构包括凸起环和内环以及可变长度的发夹环。对这些自动化结果进行了定量评估,发现与经过整理的数据库中包含的结构密切相符,相关系数高达0.9。一个基本服务器,即茎环比对搜索(SLASH),可在http://www.bioinf.au.dk/slash/ 进行未比对RNA序列中的茎环搜索。

相似文献

9
RADAR: a web server for RNA data analysis and research.RADAR:一个用于RNA数据分析与研究的网络服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W300-4. doi: 10.1093/nar/gkm253. Epub 2007 May 21.
10

引用本文的文献

2
Comparative RNA Genomics.比较 RNA 基因组学。
Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12.
3
Assessment of three-dimensional RNA structure prediction in CASP15.评估在 CASP15 中三维 RNA 结构预测。
Proteins. 2023 Dec;91(12):1747-1770. doi: 10.1002/prot.26602. Epub 2023 Oct 24.
6
Fitness functions for RNA structure design.RNA 结构设计的适应度函数。
Nucleic Acids Res. 2023 Apr 24;51(7):e40. doi: 10.1093/nar/gkad097.
9
The locality dilemma of Sankoff-like RNA alignments.Sankoff 型 RNA 比对的局部困境。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i242-i250. doi: 10.1093/bioinformatics/btaa431.

本文引用的文献

9
The European small subunit ribosomal RNA database.欧洲小亚基核糖体RNA数据库。
Nucleic Acids Res. 2000 Jan 1;28(1):175-6. doi: 10.1093/nar/28.1.175.
10
The RDP (Ribosomal Database Project) continues.核糖体数据库项目(RDP)仍在继续。
Nucleic Acids Res. 2000 Jan 1;28(1):173-4. doi: 10.1093/nar/28.1.173.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验