• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用后缀数组识别共有RNA二级结构。

Identification of consensus RNA secondary structures using suffix arrays.

作者信息

Anwar Mohammad, Nguyen Truong, Turcotte Marcel

机构信息

School of Information Technology and Engineering, University of Ottawa, Ottawa, Ontario, Canada.

出版信息

BMC Bioinformatics. 2006 May 5;7:244. doi: 10.1186/1471-2105-7-244.

DOI:10.1186/1471-2105-7-244
PMID:16677380
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1475642/
Abstract

BACKGROUND

The identification of a consensus RNA motif often consists in finding a conserved secondary structure with minimum free energy in an ensemble of aligned sequences. However, an alignment is often difficult to obtain without prior structural information. Thus the need for tools to automate this process.

RESULTS

We present an algorithm called Seed to identify all the conserved RNA secondary structure motifs in a set of unaligned sequences. The search space is defined as the set of all the secondary structure motifs inducible from a seed sequence. A general-to-specific search allows finding all the motifs that are conserved. Suffix arrays are used to enumerate efficiently all the biological palindromes as well as for the matching of RNA secondary structure expressions. We assessed the ability of this approach to uncover known structures using four datasets. The enumeration of the motifs relies only on the secondary structure definition and conservation only, therefore allowing for the independent evaluation of scoring schemes. Twelve simple objective functions based on free energy were evaluated for their potential to discriminate native folds from the rest.

CONCLUSION

Our evaluation shows that 1) support and exclusion constraints are sufficient to make an exhaustive search of the secondary structure space feasible. 2) The search space induced from a seed sequence contains known motifs. 3) Simple objective functions, consisting of a combination of the free energy of matching sequences, can generally identify motifs with high positive predictive value and sensitivity to known motifs.

摘要

背景

识别共有RNA基序通常在于在一组比对序列中找到具有最小自由能的保守二级结构。然而,在没有先验结构信息的情况下,通常很难获得比对结果。因此需要自动化此过程的工具。

结果

我们提出了一种名为Seed的算法,用于识别一组未比对序列中的所有保守RNA二级结构基序。搜索空间被定义为可从种子序列诱导出的所有二级结构基序的集合。从一般到特殊的搜索允许找到所有保守的基序。后缀数组用于高效枚举所有生物回文序列以及用于RNA二级结构表达式的匹配。我们使用四个数据集评估了这种方法揭示已知结构的能力。基序的枚举仅依赖于二级结构定义和保守性,因此允许对评分方案进行独立评估。基于自由能评估了十二个简单目标函数区分天然折叠与其他结构的潜力。

结论

我们的评估表明:1)支持和排除约束足以使对二级结构空间的穷举搜索可行。2)从种子序列诱导出的搜索空间包含已知基序。3)由匹配序列的自由能组合而成的简单目标函数通常可以识别出对已知基序具有高阳性预测值和敏感性的基序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/e5b5ec5f3312/1471-2105-7-244-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/98780d2ed0b6/1471-2105-7-244-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/b0f782fa504c/1471-2105-7-244-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/922f33c67a64/1471-2105-7-244-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/30c95641ae4d/1471-2105-7-244-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/e5852abe02f1/1471-2105-7-244-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/a2f3da0498a2/1471-2105-7-244-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/019cb791d48b/1471-2105-7-244-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/b3645719ad1b/1471-2105-7-244-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/21d17caa9c83/1471-2105-7-244-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/cd5efb5917e5/1471-2105-7-244-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/e5b5ec5f3312/1471-2105-7-244-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/98780d2ed0b6/1471-2105-7-244-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/b0f782fa504c/1471-2105-7-244-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/922f33c67a64/1471-2105-7-244-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/30c95641ae4d/1471-2105-7-244-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/e5852abe02f1/1471-2105-7-244-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/a2f3da0498a2/1471-2105-7-244-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/019cb791d48b/1471-2105-7-244-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/b3645719ad1b/1471-2105-7-244-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/21d17caa9c83/1471-2105-7-244-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/cd5efb5917e5/1471-2105-7-244-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/983d/1475642/e5b5ec5f3312/1471-2105-7-244-11.jpg

相似文献

1
Identification of consensus RNA secondary structures using suffix arrays.使用后缀数组识别共有RNA二级结构。
BMC Bioinformatics. 2006 May 5;7:244. doi: 10.1186/1471-2105-7-244.
2
A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences.一种用于预测未比对序列中包括假结在内的常见RNA二级结构基序的图论方法。
Bioinformatics. 2004 Jul 10;20(10):1591-602. doi: 10.1093/bioinformatics/bth131. Epub 2004 Feb 12.
3
RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences.RNAProfile:一种用于在未比对的RNA序列中寻找保守二级结构基序的算法。
Nucleic Acids Res. 2004 Jun 15;32(10):3258-69. doi: 10.1093/nar/gkh650. Print 2004.
4
RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.RNA采样器:一种基于采样的新算法,用于常见RNA二级结构预测和结构比对。
Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.
5
An algorithm for searching RNA motifs in genomic sequences.一种在基因组序列中搜索RNA基序的算法。
Biomol Eng. 2007 Sep;24(3):343-50. doi: 10.1016/j.bioeng.2007.02.005. Epub 2007 Mar 3.
6
Finding common sequence and structure motifs in a set of RNA sequences.在一组RNA序列中寻找共同的序列和结构基序。
Proc Int Conf Intell Syst Mol Biol. 1997;5:120-3.
7
Mining frequent stem patterns from unaligned RNA sequences.从未比对的RNA序列中挖掘频繁茎模式。
Bioinformatics. 2006 Oct 15;22(20):2480-7. doi: 10.1093/bioinformatics/btl431. Epub 2006 Aug 14.
8
CMfinder--a covariance model based RNA motif finding algorithm.CMfinder——一种基于协方差模型的RNA基序查找算法。
Bioinformatics. 2006 Feb 15;22(4):445-52. doi: 10.1093/bioinformatics/btk008. Epub 2005 Dec 15.
9
Aligning two RNA secondary structures with l-block.使用l块比对两个RNA二级结构。
Biomol Eng. 2007 Sep;24(3):321-6. doi: 10.1016/j.bioeng.2007.02.004. Epub 2007 Mar 3.
10
Predicting a set of minimal free energy RNA secondary structures common to two sequences.预测两个序列共有的一组最小自由能RNA二级结构。
Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24.

引用本文的文献

1
A novel method for the identification of conserved structural patterns in RNA: From small scale to high-throughput applications.一种用于识别RNA中保守结构模式的新方法:从小规模到高通量应用。
Nucleic Acids Res. 2016 Oct 14;44(18):8600-8609. doi: 10.1093/nar/gkw750. Epub 2016 Aug 31.
2
RiboFSM: frequent subgraph mining for the discovery of RNA structures and interactions.RiboFSM:用于发现RNA结构和相互作用的频繁子图挖掘
BMC Bioinformatics. 2014;15 Suppl 13(Suppl 13):S2. doi: 10.1186/1471-2105-15-S13-S2. Epub 2014 Nov 13.
3
Classification and assessment tools for structural motif discovery algorithms.

本文引用的文献

1
Simultaneous alignment and structure prediction of three RNA sequences.三个RNA序列的同时比对与结构预测
Int J Bioinform Res Appl. 2005;1(2):230-45. doi: 10.1504/IJBRA.2005.007581.
2
Compilation of tRNA sequences and sequences of tRNA genes.tRNA序列及tRNA基因序列的汇编。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D139-40. doi: 10.1093/nar/gki012.
3
Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction.使用最近邻能量参数的自由能最小化对RNA二级结构预测适用性的评估。
结构模体发现算法的分类和评估工具。
BMC Bioinformatics. 2013;14 Suppl 9(Suppl 9):S4. doi: 10.1186/1471-2105-14-S9-S4. Epub 2013 Jun 28.
4
CONS-COCOMAPS: a novel tool to measure and visualize the conservation of inter-residue contacts in multiple docking solutions.CONS-COCOMAPS:一种用于测量和可视化多个对接解决方案中残基间接触守恒性的新工具。
BMC Bioinformatics. 2012 Mar 28;13 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-13-S4-S19.
5
PicXAA-R: efficient structural alignment of multiple RNA sequences using a greedy approach.PicXAA-R:一种基于贪心策略的高效 RNA 序列多重结构比对方法。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S38. doi: 10.1186/1471-2105-12-S1-S38.
6
RNAVLab: A virtual laboratory for studying RNA secondary structures based on grid computing technology.RNAV实验室:一个基于网格计算技术研究RNA二级结构的虚拟实验室。
Parallel Comput. 2008 Nov 1;34(11):661-680. doi: 10.1016/j.parco.2008.08.002.
7
PIDA:A new algorithm for pattern identification.PIDA:一种用于模式识别的新算法。
Online J Bioinform. 2007 Jan 1;8(1):30-40.
8
Informatic resources for identifying and annotating structural RNA motifs.用于识别和注释结构RNA基序的信息资源。
Mol Biotechnol. 2009 Feb;41(2):180-93. doi: 10.1007/s12033-008-9114-z. Epub 2008 Nov 1.
9
Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments.统一多序列比对中RNA折叠的进化信息和热力学信息。
Nucleic Acids Res. 2008 Nov;36(20):6355-62. doi: 10.1093/nar/gkn544. Epub 2008 Oct 4.
10
Can Clustal-style progressive pairwise alignment of multiple sequences be used in RNA secondary structure prediction?多个序列的Clustal式渐进成对比对能否用于RNA二级结构预测?
BMC Bioinformatics. 2007 Jun 8;8:190. doi: 10.1186/1471-2105-8-190.
BMC Bioinformatics. 2004 Aug 5;5:105. doi: 10.1186/1471-2105-5-105.
4
RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences.RNAProfile:一种用于在未比对的RNA序列中寻找保守二级结构基序的算法。
Nucleic Acids Res. 2004 Jun 15;32(10):3258-69. doi: 10.1093/nar/gkh650. Print 2004.
5
Evaluating the predictability of conformational switching in RNA.评估RNA中构象转换的可预测性。
Bioinformatics. 2004 Jul 10;20(10):1573-82. doi: 10.1093/bioinformatics/bth129. Epub 2004 Feb 12.
6
Versatile and open software for comparing large genomes.用于比较大型基因组的通用且开放的软件。
Genome Biol. 2004;5(2):R12. doi: 10.1186/gb-2004-5-2-r12. Epub 2004 Jan 30.
7
MicroRNAs: genomics, biogenesis, mechanism, and function.微小RNA:基因组学、生物发生、作用机制及功能
Cell. 2004 Jan 23;116(2):281-97. doi: 10.1016/s0092-8674(04)00045-5.
8
The riboswitch control of bacterial metabolism.细菌代谢的核糖开关控制
Trends Biochem Sci. 2004 Jan;29(1):11-7. doi: 10.1016/j.tibs.2003.11.004.
9
RNA sensors and riboswitches: self-regulating messages.RNA传感器与核糖开关:自我调节的信息
Curr Biol. 2003 Apr 1;13(7):R285-91. doi: 10.1016/s0960-9822(03)00203-3.
10
Secondary structure prediction for aligned RNA sequences.比对RNA序列的二级结构预测。
J Mol Biol. 2002 Jun 21;319(5):1059-66. doi: 10.1016/S0022-2836(02)00308-X.