• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于精确序列-结构匹配的 RNA 轻量级比较。

Lightweight comparison of RNAs based on exact sequence-structure matches.

机构信息

Bioinformatics Group, Albert-Ludwigs-University Freiburg, Georges-Koehler-Allee 106, Freiburg, D-79110, Germany.

出版信息

Bioinformatics. 2009 Aug 15;25(16):2095-102. doi: 10.1093/bioinformatics/btp065. Epub 2009 Feb 2.

DOI:10.1093/bioinformatics/btp065
PMID:19189979
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2722993/
Abstract

MOTIVATION

Specific functions of ribonucleic acid (RNA) molecules are often associated with different motifs in the RNA structure. The key feature that forms such an RNA motif is the combination of sequence and structure properties. In this article, we introduce a new RNA sequence-structure comparison method which maintains exact matching substructures. Existing common substructures are treated as whole unit while variability is allowed between such structural motifs. Based on a fast detectable set of overlapping and crossing substructure matches for two nested RNA secondary structures, our method ExpaRNA (exact pattern of alignment of RNA) computes the longest collinear sequence of substructures common to two RNAs in O(H.nm) time and O(nm) space, where H << n.m for real RNA structures. Applied to different RNAs, our method correctly identifies sequence-structure similarities between two RNAs.

RESULTS

We have compared ExpaRNA with two other alignment methods that work with given RNA structures, namely RNAforester and RNA_align. The results are in good agreement, but can be obtained in a fraction of running time, in particular for larger RNAs. We have also used ExpaRNA to speed up state-of-the-art Sankoff-style alignment tools like LocARNA, and observe a tradeoff between quality and speed. However, we get a speedup of 4.25 even in the highest quality setting, where the quality of the produced alignment is comparable to that of LocARNA alone.

AVAILABILITY

The presented algorithm is implemented in the program ExpaRNA, which is available from our website (http://www.bioinf.uni-freiburg.de/Software).

摘要

动机

核糖核酸(RNA)分子的特定功能通常与 RNA 结构中的不同基序相关联。形成这种 RNA 基序的关键特征是序列和结构特性的组合。在本文中,我们引入了一种新的 RNA 序列-结构比较方法,该方法保持精确匹配的子结构。现有的常见子结构被视为整体单元,同时允许这些结构基序之间存在可变性。基于两个嵌套 RNA 二级结构的可检测重叠和交叉子结构匹配的快速检测集,我们的方法 ExpaRNA(RNA 对齐的精确模式)在 O(H.nm)时间和 O(nm)空间中计算两个 RNA 之间共有子结构的最长共线性序列,其中 H << n.m 适用于真实的 RNA 结构。应用于不同的 RNA,我们的方法正确识别两个 RNA 之间的序列-结构相似性。

结果

我们将 ExpaRNA 与另外两种适用于给定 RNA 结构的对齐方法 RNAforester 和 RNA_align 进行了比较。结果非常吻合,但运行时间可以缩短到一小部分,特别是对于较大的 RNA。我们还使用 ExpaRNA 来加速 Sankoff 风格的对齐工具,如 LocARNA,并观察到质量和速度之间的权衡。然而,即使在最高质量设置下,我们也可以获得 4.25 的加速,其中产生的对齐质量与单独使用 LocARNA 的质量相当。

可用性

所提出的算法在程序 ExpaRNA 中实现,该程序可从我们的网站(http://www.bioinf.uni-freiburg.de/Software)获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/fbc9f563b573/btp065f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/4e274cd17387/btp065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/9a8b940d7995/btp065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/defb1d199e3e/btp065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/434794309db2/btp065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/bbc2832c67a8/btp065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/ae4fdbd06269/btp065f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/a0d85ddb2deb/btp065f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/7d01781cfb8e/btp065f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/fbc9f563b573/btp065f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/4e274cd17387/btp065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/9a8b940d7995/btp065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/defb1d199e3e/btp065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/434794309db2/btp065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/bbc2832c67a8/btp065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/ae4fdbd06269/btp065f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/a0d85ddb2deb/btp065f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/7d01781cfb8e/btp065f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f28/2722993/fbc9f563b573/btp065f9.jpg

相似文献

1
Lightweight comparison of RNAs based on exact sequence-structure matches.基于精确序列-结构匹配的 RNA 轻量级比较。
Bioinformatics. 2009 Aug 15;25(16):2095-102. doi: 10.1093/bioinformatics/btp065. Epub 2009 Feb 2.
2
ExpaRNA-P: simultaneous exact pattern matching and folding of RNAs.ExpaRNA-P:RNA的同步精确模式匹配与折叠
BMC Bioinformatics. 2014 Dec 31;15(1):404. doi: 10.1186/s12859-014-0404-0.
3
Freiburg RNA Tools: a web server integrating INTARNA, EXPARNA and LOCARNA.弗莱堡 RNA 工具:一个整合了 INTARNA、EXPARNA 和 LOCARNA 的网络服务器。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W373-7. doi: 10.1093/nar/gkq316. Epub 2010 May 5.
4
CARNA--alignment of RNA structure ensembles.CARNA--RNA 结构集合的对齐。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W49-53. doi: 10.1093/nar/gks491. Epub 2012 Jun 11.
5
Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns.快速在线和基于索引的算法,用于近似搜索 RNA 序列-结构模式。
BMC Bioinformatics. 2013 Jul 17;14:226. doi: 10.1186/1471-2105-14-226.
6
LocARNA 2.0: Versatile Simultaneous Alignment and Folding of RNAs.LocARNA 2.0:通用的 RNA 同时对齐和折叠。
Methods Mol Biol. 2024;2726:235-254. doi: 10.1007/978-1-0716-3519-3_10.
7
INFO-RNA--a server for fast inverse RNA folding satisfying sequence constraints.INFO-RNA——一个用于满足序列约束条件的快速RNA反向折叠的服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W310-3. doi: 10.1093/nar/gkm218. Epub 2007 Apr 22.
8
RNAscClust: clustering RNA sequences using structure conservation and graph based motifs.RNAscClust:使用结构保守性和基于图的基元对 RNA 序列进行聚类。
Bioinformatics. 2017 Jul 15;33(14):2089-2096. doi: 10.1093/bioinformatics/btx114.
9
A method for aligning RNA secondary structures and its application to RNA motif detection.一种用于比对RNA二级结构的方法及其在RNA基序检测中的应用。
BMC Bioinformatics. 2005 Apr 7;6:89. doi: 10.1186/1471-2105-6-89.
10
RNA structural alignments, part II: non-Sankoff approaches for structural alignments.RNA结构比对,第二部分:结构比对的非桑科夫方法。
Methods Mol Biol. 2014;1097:291-301. doi: 10.1007/978-1-62703-709-9_14.

引用本文的文献

1
ECSFinder: optimized prediction of evolutionarily conserved RNA secondary structures from genome sequences.ECSFinder:从基因组序列中对进化保守RNA二级结构进行优化预测。
Nucleic Acids Res. 2025 Aug 11;53(15). doi: 10.1093/nar/gkaf780.
2
Decryption of sequence, structure, and functional features of SINE repeat elements in SINEUP non-coding RNA-mediated post-transcriptional gene regulation.SINEUP 非编码 RNA 介导的转录后基因调控中 SINE 重复元件序列、结构和功能特征的解密。
Nat Commun. 2024 Feb 21;15(1):1400. doi: 10.1038/s41467-024-45517-3.
3
Discovery of putative long non-coding RNAs expressed in the eyes of Astyanax mexicanus (Actinopterygii: Characidae).

本文引用的文献

1
Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix.通过修剪动态规划矩阵实现快速成对结构RNA比对。
PLoS Comput Biol. 2007 Oct;3(10):1896-908. doi: 10.1371/journal.pcbi.0030193. Epub 2007 Aug 20.
2
Ribozymes, riboswitches and beyond: regulation of gene expression without proteins.核酶、核糖开关及其他:无蛋白质参与的基因表达调控
Nat Rev Genet. 2007 Oct;8(10):776-90. doi: 10.1038/nrg2172. Epub 2007 Sep 11.
3
Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization.
在墨西哥脂鲤(硬骨鱼纲:脂鲤科)眼睛中表达的假定长非编码 RNA 的发现。
Sci Rep. 2023 Jul 25;13(1):12051. doi: 10.1038/s41598-023-34198-5.
4
SSRTool: A web tool for evaluating RNA secondary structure predictions based on species-specific functional interpretability.SSRTool:一种基于物种特异性功能可解释性评估RNA二级结构预测的网络工具。
Comput Struct Biotechnol J. 2022 May 18;20:2473-2483. doi: 10.1016/j.csbj.2022.05.028. eCollection 2022.
5
Freiburg RNA tools: a central online resource for RNA-focused research and teaching.弗莱堡 RNA 工具:一个专注于 RNA 研究和教学的集中在线资源。
Nucleic Acids Res. 2018 Jul 2;46(W1):W25-W29. doi: 10.1093/nar/gky329.
6
sRNA a newly identified regulator of nitrogen fixation in Methanosarcina mazei strain Gö1.sRNA 是一种新发现的甲烷八叠球菌 Gö1 菌株固氮调控因子。
RNA Biol. 2017 Nov 2;14(11):1544-1558. doi: 10.1080/15476286.2017.1306170. Epub 2017 Apr 27.
7
RNA motif search with data-driven element ordering.基于数据驱动的元件排序进行RNA基序搜索。
BMC Bioinformatics. 2016 May 18;17(1):216. doi: 10.1186/s12859-016-1074-x.
8
ExpaRNA-P: simultaneous exact pattern matching and folding of RNAs.ExpaRNA-P:RNA的同步精确模式匹配与折叠
BMC Bioinformatics. 2014 Dec 31;15(1):404. doi: 10.1186/s12859-014-0404-0.
9
R-PASS: A Fast Structure-based RNA Sequence Alignment Algorithm.R-PASS:一种基于结构的快速RNA序列比对算法。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2011 Dec 31;2011:618-622. doi: 10.1109/BIBM.2011.74.
10
LocARNAscan: Incorporating thermodynamic stability in sequence and structure-based RNA homology search.LocARNAscan:在基于序列和结构的RNA同源性搜索中纳入热力学稳定性
Algorithms Mol Biol. 2013 Apr 20;8:14. doi: 10.1186/1748-7188-8-14. eCollection 2013.
使用组合优化对RNA序列进行准确的多序列-结构比对。
BMC Bioinformatics. 2007 Jul 27;8:271. doi: 10.1186/1471-2105-8-271.
4
Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering.通过基于基因组规模结构的聚类推断非编码RNA家族和类别。
PLoS Comput Biol. 2007 Apr 13;3(4):e65. doi: 10.1371/journal.pcbi.0030065. Epub 2007 Feb 22.
5
Multiple structural alignment and clustering of RNA sequences.RNA序列的多重结构比对与聚类
Bioinformatics. 2007 Apr 15;23(8):926-32. doi: 10.1093/bioinformatics/btm049. Epub 2007 Feb 25.
6
An enhanced RNA alignment benchmark for sequence alignment programs.用于序列比对程序的增强型RNA比对基准。
Algorithms Mol Biol. 2006 Oct 24;1:19. doi: 10.1186/1748-7188-1-19.
7
A new distance for high level RNA secondary structure comparison.一种用于高级RNA二级结构比较的新距离。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jan-Mar;2(1):3-14. doi: 10.1109/TCBB.2005.2.
8
Local similarity in RNA secondary structures.RNA二级结构中的局部相似性。
Proc IEEE Comput Soc Bioinform Conf. 2003;2:159-68.
9
A benchmark of multiple sequence alignment programs upon structural RNAs.基于结构RNA的多序列比对程序的基准测试。
Nucleic Acids Res. 2005 Apr 28;33(8):2433-9. doi: 10.1093/nar/gki541. Print 2005.
10
Rfam: annotating non-coding RNAs in complete genomes.Rfam:对完整基因组中的非编码RNA进行注释。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D121-4. doi: 10.1093/nar/gki081.