• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于 RNA 二级结构预测的进化随机上下文无关语法。

Evolving stochastic context--free grammars for RNA secondary structure prediction.

出版信息

BMC Bioinformatics. 2012 May 4;13:78. doi: 10.1186/1471-2105-13-78.

DOI:10.1186/1471-2105-13-78
PMID:22559985
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3464655/
Abstract

BACKGROUND

Stochastic Context-Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few intuitively designed grammars have remained dominant. In this paper we investigate two automatic search techniques for effective grammars - exhaustive search for very compact grammars and an evolutionary algorithm to find larger grammars. We also examine whether grammar ambiguity is as problematic to structure prediction as has been previously suggested.

RESULTS

These search techniques were applied to predict RNA secondary structure on a maximal data set and revealed new and interesting grammars, though none are dramatically better than classic grammars. In general, results showed that many grammars with quite different structure could have very similar predictive ability. Many ambiguous grammars were found which were at least as effective as the best current unambiguous grammars.

CONCLUSIONS

Overall the method of evolving SCFGs for RNA secondary structure prediction proved effective in finding many grammars that had strong predictive accuracy, as good or slightly better than those designed manually. Furthermore, several of the best grammars found were ambiguous, demonstrating that such grammars should not be disregarded.

摘要

背景

随机上下文无关语法 (SCFG) 在 90 年代初期成功应用于 RNA 二级结构预测,并在 90 年代后期与比较方法结合使用。对于 RNA 二级结构预测,潜在有用的 SCFG 集非常大,但少数直观设计的语法仍然占主导地位。在本文中,我们研究了两种用于有效语法的自动搜索技术 - 用于非常紧凑语法的穷举搜索和用于寻找更大语法的进化算法。我们还检查了语法歧义是否像以前所建议的那样对结构预测构成问题。

结果

这些搜索技术应用于在最大数据集上预测 RNA 二级结构,揭示了新的和有趣的语法,但没有一种语法明显优于经典语法。一般来说,结果表明,许多结构非常不同的语法可能具有非常相似的预测能力。发现了许多有歧义的语法,它们至少与当前最好的无歧义语法一样有效。

结论

总的来说,用于 RNA 二级结构预测的 SCFG 进化方法在寻找具有强预测准确性的许多语法方面非常有效,这些语法与手动设计的语法一样好或略好。此外,发现的一些最佳语法是有歧义的,这表明不应忽视此类语法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6724/3464655/a80d1d03c49f/1471-2105-13-78-1.jpg

相似文献

1
Evolving stochastic context--free grammars for RNA secondary structure prediction.用于 RNA 二级结构预测的进化随机上下文无关语法。
BMC Bioinformatics. 2012 May 4;13:78. doi: 10.1186/1471-2105-13-78.
2
SCFGs in RNA secondary structure prediction RNA secondary structure prediction: a hands-on approach.RNA二级结构预测中的SCFGs:RNA二级结构预测:一种实践方法。
Methods Mol Biol. 2014;1097:143-62. doi: 10.1007/978-1-62703-709-9_8.
3
Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction.用于RNA二级结构预测的几种轻量级随机上下文无关文法的评估
BMC Bioinformatics. 2004 Jun 4;5:71. doi: 10.1186/1471-2105-5-71.
4
Introduction to stochastic context free grammars.随机上下文无关文法简介。
Methods Mol Biol. 2014;1097:85-106. doi: 10.1007/978-1-62703-709-9_5.
5
RNA secondary structure prediction using stochastic context-free grammars and evolutionary history.利用随机上下文无关文法和进化史进行RNA二级结构预测。
Bioinformatics. 1999 Jun;15(6):446-54. doi: 10.1093/bioinformatics/15.6.446.
6
Multithreaded comparative RNA secondary structure prediction using stochastic context-free grammars.使用随机上下文无关语法的多线程比较 RNA 二级结构预测。
BMC Bioinformatics. 2011 Apr 18;12:103. doi: 10.1186/1471-2105-12-103.
7
CONTRAfold: RNA secondary structure prediction without physics-based models.CONTRAfold:无需基于物理模型的RNA二级结构预测
Bioinformatics. 2006 Jul 15;22(14):e90-8. doi: 10.1093/bioinformatics/btl246.
8
Multithreaded parsing for predicting RNA secondary structures.
Int J Bioinform Res Appl. 2010;6(6):609-21. doi: 10.1504/IJBRA.2010.038741.
9
Accelerated probabilistic inference of RNA structure evolution.RNA结构进化的加速概率推断
BMC Bioinformatics. 2005 Mar 24;6:73. doi: 10.1186/1471-2105-6-73.
10
Characterising RNA secondary structure space using information entropy.利用信息熵刻画 RNA 二级结构空间。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S22. doi: 10.1186/1471-2105-14-S2-S22. Epub 2013 Jan 21.

引用本文的文献

1
A Hitchhiker's guide to RNA-RNA structure and interaction prediction tools.RNA 结构和相互作用预测工具的指南
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad421.
2
tRNA renovatio: Rebirth through fragmentation.tRNA 修复:通过断裂实现重生。
Mol Cell. 2023 Nov 16;83(22):3953-3971. doi: 10.1016/j.molcel.2023.09.016. Epub 2023 Oct 5.
3
RNA, Action through Interactions.RNA,通过相互作用发挥作用。

本文引用的文献

1
TurboFold: iterative probabilistic estimation of secondary structures for multiple RNA sequences.TurboFold:用于多个 RNA 序列的二级结构的迭代概率估计。
BMC Bioinformatics. 2011 Apr 20;12:108. doi: 10.1186/1471-2105-12-108.
2
RNAalifold: improved consensus structure prediction for RNA alignments.RNAalifold:改进的RNA比对共有结构预测
BMC Bioinformatics. 2008 Nov 11;9:474. doi: 10.1186/1471-2105-9-474.
3
Specific alignment of structured RNA: stochastic grammars and sequence annealing.结构化RNA的特定比对:随机语法与序列退火
Trends Genet. 2018 Nov;34(11):867-882. doi: 10.1016/j.tig.2018.08.001. Epub 2018 Aug 31.
4
Asymptotic distribution of motifs in a stochastic context-free grammar model of RNA folding.RNA折叠随机上下文无关语法模型中基序的渐近分布
J Math Biol. 2014 Dec;69(6-7):1743-72. doi: 10.1007/s00285-013-0750-y. Epub 2014 Jan 3.
5
The four ingredients of single-sequence RNA secondary structure prediction. A unifying perspective.单序列 RNA 二级结构预测的四个要素。一种统一的观点。
RNA Biol. 2013 Jul;10(7):1185-96. doi: 10.4161/rna.24971. Epub 2013 May 10.
6
Quantifying variances in comparative RNA secondary structure prediction.量化比较 RNA 二级结构预测中的差异。
BMC Bioinformatics. 2013 May 1;14:149. doi: 10.1186/1471-2105-14-149.
7
Characterising RNA secondary structure space using information entropy.利用信息熵刻画 RNA 二级结构空间。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S22. doi: 10.1186/1471-2105-14-S2-S22. Epub 2013 Jan 21.
Bioinformatics. 2008 Dec 1;24(23):2677-83. doi: 10.1093/bioinformatics/btn495. Epub 2008 Sep 16.
4
UNAFold: software for nucleic acid folding and hybridization.UNAFold:用于核酸折叠和杂交的软件。
Methods Mol Biol. 2008;453:3-31. doi: 10.1007/978-1-60327-429-6_1.
5
RNA STRAND: the RNA secondary structure and statistical analysis database.RNA链:RNA二级结构与统计分析数据库。
BMC Bioinformatics. 2008 Aug 13;9:340. doi: 10.1186/1471-2105-9-340.
6
The structure of NoRC-associated RNA is crucial for targeting the chromatin remodelling complex NoRC to the nucleolus.与NoRC相关的RNA的结构对于将染色质重塑复合物NoRC靶向核仁至关重要。
EMBO Rep. 2008 Aug;9(8):774-80. doi: 10.1038/embor.2008.109. Epub 2008 Jul 4.
7
An evolutionary method for learning HMM structure: prediction of protein secondary structure.一种学习隐马尔可夫模型结构的进化方法:蛋白质二级结构预测
BMC Bioinformatics. 2007 Sep 21;8:357. doi: 10.1186/1471-2105-8-357.
8
CONTRAfold: RNA secondary structure prediction without physics-based models.CONTRAfold:无需基于物理模型的RNA二级结构预测
Bioinformatics. 2006 Jul 15;22(14):e90-8. doi: 10.1093/bioinformatics/btl246.
9
The tmRDB and SRPDB resources.tmRDB和SRPDB资源。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D163-8. doi: 10.1093/nar/gkj142.
10
Effective ambiguity checking in biosequence analysis.生物序列分析中有效的模糊性检查
BMC Bioinformatics. 2005 Jun 20;6:153. doi: 10.1186/1471-2105-6-153.