• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用语法字符串推导ncRNA共有二级结构

ncRNA consensus secondary structure derivation using grammar strings.

作者信息

Achawanantakun Rujira, Sun Yanni, Takyar Seyedeh Shohreh

机构信息

Computer Science and Engineering Department, Michigan State University, East Lansing, Michigan 48824, USA.

出版信息

J Bioinform Comput Biol. 2011 Apr;9(2):317-37. doi: 10.1142/s0219720011005501.

DOI:10.1142/s0219720011005501
PMID:21523935
Abstract

Many noncoding RNAs (ncRNAs) function through both their sequences and secondary structures. Thus, secondary structure derivation is an important issue in today's RNA research. The state-of-the-art structure annotation tools are based on comparative analysis, which derives consensus structure of homologous ncRNAs. Despite promising results from existing ncRNA aligning and consensus structure derivation tools, there is a need for more efficient and accurate ncRNA secondary structure modeling and alignment methods. In this work, we introduce a consensus structure derivation approach based on grammar string, a novel ncRNA secondary structure representation that encodes an ncRNA's sequence and secondary structure in the parameter space of a context-free grammar (CFG) and a full RNA grammar including pseudoknots. Being a string defined on a special alphabet constructed from a grammar, grammar string converts ncRNA alignment into sequence alignment. We derive consensus secondary structures from hundreds of ncRNA families from BraliBase 2.1 and 25 families containing pseudoknots using grammar string alignment. Our experiments have shown that grammar string-based structure derivation competes favorably in consensus structure quality with Murlet and RNASampler. Source code and experimental data are available at http://www.cse.msu.edu/~yannisun/grammar-string.

摘要

许多非编码RNA(ncRNA)通过其序列和二级结构发挥作用。因此,二级结构推导是当今RNA研究中的一个重要问题。目前最先进的结构注释工具基于比较分析,即推导同源ncRNA的共有结构。尽管现有的ncRNA比对和共有结构推导工具取得了令人鼓舞的结果,但仍需要更高效、准确的ncRNA二级结构建模和比对方法。在这项工作中,我们介绍了一种基于语法字符串的共有结构推导方法,语法字符串是一种新颖的ncRNA二级结构表示形式,它在上下文无关语法(CFG)和包括假结的完整RNA语法的参数空间中对ncRNA的序列和二级结构进行编码。作为在由语法构建的特殊字母表上定义的字符串,语法字符串将ncRNA比对转换为序列比对。我们使用语法字符串比对从BraliBase 2.1中的数百个ncRNA家族和25个包含假结的家族中推导共有二级结构。我们的实验表明,基于语法字符串的结构推导在共有结构质量方面与Murlet和RNASampler相比具有优势。源代码和实验数据可在http://www.cse.msu.edu/~yannisun/grammar-string获取。

相似文献

1
ncRNA consensus secondary structure derivation using grammar strings.使用语法字符串推导ncRNA共有二级结构
J Bioinform Comput Biol. 2011 Apr;9(2):317-37. doi: 10.1142/s0219720011005501.
2
Efficient known ncRNA search including pseudoknots.高效已知 ncRNA 搜索包括假结。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S25. doi: 10.1186/1471-2105-14-S2-S25. Epub 2013 Jan 21.
3
Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.链 RNA:一种基于二维链算法的比较 ncRNA 搜索工具。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.
4
Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM.基于线性 SVM 的 ncRNAs(包括假结)的结构和二级结构预测。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-14-S2-S1. Epub 2013 Jan 21.
5
Designing filters for fast-known NcRNA identification.设计用于快速已知 NcRNA 鉴定的滤波器。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):774-87. doi: 10.1109/TCBB.2011.149.
6
Structural alignment of RNA with complex pseudoknot structure.具有复杂假结结构的RNA的结构比对
J Comput Biol. 2011 Jan;18(1):97-108. doi: 10.1089/cmb.2009.0275.
7
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.用于比对和预测假结RNA结构的成对随机树邻接文法
Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.
8
Designing secondary structure profiles for fast ncRNA identification.设计用于快速鉴定非编码RNA的二级结构图谱。
Comput Syst Bioinformatics Conf. 2008;7:145-56.
9
RNAdetect: efficient computational detection of novel non-coding RNAs.RNAdetect:高效计算新型非编码 RNA 的检测。
Bioinformatics. 2019 Apr 1;35(7):1133-1141. doi: 10.1093/bioinformatics/bty765.
10
RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules.RNAspa:一种用于非编码RNA分子二级结构比较预测的最短路径方法。
BMC Bioinformatics. 2007 Oct 1;8:366. doi: 10.1186/1471-2105-8-366.

引用本文的文献

1
FASTR: A novel data format for concomitant representation of RNA sequence and secondary structure information.FASTR:一种用于同时表示RNA序列和二级结构信息的新型数据格式。
J Biosci. 2015 Sep;40(3):571-7. doi: 10.1007/s12038-015-9546-0.
2
Shape and secondary structure prediction for ncRNAs including pseudoknots based on linear SVM.基于线性 SVM 的 ncRNAs(包括假结)的结构和二级结构预测。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-14-S2-S1. Epub 2013 Jan 21.