
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.

Authors

Matsui Hiroshi, Sato Kengo, Sakakibara Yasubumi

Affiliation

Department of Biosciences and Informatics, Keio University, Kohoku-ku, Yokohama, Japan.

Publication

Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.

PMID: 16448022

Abstract

MOTIVATION

Whole-genome sequences are now available for many species, so computational prediction of RNA secondary structures and identification of non-coding RNA regions by comparative genomics have become important, and both require more advanced alignment methods. Structural alignment of RNA sequences was recently introduced to address these problems. By structural alignment, we mean a pairwise alignment that aligns an unfolded RNA sequence to a folded RNA sequence of known secondary structure. Pair HMMs on tree structures (PHMMTSs), proposed by Sakakibara, are efficient automata-theoretic models for structural alignment of RNA secondary structures, but they cannot handle pseudoknots. Tree adjoining grammars (TAGs), on the other hand, are a subclass of context-sensitive grammars that is suitable for modeling pseudoknots. Our goal is to extend PHMMTSs by incorporating TAGs so that they can handle pseudoknots.
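To make the term concrete: a secondary structure contains a pseudoknot when two of its base pairs cross, that is, when pairs (i, j) and (k, l) satisfy i < k < j < l. Purely nested structures never do this, which is why tree-based models such as PHMMTSs cannot represent them. The Python sketch below is not taken from the paper; the function name `has_pseudoknot` and the representation of a structure as a list of 0-based index pairs are assumptions made for illustration.

```python
from itertools import combinations


def has_pseudoknot(pairs):
    """Return True if any two base pairs cross (i < k < j < l).

    `pairs` is a list of (i, j) position tuples; this representation
    is an assumption for illustration, not the paper's data format.
    """
    # Normalize each pair so the smaller index comes first, then sort
    # so that the first pair of every combination starts no later.
    normalized = sorted(tuple(sorted(p)) for p in pairs)
    for (i, j), (k, l) in combinations(normalized, 2):
        if i < k < j < l:
            return True
    return False


# A plain nested stem: every pair sits inside the previous one.
nested = [(0, 9), (1, 8), (2, 7)]

# Two interleaved stems: (0, 6) crosses (3, 9), forming a pseudoknot.
crossing = [(0, 6), (1, 5), (3, 9), (4, 8)]

print(has_pseudoknot(nested))
print(has_pseudoknot(crossing))
```

The check is quadratic in the number of pairs, which is fine for a single RNA but is only a detector; it says nothing about how to align or predict such structures.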

RESULTS

We propose pair stochastic tree adjoining grammars (PSTAGs) for modeling RNA secondary structures that include pseudoknots, and we present strong experimental evidence that modeling pseudoknot structures significantly improves the prediction accuracy of RNA secondary structures. First, we extend the notion of PHMMTSs, defined on alignments of "trees", to PSTAGs, defined on alignments of "TAG (derivation) trees", which represent a top-down parsing process of TAGs and are functionally equivalent to the derived trees of TAGs. Second, we modify PSTAGs so that they take as input a pair consisting of a linear sequence and a TAG tree representing an RNA pseudoknot structure, and produce a structural alignment. Then, we develop a polynomial-time algorithm, based on a dynamic programming parser, for obtaining an optimal structural alignment with PSTAGs. Our computational experiments on predicting pseudoknots with PSTAGs suggest that our method predicts RNA pseudoknot structures more efficiently and more biologically plausibly than other conventional methods. The binary code for the PSTAG method is freely available from our website at http://www.dna.bio.keio.ac.jp/pstag/.
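Once a structural alignment between a folded reference and an unfolded sequence is available, a structure for the unfolded sequence can be read off by carrying the reference base pairs across the aligned columns. The Python sketch below illustrates only that final step. It is not the PSTAG algorithm: it assumes the alignment is already given as two equal-length gapped strings with `-` as the gap symbol, and the function name `transfer_pairs` is hypothetical.

```python
def transfer_pairs(aligned_ref, aligned_query, ref_pairs):
    """Map reference base pairs onto the query through an alignment.

    aligned_ref / aligned_query: equal-length strings with '-' for gaps
    (an assumed input format, not the paper's).
    ref_pairs: (i, j) pairs of 0-based positions in the ungapped reference.
    Returns the sorted pairs in ungapped query coordinates; a pair is
    dropped if either of its positions is aligned to a gap.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")

    ref_to_query = {}
    r = q = 0  # running positions in the ungapped reference and query
    for a, b in zip(aligned_ref, aligned_query):
        if a != "-" and b != "-":
            ref_to_query[r] = q
        if a != "-":
            r += 1
        if b != "-":
            q += 1

    return sorted(
        (ref_to_query[i], ref_to_query[j])
        for i, j in ref_pairs
        if i in ref_to_query and j in ref_to_query
    )


# Reference GCAAGC with pairs G0-C5, C1-G4 and a third pair (2, 3)
# that is included only to show how a pair hitting a gap is dropped.
predicted = transfer_pairs("GCAA-GC", "GC-AUGC", [(0, 5), (1, 4), (2, 3)])
print(predicted)  # [(0, 5), (1, 4)]
```

Because the mapping works on index pairs rather than on a tree, crossing (pseudoknotted) reference pairs are carried over in the same way as nested ones. Finding the optimal alignment itself is the hard part that the dynamic programming parser described above addresses.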


Similar articles

1. Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.
Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.
2. Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.
Bioinformatics. 2005 Jun 1;21(11):2611-7. doi: 10.1093/bioinformatics/bti385. Epub 2005 Mar 22.
3. Pair hidden Markov models on tree structures.
Bioinformatics. 2003;19 Suppl 1:i232-40. doi: 10.1093/bioinformatics/btg1032.
4. RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.
Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.
5. An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots.
Bioinformatics. 2004 Jan 1;20(1):58-66. doi: 10.1093/bioinformatics/btg373.
6. Stochastic modeling of RNA pseudoknotted structures: a grammatical approach.
Bioinformatics. 2003;19 Suppl 1:i66-73. doi: 10.1093/bioinformatics/btg1007.
7. RNA secondary structural alignment with conditional random fields.
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii237-42. doi: 10.1093/bioinformatics/bti1139.
8. CONTRAfold: RNA secondary structure prediction without physics-based models.
Bioinformatics. 2006 Jul 15;22(14):e90-8. doi: 10.1093/bioinformatics/btl246.
9. DAFS: simultaneous aligning and folding of RNA sequences via dual decomposition.
Bioinformatics. 2012 Dec 15;28(24):3218-24. doi: 10.1093/bioinformatics/bts612. Epub 2012 Oct 11.
10. Effective alignment of RNA pseudoknot structures using partition function posterior log-odds scores.
BMC Bioinformatics. 2015 Feb 6;16:39. doi: 10.1186/s12859-015-0464-9.

Cited by

1. Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.
Bioinform Biol Insights. 2009 Nov 24;1:101-26. doi: 10.4137/bbi.s415.
2. Informatic resources for identifying and annotating structural RNA motifs.
Mol Biotechnol. 2009 Feb;41(2):180-93. doi: 10.1007/s12033-008-9114-z. Epub 2008 Nov 1.