• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

树结构上的成对隐马尔可夫模型。

Pair hidden Markov models on tree structures.

作者信息

Sakakibara Yasubumi

机构信息

Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, 223-8522, Japan.

出版信息

Bioinformatics. 2003;19 Suppl 1:i232-40. doi: 10.1093/bioinformatics/btg1032.

DOI:10.1093/bioinformatics/btg1032
PMID:12855464
Abstract

MOTIVATION

Computationally identifying non-coding RNA regions on the genome has much scope for investigation and is essentially harder than gene-finding problems for protein-coding regions. Since comparative sequence analysis is effective for non-coding RNA detection, efficient computational methods are expected for structural alignments of RNA sequences. On the other hand, Hidden Markov Models (HMMs) have played important roles for modeling and analysing biological sequences. Especially, the concept of Pair HMMs (PHMMs) have been examined extensively as mathematical models for alignments and gene finding.

RESULTS

We propose the pair HMMs on tree structures (PHMMTSs), which is an extension of PHMMs defined on alignments of trees and provides a unifying framework and an automata-theoretic model for alignments of trees, structural alignments and pair stochastic context-free grammars. By structural alignment, we mean a pairwise alignment to align an unfolded RNA sequence into an RNA sequence of known secondary structure. First, we extend the notion of PHMMs defined on alignments of 'linear' sequences to pair stochastic tree automata, called PHMMTSs, defined on alignments of 'trees'. The PHMMTSs provide various types of alignments of trees such as affine-gap alignments of trees and an automata-theoretic model for alignment of trees. Second, based on the observation that a secondary structure of RNA can be represented by a tree, we apply PHMMTSs to the problem of structural alignments of RNAs. We modify PHMMTSs so that it takes as input a pair of a 'linear' sequence and a 'tree' representing a secondary structure of RNA to produce a structural alignment. Further, the PHMMTSs with input of a pair of two linear sequences is mathematically equal to the pair stochastic context-free grammars. We demonstrate some computational experiments to show the effectiveness of our method for structural alignments, and discuss a complexity issue of PHMMTSs.

摘要

动机

通过计算识别基因组上的非编码RNA区域有很大的研究空间,并且本质上比蛋白质编码区域的基因发现问题更难。由于比较序列分析对非编码RNA检测有效,因此期望有高效的计算方法用于RNA序列的结构比对。另一方面,隐马尔可夫模型(HMM)在生物序列的建模和分析中发挥了重要作用。特别是,配对隐马尔可夫模型(PHMM)的概念作为比对和基因发现的数学模型已被广泛研究。

结果

我们提出了树形结构上的配对隐马尔可夫模型(PHMMTS),它是在树比对上定义的PHMM的扩展,为树比对、结构比对和配对随机上下文无关文法提供了一个统一的框架和自动机理论模型。通过结构比对,我们指的是将一个展开的RNA序列与一个已知二级结构的RNA序列进行比对的双序列比对。首先,我们将在“线性”序列比对上定义的PHMM概念扩展到在“树”比对上定义的配对随机树自动机,即PHMMTS。PHMMTS提供了各种类型的树比对,如树的仿射间隙比对和树比对的自动机理论模型。其次,基于RNA的二级结构可以用树表示这一观察结果,我们将PHMMTS应用于RNA的结构比对问题。我们对PHMMTS进行修改,使其以一个“线性”序列和一个表示RNA二级结构的“树”的对作为输入,以产生一个结构比对。此外,输入为一对两个线性序列的PHMMTS在数学上等同于配对随机上下文无关文法。我们展示了一些计算实验,以证明我们的方法在结构比对方面的有效性,并讨论了PHMMTS的复杂性问题。

相似文献

1
Pair hidden Markov models on tree structures.树结构上的成对隐马尔可夫模型。
Bioinformatics. 2003;19 Suppl 1:i232-40. doi: 10.1093/bioinformatics/btg1032.
2
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.用于比对和预测假结RNA结构的成对随机树邻接文法
Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.
3
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.用于比对和预测假结RNA结构的配对随机树邻接文法
Bioinformatics. 2005 Jun 1;21(11):2611-7. doi: 10.1093/bioinformatics/bti385. Epub 2005 Mar 22.
4
Stochastic context-free grammars for tRNA modeling.用于tRNA建模的随机上下文无关文法。
Nucleic Acids Res. 1994 Nov 25;22(23):5112-20. doi: 10.1093/nar/22.23.5112.
5
RNA secondary structural alignment with conditional random fields.基于条件随机场的RNA二级结构比对
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii237-42. doi: 10.1093/bioinformatics/bti1139.
6
Noncoding RNA gene detection using comparative sequence analysis.利用比较序列分析进行非编码RNA基因检测。
BMC Bioinformatics. 2001;2:8. doi: 10.1186/1471-2105-2-8. Epub 2001 Oct 10.
7
DAFS: simultaneous aligning and folding of RNA sequences via dual decomposition.DAFS:通过对偶分解实现 RNA 序列的同时对齐和折叠。
Bioinformatics. 2012 Dec 15;28(24):3218-24. doi: 10.1093/bioinformatics/bts612. Epub 2012 Oct 11.
8
Considerations in the identification of functional RNA structural elements in genomic alignments.基因组比对中功能性RNA结构元件识别的考量因素。
BMC Bioinformatics. 2007 Jan 30;8:33. doi: 10.1186/1471-2105-8-33.
9
Alignment of RNA base pairing probability matrices.RNA碱基配对概率矩阵的比对。
Bioinformatics. 2004 Sep 22;20(14):2222-7. doi: 10.1093/bioinformatics/bth229. Epub 2004 Apr 8.
10
Pure multiple RNA secondary structure alignments: a progressive profile approach.纯多重RNA二级结构比对:一种渐进式轮廓方法。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):53-62. doi: 10.1109/TCBB.2004.11.

引用本文的文献

1
Structator: fast index-based search for RNA sequence-structure patterns.Structator:基于快速索引的 RNA 序列-结构模式搜索。
BMC Bioinformatics. 2011 May 27;12:214. doi: 10.1186/1471-2105-12-214.
2
Hidden Markov Models and their Applications in Biological Sequence Analysis.隐马尔可夫模型及其在生物序列分析中的应用。
Curr Genomics. 2009 Sep;10(6):402-15. doi: 10.2174/138920209789177575.
3
Evolutionary triplet models of structured RNA.结构化RNA的进化三联体模型
PLoS Comput Biol. 2009 Aug;5(8):e1000483. doi: 10.1371/journal.pcbi.1000483. Epub 2009 Aug 28.
4
Informatic resources for identifying and annotating structural RNA motifs.用于识别和注释结构RNA基序的信息资源。
Mol Biotechnol. 2009 Feb;41(2):180-93. doi: 10.1007/s12033-008-9114-z. Epub 2008 Nov 1.
5
Directed acyclic graph kernels for structural RNA analysis.用于结构RNA分析的有向无环图核
BMC Bioinformatics. 2008 Jul 22;9:318. doi: 10.1186/1471-2105-9-318.
6
Software.ncrna.org: web servers for analyses of RNA sequences.Software.ncrna.org:用于RNA序列分析的网络服务器。
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W75-8. doi: 10.1093/nar/gkn222. Epub 2008 Apr 25.
7
Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization.使用组合优化对RNA序列进行准确的多序列-结构比对。
BMC Bioinformatics. 2007 Jul 27;8:271. doi: 10.1186/1471-2105-8-271.
8
PSSMTS: position specific scoring matrices on tree structures.PSSMTS:树形结构上的位置特异性评分矩阵。
J Math Biol. 2008 Jan;56(1-2):201-14. doi: 10.1007/s00285-007-0108-4. Epub 2007 Jul 7.