• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在Dynalign中使用概率比对约束进行高效的成对RNA结构预测。

Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign.

作者信息

Harmanci Arif Ozgun, Sharma Gaurav, Mathews David H

机构信息

Department of Electrical and Computer Engineering, University of Rochester, Hopeman 204, Rochester, NY 14627, USA.

出版信息

BMC Bioinformatics. 2007 Apr 19;8:130. doi: 10.1186/1471-2105-8-130.

DOI:10.1186/1471-2105-8-130
PMID:17445273
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1868766/
Abstract

BACKGROUND

Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction.

RESULTS

The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources.

CONCLUSION

Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download.

摘要

背景

两条RNA序列的联合比对和二级结构预测能够显著提高结构预测的准确性。然而,解决此问题的方法不得不采用一些约束条件,通过限制允许的比对和/或结构(即折叠)来减少计算量。本文提出了一种新方法,旨在基于核苷酸比对和插入后验概率来建立比对约束。使用隐马尔可夫模型,计算两条序列中所有可能核苷酸位置对的比对和插入后验概率。将这些比对和插入后验概率相加组合,以获得核苷酸位置对的重合概率。通过对比对重合概率进行阈值化处理,得到合适的比对约束。该约束与Dynalign(一种用于联合比对和二级结构预测的自由能最小化算法)相结合。将所得方法与Dynalign的先前版本以及其他用于成对RNA结构预测的程序进行基准测试。

结果

所提出的技术消除了Dynalign中的手动参数选择,与Dynalign先前的约束相比,显著节省了计算时间,同时在结构预测准确性上有小幅提高。在内存方面也实现了节省。在一个平均序列长度约为120个核苷酸的5S RNA数据集上进行的实验中,该方法将计算量减少了一半。与其他用于成对RNA结构预测的程序相比,该方法表现良好:平均而言,具有更高的准确性,并且所需的计算资源显著更少。

结论

可以利用概率分析以一种有原则的方式自动确定成对RNA结构预测方法的比对约束。这些约束可以降低这些方法的计算和内存需求,同时保持或提高其结构预测的准确性。这将这些方法的实际应用范围扩展到了更长的序列。修订后的Dynalign代码可免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/b1639bd87026/1471-2105-8-130-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/b78c827042cb/1471-2105-8-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/20838fe70be5/1471-2105-8-130-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/e4a1a7252417/1471-2105-8-130-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/abe1b8fdd7d5/1471-2105-8-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/266f66fe76e0/1471-2105-8-130-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/36fa48e74900/1471-2105-8-130-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/305cc59de51c/1471-2105-8-130-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/8628421d5769/1471-2105-8-130-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/d4f584c8a8ba/1471-2105-8-130-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/b1639bd87026/1471-2105-8-130-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/b78c827042cb/1471-2105-8-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/20838fe70be5/1471-2105-8-130-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/e4a1a7252417/1471-2105-8-130-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/abe1b8fdd7d5/1471-2105-8-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/266f66fe76e0/1471-2105-8-130-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/36fa48e74900/1471-2105-8-130-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/305cc59de51c/1471-2105-8-130-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/8628421d5769/1471-2105-8-130-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/d4f584c8a8ba/1471-2105-8-130-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/54a8/1868766/b1639bd87026/1471-2105-8-130-10.jpg

相似文献

1
Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign.在Dynalign中使用概率比对约束进行高效的成对RNA结构预测。
BMC Bioinformatics. 2007 Apr 19;8:130. doi: 10.1186/1471-2105-8-130.
2
Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.利用序列比对约束进行高效的成对RNA结构预测和比对。
BMC Bioinformatics. 2006 Sep 4;7:400. doi: 10.1186/1471-2105-7-400.
3
PARTS: probabilistic alignment for RNA joinT secondary structure prediction.PARTS:用于RNA联合二级结构预测的概率比对
Nucleic Acids Res. 2008 Apr;36(7):2406-17. doi: 10.1093/nar/gkn043. Epub 2008 Feb 26.
4
Predicting a set of minimal free energy RNA secondary structures common to two sequences.预测两个序列共有的一组最小自由能RNA二级结构。
Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24.
5
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
6
TurboFold: iterative probabilistic estimation of secondary structures for multiple RNA sequences.TurboFold:用于多个 RNA 序列的二级结构的迭代概率估计。
BMC Bioinformatics. 2011 Apr 20;12:108. doi: 10.1186/1471-2105-12-108.
7
Can Clustal-style progressive pairwise alignment of multiple sequences be used in RNA secondary structure prediction?多个序列的Clustal式渐进成对比对能否用于RNA二级结构预测?
BMC Bioinformatics. 2007 Jun 8;8:190. doi: 10.1186/1471-2105-8-190.
8
Predicting the secondary structure common to two RNA sequences with Dynalign.使用Dynalign预测两个RNA序列共有的二级结构。
Curr Protoc Bioinformatics. 2004 Dec;Chapter 12:Unit 12.4. doi: 10.1002/0471250953.bi1204s08.
9
TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.TurboFold II:基于多个同源物的RNA结构比对与二级结构预测
Nucleic Acids Res. 2017 Nov 16;45(20):11570-11581. doi: 10.1093/nar/gkx815.
10
Murlet: a practical multiple alignment tool for structural RNA sequences.Murlet:一种用于结构RNA序列的实用多序列比对工具。
Bioinformatics. 2007 Jul 1;23(13):1588-98. doi: 10.1093/bioinformatics/btm146. Epub 2007 Apr 25.

引用本文的文献

1
: identification and characterization of plant-conserved microRNA in Rafflesiaceae.大花草科中植物保守微小RNA的鉴定与特征分析
Open Life Sci. 2025 Jan 27;20(1):20221033. doi: 10.1515/biol-2022-1033. eCollection 2025.
2
DecoyFinder: Identification of Contaminants in Sets of Homologous RNA Sequences.诱饵序列查找器:同源RNA序列集中污染物的鉴定
bioRxiv. 2024 Oct 15:2024.10.12.618037. doi: 10.1101/2024.10.12.618037.
3
RNA Secondary Structure Analysis Using RNAstructure.使用 RNAstructure 进行 RNA 二级结构分析。

本文引用的文献

1
Simultaneous alignment and structure prediction of three RNA sequences.三个RNA序列的同时比对与结构预测
Int J Bioinform Res Appl. 2005;1(2):230-45. doi: 10.1504/IJBRA.2005.007581.
2
A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation.一组用于预测RNA二级结构形成焓变的最近邻参数。
Nucleic Acids Res. 2006;34(17):4912-24. doi: 10.1093/nar/gkl472. Epub 2006 Sep 18.
3
Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.
Curr Protoc. 2023 Jul;3(7):e846. doi: 10.1002/cpz1.846.
4
LinearTurboFold: Linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2.LinearTurboFold:用于 SARS-CoV-2 同源 RNA 保守结构的线性时间全局预测及其应用。
Proc Natl Acad Sci U S A. 2021 Dec 28;118(52). doi: 10.1073/pnas.2116269118.
5
LinearTurboFold: Linear-Time Global Prediction of Conserved Structures for RNA Homologs with Applications to SARS-CoV-2.线性TurboFold:用于RNA同源物保守结构的线性时间全局预测及其在新冠病毒中的应用
bioRxiv. 2021 Nov 15:2020.11.23.393488. doi: 10.1101/2020.11.23.393488.
6
Alignments of biomolecular contact maps.生物分子接触图的比对
Interface Focus. 2021 Jun 11;11(4):20200066. doi: 10.1098/rsfs.2020.0066. eCollection 2021 Jun.
7
Conserved Structural Motifs of Two Distant IAV Subtypes in Genomic Segment 5 RNA.两种不同亚型的流感病毒在基因组 5 片段 RNA 中保守的结构基序。
Viruses. 2021 Mar 22;13(3):525. doi: 10.3390/v13030525.
8
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
9
Fast and accurate structure probability estimation for simultaneous alignment and folding of RNAs with Markov chains.利用马尔可夫链对RNA同时进行比对和折叠时的快速准确结构概率估计。
Algorithms Mol Biol. 2020 Nov 13;15(1):19. doi: 10.1186/s13015-020-00179-w.
10
RNA Secondary Structure Motifs of the Influenza A Virus as Targets for siRNA-Mediated RNA Interference.甲型流感病毒的RNA二级结构基序作为siRNA介导的RNA干扰的靶点
Mol Ther Nucleic Acids. 2020 Mar 6;19:627-642. doi: 10.1016/j.omtn.2019.12.018. Epub 2019 Dec 24.
利用序列比对约束进行高效的成对RNA结构预测和比对。
BMC Bioinformatics. 2006 Sep 4;7:400. doi: 10.1186/1471-2105-7-400.
4
Thousands of corresponding human and mouse genomic regions unalignable in primary sequence contain common RNA structure.数以千计在一级序列中无法比对的相应人类和小鼠基因组区域含有共同的RNA结构。
Genome Res. 2006 Jul;16(7):885-9. doi: 10.1101/gr.5226606. Epub 2006 Jun 2.
5
Prediction of RNA secondary structure by free energy minimization.通过自由能最小化预测RNA二级结构。
Curr Opin Struct Biol. 2006 Jun;16(3):270-8. doi: 10.1016/j.sbi.2006.05.010. Epub 2006 May 19.
6
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
7
Revolutions in RNA secondary structure prediction.RNA二级结构预测的变革
J Mol Biol. 2006 Jun 9;359(3):526-32. doi: 10.1016/j.jmb.2006.01.067. Epub 2006 Feb 6.
8
Accelerated probabilistic inference of RNA structure evolution.RNA结构进化的加速概率推断
BMC Bioinformatics. 2005 Mar 24;6:73. doi: 10.1186/1471-2105-6-73.
9
Predicting a set of minimal free energy RNA secondary structures common to two sequences.预测两个序列共有的一组最小自由能RNA二级结构。
Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24.
10
Fast and reliable prediction of noncoding RNAs.非编码RNA的快速可靠预测
Proc Natl Acad Sci U S A. 2005 Feb 15;102(7):2454-9. doi: 10.1073/pnas.0409169102. Epub 2005 Jan 21.