通过结合同源序列信息预测RNA二级结构。

Predictions of RNA secondary structure by combining homologous sequence information.

作者信息

Hamada Michiaki, Sato Kengo, Kiryu Hisanori, Mituyama Toutai, Asai Kiyoshi

机构信息

Mizuho Information & Research Institute, Inc, Tokyo, Japan.

出版信息

Bioinformatics. 2009 Jun 15;25(12):i330-8. doi: 10.1093/bioinformatics/btp228.

DOI:10.1093/bioinformatics/btp228

PMID:19478007

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2687982/

Abstract

MOTIVATION

Secondary structure prediction of RNA sequences is an important problem. There have been progresses in this area, but the accuracy of prediction from an RNA sequence is still limited. In many cases, however, homologous RNA sequences are available with the target RNA sequence whose secondary structure is to be predicted.

RESULTS

In this article, we propose a new method for secondary structure predictions of individual RNA sequences by taking the information of their homologous sequences into account without assuming the common secondary structure of the entire sequences. The proposed method is based on posterior decoding techniques, which consider all the suboptimal secondary structures of the target and homologous sequences and all the suboptimal alignments between the target sequence and each of the homologous sequences. In our computational experiments, the proposed method provides better predictions than those performed only on the basis of the formation of individual RNA sequences and those performed by using methods for predicting the common secondary structure of the homologous sequences. Remarkably, we found that the common secondary predictions sometimes give worse predictions for the secondary structure of a target sequence than the predictions from the individual target sequence, while the proposed method always gives good predictions for the secondary structure of target sequences in all tested cases.

AVAILABILITY

Supporting information and software are available online at: http://www.ncrna.org/software/centroidfold/ismb2009/.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

RNA序列的二级结构预测是一个重要问题。该领域已取得进展，但从RNA序列进行预测的准确性仍然有限。然而，在许多情况下，可获得与待预测二级结构的目标RNA序列同源的RNA序列。

结果

在本文中，我们提出了一种新方法，用于通过考虑其同源序列的信息来预测单个RNA序列的二级结构，而无需假设整个序列具有共同的二级结构。所提出的方法基于后验解码技术，该技术考虑了目标序列和同源序列的所有次优二级结构以及目标序列与每个同源序列之间的所有次优比对。在我们的计算实验中，所提出的方法比仅基于单个RNA序列的形成进行的预测以及使用预测同源序列共同二级结构的方法进行的预测提供了更好的预测。值得注意的是，我们发现共同二级结构预测有时对目标序列二级结构的预测比来自单个目标序列的预测更差，而在所测试的所有情况下，所提出的方法始终能对目标序列的二级结构给出良好的预测。

可用性

支持信息和软件可在以下网址在线获取：http://www.ncrna.org/software/centroidfold/ismb2009/。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b58/2687982/78042cb25cd2/btp228f1.jpg

相似文献

Predictions of RNA secondary structure by combining homologous sequence information.

Bioinformatics. 2009 Jun 15;25(12):i330-8. doi: 10.1093/bioinformatics/btp228.

Prediction of RNA secondary structure using generalized centroid estimators.

Bioinformatics. 2009 Feb 15;25(4):465-73. doi: 10.1093/bioinformatics/btn601. Epub 2008 Dec 18.

CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score.

Bioinformatics. 2009 Dec 15;25(24):3236-43. doi: 10.1093/bioinformatics/btp580. Epub 2009 Oct 6.

A local multiple alignment method for detection of non-coding RNA sequences.

Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.

CentroidHomfold-LAST: accurate prediction of RNA secondary structure using automatically collected homologous sequences.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W100-6. doi: 10.1093/nar/gkr290. Epub 2011 May 11.

aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences.

Bioinformatics. 2018 Jul 1;34(13):i70-i78. doi: 10.1093/bioinformatics/bty234.

DAFS: simultaneous aligning and folding of RNA sequences via dual decomposition.

Bioinformatics. 2012 Dec 15;28(24):3218-24. doi: 10.1093/bioinformatics/bts612. Epub 2012 Oct 11.

Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.

Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.

An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots.

Bioinformatics. 2004 Jan 1;20(1):58-66. doi: 10.1093/bioinformatics/btg373.

Robust prediction of consensus secondary structures using averaged base pairing probability matrices.

Bioinformatics. 2007 Feb 15;23(4):434-41. doi: 10.1093/bioinformatics/btl636. Epub 2006 Dec 20.

引用本文的文献

RiNALMo: general-purpose RNA language models can generalize well on structure prediction tasks.

Nat Commun. 2025 Jul 1;16(1):5671. doi: 10.1038/s41467-025-60872-5.

Comprehensive benchmarking of large language models for RNA secondary structure prediction.

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf137.

Utilizing RNA-seq Data to Infer Bacterial Transcription Termination Sites and Validate Predictions.

Methods Mol Biol. 2024;2812:345-365. doi: 10.1007/978-1-0716-3886-6_19.

Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery.

Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad186.

ConsAlign: simultaneous RNA structural aligner based on rich transfer learning and thermodynamic ensemble model of alignment scoring.

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad255.

Rtools: A Web Server for Various Secondary Structural Analyses on Single RNA Sequences.

Methods Mol Biol. 2023;2586:1-14. doi: 10.1007/978-1-0716-2768-6_1.

LinAliFold and CentroidLinAliFold: fast RNA consensus secondary structure prediction for aligned sequences using beam search methods.

Bioinform Adv. 2022 Oct 22;2(1):vbac078. doi: 10.1093/bioadv/vbac078. eCollection 2022.

rboAnalyzer: A Software to Improve Characterization of Non-coding RNAs From Sequence Database Search Output.

Front Genet. 2020 Jul 28;11:675. doi: 10.3389/fgene.2020.00675. eCollection 2020.

An Algorithm for Template-Based Prediction of Secondary Structures of Individual RNA Sequences.

Front Genet. 2017 Oct 10;8:147. doi: 10.3389/fgene.2017.00147. eCollection 2017.

Biochemical and structural features of extracellular vesicle-binding RNA aptamers.

Biomed Rep. 2017 Jun;6(6):615-626. doi: 10.3892/br.2017.899. Epub 2017 May 3.

本文引用的文献

Prediction of RNA secondary structure using generalized centroid estimators.

Bioinformatics. 2009 Feb 15;25(4):465-73. doi: 10.1093/bioinformatics/btn601. Epub 2008 Dec 18.

Sequence progressive alignment, a framework for practical large-scale probabilistic consistency alignment.

Bioinformatics. 2009 Feb 1;25(3):295-301. doi: 10.1093/bioinformatics/btn630. Epub 2008 Dec 4.

RNAalifold: improved consensus structure prediction for RNA alignments.

BMC Bioinformatics. 2008 Nov 11;9:474. doi: 10.1186/1471-2105-9-474.

Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments.

Nucleic Acids Res. 2008 Nov;36(20):6355-62. doi: 10.1093/nar/gkn544. Epub 2008 Oct 4.

Specific alignment of structured RNA: stochastic grammars and sequence annealing.

Bioinformatics. 2008 Dec 1;24(23):2677-83. doi: 10.1093/bioinformatics/btn495. Epub 2008 Sep 16.

RNA STRAND: the RNA secondary structure and statistical analysis database.

BMC Bioinformatics. 2008 Aug 13;9:340. doi: 10.1186/1471-2105-9-340.

A max-margin model for efficient simultaneous alignment and folding of RNA sequences.

Bioinformatics. 2008 Jul 1;24(13):i68-76. doi: 10.1093/bioinformatics/btn177.

The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data.

Nature. 2008 Mar 6;452(7183):51-5. doi: 10.1038/nature06684.

Centroid estimation in discrete high-dimensional spaces with applications in biology.

Proc Natl Acad Sci U S A. 2008 Mar 4;105(9):3209-14. doi: 10.1073/pnas.0712329105. Epub 2008 Feb 27.

Alignment uncertainty and genomic analysis.

Science. 2008 Jan 25;319(5862):473-6. doi: 10.1126/science.1151532.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过结合同源序列信息预测RNA二级结构。

Predictions of RNA secondary structure by combining homologous sequence information.

作者信息

Hamada Michiaki, Sato Kengo, Kiryu Hisanori, Mituyama Toutai, Asai Kiyoshi

机构信息

Mizuho Information & Research Institute, Inc, Tokyo, Japan.