• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LinAliFold和质心LinAliFold:使用束搜索方法对比对序列进行快速RNA共有二级结构预测。

LinAliFold and CentroidLinAliFold: fast RNA consensus secondary structure prediction for aligned sequences using beam search methods.

作者信息

Fukunaga Tsukasa, Hamada Michiaki

机构信息

Waseda Institute for Advanced Study, Waseda University, Tokyo 1690051, Japan.

Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University, Tokyo 1698555, Japan.

出版信息

Bioinform Adv. 2022 Oct 22;2(1):vbac078. doi: 10.1093/bioadv/vbac078. eCollection 2022.

DOI:10.1093/bioadv/vbac078
PMID:36699418
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9710674/
Abstract

MOTIVATION

RNA consensus secondary structure prediction from aligned sequences is a powerful approach for improving the secondary structure prediction accuracy. However, because the computational complexities of conventional prediction tools scale with the cube of the alignment lengths, their application to long RNA sequences, such as viral RNAs or long non-coding RNAs, requires significant computational time.

RESULTS

In this study, we developed LinAliFold and CentroidLinAliFold, fast RNA consensus secondary structure prediction tools based on minimum free energy and maximum expected accuracy principles, respectively. We achieved software acceleration using beam search methods that were successfully used for fast secondary structure prediction from a single RNA sequence. Benchmark analyses showed that LinAliFold and CentroidLinAliFold were much faster than the existing methods while preserving the prediction accuracy. As an empirical application, we predicted the consensus secondary structure of coronaviruses with approximately 30 000 nt in 5 and 79 min by LinAliFold and CentroidLinAliFold, respectively. We confirmed that the predicted consensus secondary structure of coronaviruses was consistent with the experimental results.

AVAILABILITY AND IMPLEMENTATION

The source codes of LinAliFold and CentroidLinAliFold are freely available at https://github.com/fukunagatsu/LinAliFold-CentroidLinAliFold.

SUPPLEMENTARY INFORMATION

Supplementary data are available at online.

摘要

动机

从比对序列预测RNA共有二级结构是提高二级结构预测准确性的一种有效方法。然而,由于传统预测工具的计算复杂度与比对长度的立方成正比,因此将其应用于长RNA序列(如病毒RNA或长链非编码RNA)时需要大量的计算时间。

结果

在本研究中,我们分别基于最小自由能和最大期望准确度原则开发了快速RNA共有二级结构预测工具LinAliFold和CentroidLinAliFold。我们使用成功应用于从单个RNA序列进行快速二级结构预测的束搜索方法实现了软件加速。基准分析表明,LinAliFold和CentroidLinAliFold在保持预测准确性的同时,比现有方法快得多。作为一个实证应用,我们分别使用LinAliFold和CentroidLinAliFold在5分钟和79分钟内预测了约30000个核苷酸的冠状病毒共有二级结构。我们证实预测的冠状病毒共有二级结构与实验结果一致。

可用性和实现方式

LinAliFold和CentroidLinAliFold的源代码可在https://github.com/fukunagatsu/LinAliFold-CentroidLinAliFold上免费获取。

补充信息

补充数据可在网上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/e93e1de3e1de/vbac078f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/9c0b6fec1fe5/vbac078f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/f4a900d3167c/vbac078f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/c5784020ffc5/vbac078f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/e93e1de3e1de/vbac078f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/9c0b6fec1fe5/vbac078f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/f4a900d3167c/vbac078f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/c5784020ffc5/vbac078f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e96/9710674/e93e1de3e1de/vbac078f4.jpg

相似文献

1
LinAliFold and CentroidLinAliFold: fast RNA consensus secondary structure prediction for aligned sequences using beam search methods.LinAliFold和质心LinAliFold:使用束搜索方法对比对序列进行快速RNA共有二级结构预测。
Bioinform Adv. 2022 Oct 22;2(1):vbac078. doi: 10.1093/bioadv/vbac078. eCollection 2022.
2
ConsAlifold: considering RNA structural alignments improves prediction accuracy of RNA consensus secondary structures.ConsAlifold:考虑 RNA 结构比对可提高 RNA 一致二级结构预测精度。
Bioinformatics. 2022 Jan 12;38(3):710-719. doi: 10.1093/bioinformatics/btab738.
3
RIblast: an ultrafast RNA-RNA interaction prediction system based on a seed-and-extension approach.RIblast:一种基于种子和扩展方法的超快 RNA-RNA 相互作用预测系统。
Bioinformatics. 2017 Sep 1;33(17):2666-2674. doi: 10.1093/bioinformatics/btx287.
4
Improved RNA secondary structure and tertiary base-pairing prediction using evolutionary profile, mutational coupling and two-dimensional transfer learning.利用进化谱、突变耦合和二维迁移学习改进RNA二级结构和三级碱基配对预测。
Bioinformatics. 2021 Sep 9;37(17):2589-2600. doi: 10.1093/bioinformatics/btab165.
5
CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score.CentroidAlign:通过最大化预期对和分数实现结构化 RNA 的快速准确比对。
Bioinformatics. 2009 Dec 15;25(24):3236-43. doi: 10.1093/bioinformatics/btp580. Epub 2009 Oct 6.
6
Predicting RNA distance-based contact maps by integrated deep learning on physics-inferred secondary structure and evolutionary-derived mutational coupling.基于物理推断的二级结构和进化衍生突变耦合的集成深度学习预测 RNA 距离接触图谱。
Bioinformatics. 2022 Aug 10;38(16):3900-3910. doi: 10.1093/bioinformatics/btac421.
7
Improving the accuracy of predicting secondary structure for aligned RNA sequences.提高 RNA 序列比对二级结构预测的准确性。
Nucleic Acids Res. 2011 Jan;39(2):393-402. doi: 10.1093/nar/gkq792. Epub 2010 Sep 15.
8
IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming.IPknot:使用整数规划快速准确地预测具有假结的 RNA 二级结构。
Bioinformatics. 2011 Jul 1;27(13):i85-93. doi: 10.1093/bioinformatics/btr215.
9
Knotty: efficient and accurate prediction of complex RNA pseudoknot structures.Knotty:高效准确地预测复杂 RNA 假结结构。
Bioinformatics. 2018 Nov 15;34(22):3849-3856. doi: 10.1093/bioinformatics/bty420.
10
aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences.aliFreeFold:一种从同源 RNA 序列预测二级结构的无比对方法。
Bioinformatics. 2018 Jul 1;34(13):i70-i78. doi: 10.1093/bioinformatics/bty234.

引用本文的文献

1
LinearAlifold: Linear-time consensus structure prediction for RNA alignments.LinearAlifold:用于 RNA 比对的线性时间共识结构预测。
J Mol Biol. 2024 Sep 1;436(17):168694. doi: 10.1016/j.jmb.2024.168694. Epub 2024 Jul 4.
2
DeepRaccess: high-speed RNA accessibility prediction using deep learning.DeepRaccess:使用深度学习进行高速RNA可及性预测
Front Bioinform. 2023 Oct 10;3:1275787. doi: 10.3389/fbinf.2023.1275787. eCollection 2023.

本文引用的文献

1
LinearAlifold: Linear-time consensus structure prediction for RNA alignments.LinearAlifold:用于 RNA 比对的线性时间共识结构预测。
J Mol Biol. 2024 Sep 1;436(17):168694. doi: 10.1016/j.jmb.2024.168694. Epub 2024 Jul 4.
2
Algorithm for optimized mRNA design improves stability and immunogenicity.优化 mRNA 设计的算法可提高稳定性和免疫原性。
Nature. 2023 Sep;621(7978):396-403. doi: 10.1038/s41586-023-06127-z. Epub 2023 May 2.
3
LazySampling and LinearSampling: fast stochastic sampling of RNA secondary structure with applications to SARS-CoV-2.
LazySampling 和 LinearSampling:用于 SARS-CoV-2 的 RNA 二级结构的快速随机抽样。
Nucleic Acids Res. 2023 Jan 25;51(2):e7. doi: 10.1093/nar/gkac1029.
4
Secondary structural ensembles of the SARS-CoV-2 RNA genome in infected cells.感染细胞中 SARS-CoV-2 基因组的二级结构组合。
Nat Commun. 2022 Mar 2;13(1):1128. doi: 10.1038/s41467-022-28603-2.
5
Cryo-EM advances in RNA structure determination.冷冻电镜技术在 RNA 结构测定方面的进展。
Signal Transduct Target Ther. 2022 Feb 23;7(1):58. doi: 10.1038/s41392-022-00916-0.
6
LinearTurboFold: Linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2.LinearTurboFold:用于 SARS-CoV-2 同源 RNA 保守结构的线性时间全局预测及其应用。
Proc Natl Acad Sci U S A. 2021 Dec 28;118(52). doi: 10.1073/pnas.2116269118.
7
ConsAlifold: considering RNA structural alignments improves prediction accuracy of RNA consensus secondary structures.ConsAlifold:考虑 RNA 结构比对可提高 RNA 一致二级结构预测精度。
Bioinformatics. 2022 Jan 12;38(3):710-719. doi: 10.1093/bioinformatics/btab738.
8
Prediction of RNA secondary structure including pseudoknots for long sequences.长序列 RNA 二级结构(包括假结)的预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab395.
9
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
10
RNA secondary structure prediction using deep learning with thermodynamic integration.使用热力学积分的深度学习进行 RNA 二级结构预测。
Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.