• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LinearTurboFold:用于 SARS-CoV-2 同源 RNA 保守结构的线性时间全局预测及其应用。

LinearTurboFold: Linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2.

机构信息

School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR 97331.

Baidu Research, Sunnyvale, CA 94089.

出版信息

Proc Natl Acad Sci U S A. 2021 Dec 28;118(52). doi: 10.1073/pnas.2116269118.

DOI:10.1073/pnas.2116269118
PMID:34887342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8719904/
Abstract

The constant emergence of COVID-19 variants reduces the effectiveness of existing vaccines and test kits. Therefore, it is critical to identify conserved structures in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genomes as potential targets for variant-proof diagnostics and therapeutics. However, the algorithms to predict these conserved structures, which simultaneously fold and align multiple RNA homologs, scale at best cubically with sequence length and are thus infeasible for coronaviruses, which possess the longest genomes (∼30,000 nt) among RNA viruses. As a result, existing efforts on modeling SARS-CoV-2 structures resort to single-sequence folding as well as local folding methods with short window sizes, which inevitably neglect long-range interactions that are crucial in RNA functions. Here we present LinearTurboFold, an efficient algorithm for folding RNA homologs that scales linearly with sequence length, enabling unprecedented global structural analysis on SARS-CoV-2. Surprisingly, on a group of SARS-CoV-2 and SARS-related genomes, LinearTurboFold's purely in silico prediction not only is close to experimentally guided models for local structures, but also goes far beyond them by capturing the end-to-end pairs between 5' and 3' untranslated regions (UTRs) (∼29,800 nt apart) that match perfectly with a purely experimental work. Furthermore, LinearTurboFold identifies undiscovered conserved structures and conserved accessible regions as potential targets for designing efficient and mutation-insensitive small-molecule drugs, antisense oligonucleotides, small interfering RNAs (siRNAs), CRISPR-Cas13 guide RNAs, and RT-PCR primers. LinearTurboFold is a general technique that can also be applied to other RNA viruses and full-length genome studies and will be a useful tool in fighting the current and future pandemics.

摘要

不断出现的 COVID-19 变体降低了现有疫苗和检测试剂盒的有效性。因此,确定严重急性呼吸综合征冠状病毒 2 (SARS-CoV-2) 基因组中保守结构作为变体-proof 诊断和治疗的潜在靶标至关重要。然而,预测这些保守结构的算法同时折叠和对齐多个 RNA 同源物,其规模最好是与序列长度的立方成正比,因此对于 RNA 病毒中最长基因组(30000nt)的冠状病毒来说是不可行的。因此,目前对 SARS-CoV-2 结构建模的努力都依赖于单序列折叠以及具有短窗口大小的局部折叠方法,这不可避免地忽略了在 RNA 功能中至关重要的长程相互作用。在这里,我们提出了 LinearTurboFold,这是一种用于折叠 RNA 同源物的高效算法,它与序列长度呈线性比例缩放,使对 SARS-CoV-2 的前所未有的全局结构分析成为可能。令人惊讶的是,在一组 SARS-CoV-2 和 SARS 相关的基因组上,LinearTurboFold 的纯计算预测不仅与局部结构的实验指导模型接近,而且通过捕获 5' 和 3' 非翻译区 (UTR) 之间的端到端对(相距29800nt)远远超过了它们,这些端到端对与纯实验工作完全匹配。此外,LinearTurboFold 确定了未发现的保守结构和保守可及区域作为设计高效且突变不敏感的小分子药物、反义寡核苷酸、小干扰 RNA (siRNA)、CRISPR-Cas13 向导 RNA 和 RT-PCR 引物的潜在靶标。LinearTurboFold 是一种通用技术,也可以应用于其他 RNA 病毒和全长基因组研究,将成为应对当前和未来大流行的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/8e01f4581e2f/pnas.2116269118fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/027db58a8de0/pnas.2116269118fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/25c7962b6a45/pnas.2116269118fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/623a1f31ffe5/pnas.2116269118fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/ef6a95812531/pnas.2116269118fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/8e01f4581e2f/pnas.2116269118fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/027db58a8de0/pnas.2116269118fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/25c7962b6a45/pnas.2116269118fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/623a1f31ffe5/pnas.2116269118fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/ef6a95812531/pnas.2116269118fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc91/8719904/8e01f4581e2f/pnas.2116269118fig05.jpg

相似文献

1
LinearTurboFold: Linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2.LinearTurboFold:用于 SARS-CoV-2 同源 RNA 保守结构的线性时间全局预测及其应用。
Proc Natl Acad Sci U S A. 2021 Dec 28;118(52). doi: 10.1073/pnas.2116269118.
2
LinearTurboFold: Linear-Time Global Prediction of Conserved Structures for RNA Homologs with Applications to SARS-CoV-2.线性TurboFold:用于RNA同源物保守结构的线性时间全局预测及其在新冠病毒中的应用
bioRxiv. 2021 Nov 15:2020.11.23.393488. doi: 10.1101/2020.11.23.393488.
3
Pervasive RNA Secondary Structure in the Genomes of SARS-CoV-2 and Other Coronaviruses.SARS-CoV-2 和其他冠状病毒基因组中普遍存在的 RNA 二级结构。
mBio. 2020 Oct 30;11(6):e01661-20. doi: 10.1128/mBio.01661-20.
4
RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look.SARS-CoV-2 和 SARS 相关病毒的 RNA 基因组保守性和二级结构:初探。
RNA. 2020 Aug;26(8):937-959. doi: 10.1261/rna.076141.120. Epub 2020 May 12.
5
Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements.全基因组范围内 SARS-CoV-2 RNA 结构的绘制鉴定出具有治疗相关性的元件。
Nucleic Acids Res. 2020 Dec 16;48(22):12436-12452. doi: 10.1093/nar/gkaa1053.
6
LazySampling and LinearSampling: fast stochastic sampling of RNA secondary structure with applications to SARS-CoV-2.LazySampling 和 LinearSampling:用于 SARS-CoV-2 的 RNA 二级结构的快速随机抽样。
Nucleic Acids Res. 2023 Jan 25;51(2):e7. doi: 10.1093/nar/gkac1029.
7
Stability of RNA sequences derived from the coronavirus genome in human cells.冠状病毒基因组在人细胞中 RNA 序列的稳定性。
Biochem Biophys Res Commun. 2020 Jul 5;527(4):993-999. doi: 10.1016/j.bbrc.2020.05.008. Epub 2020 May 6.
8
LinearAlifold: Linear-time consensus structure prediction for RNA alignments.LinearAlifold:用于 RNA 比对的线性时间共识结构预测。
J Mol Biol. 2024 Sep 1;436(17):168694. doi: 10.1016/j.jmb.2024.168694. Epub 2024 Jul 4.
9
Implications of SARS-CoV-2 Mutations for Genomic RNA Structure and Host microRNA Targeting.SARS-CoV-2 突变对基因组 RNA 结构和宿主 microRNA 靶向的影响。
Int J Mol Sci. 2020 Jul 7;21(13):4807. doi: 10.3390/ijms21134807.
10
Structure-altering mutations of the SARS-CoV-2 frameshifting RNA element.SARS-CoV-2 框架移位 RNA 元件的结构改变突变。
Biophys J. 2021 Mar 16;120(6):1040-1053. doi: 10.1016/j.bpj.2020.10.012. Epub 2020 Oct 21.

引用本文的文献

1
AlignmentFold and AlignmentPartition: Improving the align-then-fold approach for RNA secondary structure prediction.比对折叠与比对划分:改进用于RNA二级结构预测的先比对后折叠方法
bioRxiv. 2025 Jul 28:2025.07.23.666478. doi: 10.1101/2025.07.23.666478.
2
Structural impact of synonymous mutations in six SARS-CoV-2 Variants of Concern.六种新冠病毒变异株中同义突变的结构影响
PLoS One. 2025 Jul 1;20(7):e0325858. doi: 10.1371/journal.pone.0325858. eCollection 2025.
3
Self-splicing introns in genes of Bastillevirinae bacteriophages.

本文引用的文献

1
De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures.从头开始构建 SARS-CoV-2 RNA 元件的共识实验二级结构 3D 模型。
Nucleic Acids Res. 2021 Apr 6;49(6):3092-3108. doi: 10.1093/nar/gkab119.
2
In vivo structural characterization of the SARS-CoV-2 RNA genome identifies host proteins vulnerable to repurposed drugs.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)RNA基因组的体内结构表征确定了易受重新利用药物影响的宿主蛋白。
Cell. 2021 Apr 1;184(7):1865-1883.e20. doi: 10.1016/j.cell.2021.02.008. Epub 2021 Feb 9.
3
Comprehensive in vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms.
巴斯蒂尔病毒科噬菌体基因中的自我剪接内含子。
Nucleic Acids Res. 2025 Feb 27;53(5). doi: 10.1093/nar/gkaf121.
4
Decoding the genome of SARS-CoV-2: a pathway to drug development through translation inhibition.解码严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组:通过翻译抑制进行药物开发的途径。
RNA Biol. 2024 Jan;21(1):1-18. doi: 10.1080/15476286.2024.2433830. Epub 2024 Dec 4.
5
REDalign: accurate RNA structural alignment using residual encoder-decoder network.REDalign:使用残差编码器-解码器网络进行精确的 RNA 结构比对。
BMC Bioinformatics. 2024 Nov 5;25(1):346. doi: 10.1186/s12859-024-05956-7.
6
LinearAlifold: Linear-time consensus structure prediction for RNA alignments.LinearAlifold:用于 RNA 比对的线性时间共识结构预测。
J Mol Biol. 2024 Sep 1;436(17):168694. doi: 10.1016/j.jmb.2024.168694. Epub 2024 Jul 4.
7
CodonBERT large language model for mRNA vaccines.基于 CodonBERT 的 mRNA 疫苗大语言模型。
Genome Res. 2024 Aug 20;34(7):1027-1035. doi: 10.1101/gr.278870.123.
8
Unveiling hidden structural patterns in the SARS-CoV-2 genome: Computational insights and comparative analysis.揭示 SARS-CoV-2 基因组中的隐藏结构模式:计算洞察与比较分析。
PLoS One. 2024 Apr 4;19(4):e0298164. doi: 10.1371/journal.pone.0298164. eCollection 2024.
9
LinearCoFold and LinearCoPartition: linear-time algorithms for secondary structure prediction of interacting RNA molecules.线性 CoFold 和线性 CoPartition:用于预测相互作用 RNA 分子二级结构的线性时间算法。
Nucleic Acids Res. 2023 Oct 13;51(18):e94. doi: 10.1093/nar/gkad664.
10
The CRISPR/Cas System: A Customizable Toolbox for Molecular Detection.CRISPR/Cas 系统:分子检测的可定制工具箱。
Genes (Basel). 2023 Mar 31;14(4):850. doi: 10.3390/genes14040850.
全面的 SARS-CoV-2 基因组体内二级结构揭示了新的调控基序和机制。
Mol Cell. 2021 Feb 4;81(3):584-598.e5. doi: 10.1016/j.molcel.2020.12.041. Epub 2021 Jan 1.
4
Genomic RNA Elements Drive Phase Separation of the SARS-CoV-2 Nucleocapsid.基因组 RNA 元件驱动 SARS-CoV-2 核衣壳的相分离。
Mol Cell. 2020 Dec 17;80(6):1078-1091.e6. doi: 10.1016/j.molcel.2020.11.041. Epub 2020 Nov 27.
5
The Short- and Long-Range RNA-RNA Interactome of SARS-CoV-2.SARS-CoV-2 的短程和长程 RNA-RNA 互作组。
Mol Cell. 2020 Dec 17;80(6):1067-1077.e5. doi: 10.1016/j.molcel.2020.11.004. Epub 2020 Nov 5.
6
Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements.全基因组范围内 SARS-CoV-2 RNA 结构的绘制鉴定出具有治疗相关性的元件。
Nucleic Acids Res. 2020 Dec 16;48(22):12436-12452. doi: 10.1093/nar/gkaa1053.
7
Targeting the SARS-CoV-2 RNA Genome with Small Molecule Binders and Ribonuclease Targeting Chimera (RIBOTAC) Degraders.用小分子结合剂和核糖核酸酶靶向嵌合体(RIBOTAC)降解剂靶向严重急性呼吸综合征冠状病毒2(SARS-CoV-2)RNA基因组。
ACS Cent Sci. 2020 Oct 28;6(10):1713-1721. doi: 10.1021/acscentsci.0c00984. Epub 2020 Sep 30.
8
LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities.LinearPartition:RNA 折叠分区函数和碱基配对概率的线性时间逼近。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i258-i267. doi: 10.1093/bioinformatics/btaa460.
9
Structural and functional conservation of the programmed -1 ribosomal frameshift signal of SARS coronavirus 2 (SARS-CoV-2).SARS-CoV-2 病毒的核糖体移码信号的结构和功能保守性。
J Biol Chem. 2020 Jul 31;295(31):10741-10748. doi: 10.1074/jbc.AC120.013449. Epub 2020 Jun 22.
10
Optimization of primer sets and detection protocols for SARS-CoV-2 of coronavirus disease 2019 (COVID-19) using PCR and real-time PCR.优化用于 2019 年冠状病毒病(COVID-19)的 SARS-CoV-2 冠状病毒的聚合酶链反应(PCR)和实时 PCR 引物和检测方案。
Exp Mol Med. 2020 Jun;52(6):963-977. doi: 10.1038/s12276-020-0452-7. Epub 2020 Jun 16.