Department of Biomedical Engineering, University of Alberta, Edmonton, Alberta, Canada.
PLoS One. 2024 Apr 4;19(4):e0298164. doi: 10.1371/journal.pone.0298164. eCollection 2024.
SARS-CoV-2, the causative agent of COVID-19, is known to exhibit secondary structures in its 5' and 3' untranslated regions, along with the frameshifting stimulatory element situated between ORF1a and 1b. To identify additional regions containing conserved structures, we utilized a multiple sequence alignment with related coronaviruses as a starting point. We applied a computational pipeline developed for identifying non-coding RNA elements. Our pipeline employed three different RNA structural prediction approaches. We identified forty genomic regions likely to harbor structures, with ten of them showing three-way consensus substructure predictions among our predictive utilities. We conducted intracomparisons of the predictive utilities within the pipeline and intercomparisons with four previously published SARS-CoV-2 structural datasets. While there was limited agreement on the precise structure, different approaches seemed to converge on regions likely to contain structures in the viral genome. By comparing and combining various computational approaches, we can predict regions most likely to form structures, as well as a probable structure or ensemble of structures. These predictions can be used to guide surveillance, prophylactic measures, or therapeutic efforts. Data and scripts employed in this study may be found at https://doi.org/10.5281/zenodo.8298680.
SARS-CoV-2,即 COVID-19 的病原体,已知在其 5'和 3'非翻译区存在二级结构,以及位于 ORF1a 和 1b 之间的移码刺激元件。为了鉴定包含保守结构的其他区域,我们利用与相关冠状病毒的多重序列比对作为起点。我们应用了一种用于鉴定非编码 RNA 元件的计算管道。我们的管道采用了三种不同的 RNA 结构预测方法。我们鉴定了四十个可能含有结构的基因组区域,其中十个在我们的预测工具中显示了三向一致的亚结构预测。我们对管道内的预测工具进行了内部比较,并与四个先前发表的 SARS-CoV-2 结构数据集进行了比较。虽然在精确结构上存在有限的一致性,但不同的方法似乎集中在病毒基因组中可能含有结构的区域上。通过比较和组合各种计算方法,我们可以预测最有可能形成结构的区域,以及可能的结构或结构组合。这些预测可以用于指导监测、预防措施或治疗工作。本研究中使用的数据和脚本可在 https://doi.org/10.5281/zenodo.8298680 找到。