School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou St., 15780 Athens, Greece.
Hellenic Air Force Academy, Dekelia Air Base, Acharnes, 13671 Athens, Greece.
Biomolecules. 2023 Feb 6;13(2):308. doi: 10.3390/biom13020308.
The accurate "base pairing" in RNA molecules, which leads to the prediction of RNA secondary structures, is crucial in order to explain unknown biological operations. Recently, COVID-19, a widespread disease, has caused many deaths, affecting humanity in an unprecedented way. SARS-CoV-2, a single-stranded RNA virus, has shown the significance of analyzing these molecules and their structures. This paper aims to create a pioneering framework in the direction of predicting specific RNA structures, leveraging syntactic pattern recognition. The proposed framework, Knotify+, addresses the problem of predicting H-type pseudoknots, including bulges and internal loops, by featuring the power of context-free grammar (CFG). We combine the grammar's advantages with maximum base pairing and minimum free energy to tackle this ambiguous task in a performant way. Specifically, our proposed methodology, Knotify+, outperforms state-of-the-art frameworks with regards to its accuracy in core stems prediction. Additionally, it performs more accurately in small sequences and presents a comparable accuracy rate in larger ones, while it requires a smaller execution time compared to well-known platforms. The Knotify+ source code and implementation details are available as a public repository on GitHub.
RNA 分子中精确的“碱基配对”,可预测 RNA 二级结构,这对于解释未知的生物操作至关重要。最近,COVID-19 这种广泛传播的疾病导致了许多人死亡,以前所未有的方式影响了人类。SARS-CoV-2 是一种单链 RNA 病毒,它突显了分析这些分子及其结构的重要性。本文旨在创建一个开创性的框架,以预测特定的 RNA 结构,利用语法模式识别。所提出的框架 Knotify+ 解决了预测 H 型假结(包括凸起和内部环)的问题,其特点是具有上下文无关语法 (CFG) 的功能。我们将语法的优势与最大碱基配对和最小自由能相结合,以有效地解决这一模糊任务。具体来说,我们提出的方法 Knotify+ 在核心茎预测的准确性方面优于最先进的框架。此外,它在小序列上的表现更准确,在较大序列上的表现也相当,并且与知名平台相比,它需要的执行时间更短。Knotify+ 的源代码和实现细节可在 GitHub 上的公共存储库中获得。