University of Camerino, Via Madonna delle Carceri 9, 62032, Camerino, Italy.
BMC Bioinformatics. 2022 Aug 18;23(Suppl 6):345. doi: 10.1186/s12859-022-04879-5.
Due to its key role in various biological processes, RNA secondary structures have always been the focus of in-depth analyses, with great efforts from mathematicians and biologists, to find a suitable abstract representation for modelling its functional and structural properties. One contribution is due to Kauffman and Magarshak, who modelled RNA secondary structures as mathematical objects constructed in link theory: tangles of the Brauer Monoid. In this paper, we extend the tangle-based model with its minimal prime factorization, useful to analyze patterns that characterize the RNA secondary structure.
By leveraging the mapping between RNA and tangles, we prove that the prime factorizations of tangle-based models share some patterns with RNA folding's features. We analyze the E. coli tRNA and provide some visual examples of interesting patterns.
We formulate an open question on the nature of the class of equivalent factorizations and discuss some research directions in this regard. We also propose some practical applications of the tangle-based method to RNA classification and folding prediction as a useful tool for learning algorithms, even though the full factorization is not known.
由于 RNA 二级结构在各种生物过程中起着关键作用,数学家和生物学家一直致力于深入分析,并寻找合适的抽象表示来模拟其功能和结构特性。Kauffman 和 Magarshak 的贡献之一是将 RNA 二级结构建模为链接理论中构建的数学对象:Brauer 幺半群的扭结。在本文中,我们通过其最小素因子分解扩展了基于扭结的模型,这对于分析特征 RNA 二级结构的模式非常有用。
通过利用 RNA 和扭结之间的映射关系,我们证明了基于扭结的模型的素因子分解与 RNA 折叠的特征具有某些模式。我们分析了大肠杆菌 tRNA,并提供了一些有趣模式的可视化示例。
我们提出了一个关于等价因子分解类本质的开放性问题,并讨论了这方面的一些研究方向。我们还提出了基于扭结的方法在 RNA 分类和折叠预测中的一些实际应用,作为学习算法的有用工具,即使不知道完整的因子分解。