Bilal Anas, Suleman Muhammad Taseer, Almohammadi Khalid, Alzahrani Abdulkareem, Liu Xiaowen
Department of Information Science and Technology, Hainan Normal University, No. 99 Long Kun South Road, Haikou 571158, China.
Department of Computer Science, Bahria University Lahore Campus, 47-Civic Centre, Johar Town, Lahore, Punjab 54782, Pakistan.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf282.
2-O-methylation (2OM) is a vital post-transcriptional modification which is formed by a functional group through the attachment of a methyl (-CH3) group to the second position of an aromatic ring hydroxyl group (-OH). It plays an active part in RNA physical configuration stability and the way different RNA molecules interrelate. Further, this modification plays a pivotal role in changing the epigenetic regulation of cellular processes. Previous approaches like mass spectrometry could not fully enhance the identification of RNA-modified sites. Sequence data were useful in the development of measures that meant the use of computationally intelligent system to identify 2OM sites quickly. This research proposed a new novel method of feature extraction and generation from the available sequences, and the feature dimensionality reduction has been done through the incorporation of statistical moments. The final feature vectors were developed and used to train prediction models. The assessment of prediction models was carried out through independent set tests and k-fold cross-validation. Through rigorous testing, the bagging ensemble model outperformed and revealed optimal accuracy scores. A publicly accessible web-based application has been developed which can be accessed via https://2om-pred-webapp.streamlit.app/.
2-O-甲基化(2OM)是一种重要的转录后修饰,它是通过一个官能团将甲基(-CH3)连接到芳香环羟基(-OH)的第二位而形成的。它在RNA物理结构稳定性以及不同RNA分子的相互关系方面发挥着积极作用。此外,这种修饰在改变细胞过程的表观遗传调控中起着关键作用。以往的方法如质谱分析无法充分提高RNA修饰位点的识别能力。序列数据在开发相关措施时很有用,这些措施意味着使用计算智能系统来快速识别2OM位点。本研究提出了一种从可用序列中提取和生成特征的新方法,并且通过纳入统计矩进行了特征降维。最终开发了特征向量并用于训练预测模型。通过独立集测试和k折交叉验证对预测模型进行评估。经过严格测试,袋装集成模型表现出色并显示出最佳准确率得分。已经开发了一个基于网络的可公开访问的应用程序,可通过https://2om-pred-webapp.streamlit.app/访问。