Suppr超能文献

m5c-iDeep:通过深度学习识别5-甲基胞嘧啶位点

m5c-iDeep: 5-Methylcytosine sites identification through deep learning.

作者信息

Malebary Sharaf J, Alromema Nashwan, Suleman Muhammad Taseer, Saleem Maham

机构信息

Department of Information Technology, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia.

Department of Computer Science, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, P.O. Box 344, Rabigh 21911, Saudi Arabia.

出版信息

Methods. 2024 Oct;230:80-90. doi: 10.1016/j.ymeth.2024.07.008. Epub 2024 Jul 31.

Abstract

5-Methylcytosine (m5c) is a modified cytosine base which is formed as the result of addition of methyl group added at position 5 of carbon. This modification is one of the most common PTM that used to occur in almost all types of RNA. The conventional laboratory methods do not provide quick reliable identification of m5c sites. However, the sequence data readiness has made it feasible to develop computationally intelligent models that optimize the identification process for accuracy and robustness. The present research focused on the development of in-silico methods built using deep learning models. The encoded data was then fed into deep learning models, which included gated recurrent unit (GRU), long short-term memory (LSTM), and bi-directional LSTM (Bi-LSTM). After that, the models were subjected to a rigorous evaluation process that included both independent set testing and 10-fold cross validation. The results revealed that LSTM-based model, m5c-iDeep, outperformed revealing 99.9 % accuracy while comparing with existing m5c predictors. In order to facilitate researchers, m5c-iDeep was also deployed on a web-based server which is accessible at https://taseersuleman-m5c-ideep-m5c-ideep.streamlit.app/.

摘要

5-甲基胞嘧啶(m5c)是一种经过修饰的胞嘧啶碱基,它是在碳的5位添加甲基的结果。这种修饰是几乎所有类型RNA中最常见的翻译后修饰(PTM)之一。传统的实验室方法无法快速可靠地识别m5c位点。然而,序列数据的可得性使得开发计算智能模型成为可能,这些模型可以优化识别过程以提高准确性和稳健性。本研究专注于开发使用深度学习模型构建的计算机模拟方法。然后将编码数据输入深度学习模型,其中包括门控循环单元(GRU)、长短期记忆(LSTM)和双向LSTM(Bi-LSTM)。之后,对模型进行了严格的评估过程,包括独立集测试和10折交叉验证。结果表明,与现有的m5c预测器相比,基于LSTM的模型m5c-iDeep表现出色,准确率达到99.9%。为了方便研究人员,m5c-iDeep还部署在一个基于网络的服务器上,可通过https://taseersuleman-m5c-ideep-m5c-ideep.streamlit.app/访问。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验