Staem5：一种用于准确预测m5C位点的新型计算方法。

Staem5: A novel computational approachfor accurate prediction of m5C site.

作者信息

Chai Di, Jia Cangzhi, Zheng Jia, Zou Quan, Li Fuyi

机构信息

School of Science, Dalian Maritime University, Dalian 116026, China.

Yangtze Delta Region Institute (Quzhou), Quzhou, China.

出版信息

Mol Ther Nucleic Acids. 2021 Oct 20;26:1027-1034. doi: 10.1016/j.omtn.2021.10.012. eCollection 2021 Dec 3.

DOI:10.1016/j.omtn.2021.10.012

PMID:34786208

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8571400/

Abstract

5-Methylcytosine (m5C) is an important post-transcriptional modification that has been extensively found in multiple types of RNAs. Many studies have shown that m5C plays vital roles in many biological functions, such as RNA structure stability and metabolism. Computational approaches act as an efficient way to identify m5C sites from high-throughput RNA sequence data and help interpret the functional mechanism of this important modification. This study proposed a novel species-specific computational approach, Staem5, to accurately predict RNA m5C sites in and . Staem5 was developed by employing feature fusion tactics to leverage informatic sequence profiles, and a stacking ensemble learning framework combined five popular machine learning algorithms. Extensive benchmarking tests demonstrated that Staem5 outperformed state-of-the-art approaches in both cross-validation and independent tests. We provide the source code of Staem5, which is publicly available at https://github.com/Cxd-626/Staem5.git.

摘要

5-甲基胞嘧啶（m5C）是一种重要的转录后修饰，已在多种类型的RNA中广泛发现。许多研究表明，m5C在许多生物学功能中起着至关重要的作用，如RNA结构稳定性和代谢。计算方法是从高通量RNA序列数据中识别m5C位点并帮助解释这种重要修饰功能机制的有效途径。本研究提出了一种新颖的物种特异性计算方法Staem5，用于准确预测[]和[]中的RNA m5C位点。Staem5通过采用特征融合策略利用信息序列概况进行开发，并且一个堆叠集成学习框架结合了五种流行的机器学习算法。广泛的基准测试表明，在交叉验证和独立测试中，Staem5均优于现有方法。我们提供了Staem5的源代码，可在https://github.com/Cxd-626/Staem5.git上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2400/8571400/587ee3319a26/fx1.jpg

相似文献

Staem5: A novel computational approachfor accurate prediction of m5C site.Staem5：一种用于准确预测m5C位点的新型计算方法。

Mol Ther Nucleic Acids. 2021 Oct 20;26:1027-1034. doi: 10.1016/j.omtn.2021.10.012. eCollection 2021 Dec 3.

im5C-DSCGA: A Proposed Hybrid Framework Based on Improved DenseNet and Attention Mechanisms for Identifying 5-methylcytosine Sites in Human RNA.im5C-DSCGA：一种基于改进的 DenseNet 和注意力机制的混合框架，用于识别人类 RNA 中的 5-甲基胞嘧啶位点。

Front Biosci (Landmark Ed). 2023 Dec 26;28(12):346. doi: 10.31083/j.fbl2812346.

m5CPred-SVM: a novel method for predicting m5C sites of RNA.m5CPred-SVM：一种预测 RNA m5C 位点的新方法。

BMC Bioinformatics. 2020 Oct 30;21(1):489. doi: 10.1186/s12859-020-03828-4.

XGBoost framework with feature selection for the prediction of RNA N5-methylcytosine sites.XGBoost 框架与特征选择相结合，用于预测 RNA N5-甲基胞嘧啶位点。

Mol Ther. 2023 Aug 2;31(8):2543-2551. doi: 10.1016/j.ymthe.2023.05.016. Epub 2023 Jun 3.

Evaluation of different computational methods on 5-methylcytosine sites identification.不同计算方法在 5-甲基胞嘧啶位点识别中的评估。

Brief Bioinform. 2020 May 21;21(3):982-995. doi: 10.1093/bib/bbz048.

m5C-Seq: Machine learning-enhanced profiling of RNA 5-methylcytosine modifications.m5C-Seq：基于机器学习的 RNA 5-甲基胞嘧啶修饰谱分析。

Comput Biol Med. 2024 Nov;182:109087. doi: 10.1016/j.compbiomed.2024.109087. Epub 2024 Sep 3.

An improved residual network using deep fusion for identifying RNA 5-methylcytosine sites.一种使用深度融合的改进残差网络，用于识别 RNA 5-甲基胞嘧啶位点。

Bioinformatics. 2022 Sep 15;38(18):4271-4277. doi: 10.1093/bioinformatics/btac532.

Prediction of m5C Modifications in RNA Sequences by Combining Multiple Sequence Features.通过结合多种序列特征预测RNA序列中的m5C修饰

Mol Ther Nucleic Acids. 2020 Sep 4;21:332-342. doi: 10.1016/j.omtn.2020.06.004. Epub 2020 Jun 10.

m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features.m5CRegpred：基于测序特征的 5-甲基胞嘧啶（m5C）调控因子的转录组靶标预测。

Genes (Basel). 2022 Apr 12;13(4):677. doi: 10.3390/genes13040677.

Transcriptome-Wide Annotation of mC RNA Modifications Using Machine Learning.使用机器学习对m⁶A RNA修饰进行全转录组注释

Front Plant Sci. 2018 Apr 18;9:519. doi: 10.3389/fpls.2018.00519. eCollection 2018.

引用本文的文献

Definer: A computational method for accurate identification of RNA pseudouridine sites based on deep learning.定义者：一种基于深度学习的准确识别RNA假尿苷位点的计算方法。

PLoS One. 2025 Apr 24;20(4):e0320077. doi: 10.1371/journal.pone.0320077. eCollection 2025.

A CNN based m5c RNA methylation predictor.基于 CNN 的 m5c RNA 甲基化预测器。

Sci Rep. 2023 Dec 11;13(1):21885. doi: 10.1038/s41598-023-48751-9.

XGBoost framework with feature selection for the prediction of RNA N5-methylcytosine sites.XGBoost 框架与特征选择相结合，用于预测 RNA N5-甲基胞嘧啶位点。

Mol Ther. 2023 Aug 2;31(8):2543-2551. doi: 10.1016/j.ymthe.2023.05.016. Epub 2023 Jun 3.

Predicting Pseudouridine Sites with Porpoise.使用“鼠海豚”预测假尿苷位点。

Methods Mol Biol. 2023;2624:139-151. doi: 10.1007/978-1-0716-2962-8_10.

Dynamic regulation and key roles of ribonucleic acid methylation.核糖核酸甲基化的动态调控及关键作用

Front Cell Neurosci. 2022 Dec 19;16:1058083. doi: 10.3389/fncel.2022.1058083. eCollection 2022.

Epitranscriptomics in parasitic protists: Role of RNA chemical modifications in posttranscriptional gene regulation.RNA 化学修饰在寄生虫原生生物中转录后基因调控中的作用：表观转录组学。

PLoS Pathog. 2022 Dec 22;18(12):e1010972. doi: 10.1371/journal.ppat.1010972. eCollection 2022 Dec.

MLACP 2.0: An updated machine learning tool for anticancer peptide prediction.MLACP 2.0：一种用于抗癌肽预测的更新机器学习工具。

Comput Struct Biotechnol J. 2022 Aug 2;20:4473-4480. doi: 10.1016/j.csbj.2022.07.043. eCollection 2022.

Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.Deepm5C：一种基于深度学习的混合框架，使用堆叠策略识别人类 RNA N5-甲基胞嘧啶位点。

Mol Ther. 2022 Aug 3;30(8):2856-2867. doi: 10.1016/j.ymthe.2022.05.001. Epub 2022 May 6.

Genes (Basel). 2022 Apr 12;13(4):677. doi: 10.3390/genes13040677.

m5Cpred-XS: A New Method for Predicting RNA m5C Sites Based on XGBoost and SHAP.m5Cpred-XS：一种基于XGBoost和SHAP预测RNA m5C位点的新方法。

Front Genet. 2022 Mar 30;13:853258. doi: 10.3389/fgene.2022.853258. eCollection 2022.

本文引用的文献

Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.海豚：一种准确预测 RNA 假尿嘧啶位点的新方法。

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab245.

Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications.基于注意力的多标签神经网络，用于十二种广泛存在的 RNA 修饰的综合预测和解释。

Nat Commun. 2021 Jun 29;12(1):4011. doi: 10.1038/s41467-021-24313-3.

DNN-m6A: A Cross-Species Method for Identifying RNA N6-Methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion.基于多信息融合的深度神经网络的跨物种 RNA N6-甲基腺苷位点识别方法 DNN-m6A

Genes (Basel). 2021 Feb 28;12(3):354. doi: 10.3390/genes12030354.

ReCGBM: a gradient boosting-based method for predicting human dicer cleavage sites.ReCGBM：一种基于梯度提升的人类 Dicer 切割位点预测方法。

BMC Bioinformatics. 2021 Feb 10;22(1):63. doi: 10.1186/s12859-021-03993-0.

Anthem: a user customised tool for fast and accurate prediction of binding between peptides and HLA class I molecules. anthem：一种用户自定义工具，用于快速准确地预测肽段与 HLA Ⅰ类分子的结合。

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa415.

XG-ac4C: identification of N4-acetylcytidine (ac4C) in mRNA using eXtreme gradient boosting with electron-ion interaction pseudopotentials.使用具有电子-离子相互作用赝势的极端梯度提升法鉴定 mRNA 中的 N4-乙酰胞苷（ac4C）。

Sci Rep. 2020 Dec 1;10(1):20942. doi: 10.1038/s41598-020-77824-2.

An Interpretable Prediction Model for Identifying N-Methylguanosine Sites Based on XGBoost and SHAP.一种基于XGBoost和SHAP的用于识别N-甲基鸟苷位点的可解释预测模型。

Mol Ther Nucleic Acids. 2020 Aug 25;22:362-372. doi: 10.1016/j.omtn.2020.08.022. eCollection 2020 Dec 4.

Computational identification of eukaryotic promoters based on cascaded deep capsule neural networks.基于级联深度胶囊神经网络的真核启动子计算识别。

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa299.

m5CPred-SVM: a novel method for predicting m5C sites of RNA.m5CPred-SVM：一种预测 RNA m5C 位点的新方法。

BMC Bioinformatics. 2020 Oct 30;21(1):489. doi: 10.1186/s12859-020-03828-4.

Identifying Circular RNA and Predicting Its Regulatory Interactions by Machine Learning.通过机器学习识别环状RNA并预测其调控相互作用

Front Genet. 2020 Jul 21;11:655. doi: 10.3389/fgene.2020.00655. eCollection 2020.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

Staem5：一种用于准确预测m5C位点的新型计算方法。

Staem5: A novel computational approachfor accurate prediction of m5C site.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献