• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

m5c-iEnsem:通过集成模型进行5-甲基胞嘧啶位点识别。

m5c-iEnsem: 5-methylcytosine sites identification through ensemble models.

作者信息

Bilal Anas, Alarfaj Fawaz Khaled, Khan Rafaqat Alam, Suleman Muhammad Taseer, Long Haixia

机构信息

College of Information Science and Technology, Hainan Normal University, Haikou 571158, China.

Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, Haikou 571158, China.

出版信息

Bioinformatics. 2022 Jan 1;41(1). doi: 10.1093/bioinformatics/btae722.

DOI:10.1093/bioinformatics/btae722
PMID:39657957
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11911556/
Abstract

MOTIVATION

5-Methylcytosine (m5c), a modified cytosine base, arises from adding a methyl group at the 5th carbon position. This modification is a prevalent form of post-transcriptional modification (PTM) found in various types of RNA. Traditional laboratory techniques often fail to provide rapid and accurate identification of m5c sites. However, with the growing accessibility of sequence data, expanding computational models offers a more efficient and reliable approach to m5c site detection. This research focused on creating advanced in-silico methods using ensemble learning techniques. The encoded data was processed through ensemble models, including bagging and boosting techniques. These models were then rigorously evaluated through independent testing and 10-fold cross-validation.

RESULTS

Among the models tested, the Bagging ensemble-based predictor, m5C-iEnsem, demonstrated superior performance to existing m5c prediction tools.

AVAILABILITY AND IMPLEMENTATION

To further support the research community, m5c-iEnsem has been made available via a user-friendly web server at https://m5c-iensem.streamlit.app/.

摘要

动机

5-甲基胞嘧啶(m5c)是一种经过修饰的胞嘧啶碱基,通过在第5个碳位置添加一个甲基而产生。这种修饰是在各种类型的RNA中发现的一种普遍的转录后修饰(PTM)形式。传统的实验室技术常常无法快速准确地识别m5c位点。然而,随着序列数据的获取越来越容易,扩展计算模型为m5c位点检测提供了一种更高效、更可靠的方法。本研究专注于使用集成学习技术创建先进的计算机模拟方法。编码数据通过包括装袋和提升技术在内的集成模型进行处理。然后通过独立测试和10折交叉验证对这些模型进行严格评估。

结果

在测试的模型中,基于装袋集成的预测器m5C-iEnsem表现出优于现有m5c预测工具的性能。

可用性和实现方式

为了进一步支持研究界,m5c-iEnsem已通过一个用户友好的网络服务器在https://m5c-iensem.streamlit.app/上提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/193eef7d5113/btae722f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/864872d91bba/btae722f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/9bc7c17337c4/btae722f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/b3e7c35f92f6/btae722f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/bb9e9dd0684d/btae722f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/021e14533154/btae722f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/194183d91a24/btae722f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/b23df8050a1b/btae722f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/2a9c98171f36/btae722f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/fffabfd9fdd4/btae722f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/74aea3770b27/btae722f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/193eef7d5113/btae722f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/864872d91bba/btae722f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/9bc7c17337c4/btae722f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/b3e7c35f92f6/btae722f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/bb9e9dd0684d/btae722f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/021e14533154/btae722f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/194183d91a24/btae722f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/b23df8050a1b/btae722f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/2a9c98171f36/btae722f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/fffabfd9fdd4/btae722f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/74aea3770b27/btae722f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb18/11911556/193eef7d5113/btae722f11.jpg

相似文献

1
m5c-iEnsem: 5-methylcytosine sites identification through ensemble models.m5c-iEnsem:通过集成模型进行5-甲基胞嘧啶位点识别。
Bioinformatics. 2022 Jan 1;41(1). doi: 10.1093/bioinformatics/btae722.
2
m5c-iDeep: 5-Methylcytosine sites identification through deep learning.m5c-iDeep:通过深度学习识别5-甲基胞嘧啶位点
Methods. 2024 Oct;230:80-90. doi: 10.1016/j.ymeth.2024.07.008. Epub 2024 Jul 31.
3
im5C-DSCGA: A Proposed Hybrid Framework Based on Improved DenseNet and Attention Mechanisms for Identifying 5-methylcytosine Sites in Human RNA.im5C-DSCGA:一种基于改进的 DenseNet 和注意力机制的混合框架,用于识别人类 RNA 中的 5-甲基胞嘧啶位点。
Front Biosci (Landmark Ed). 2023 Dec 26;28(12):346. doi: 10.31083/j.fbl2812346.
4
Staem5: A novel computational approachfor accurate prediction of m5C site.Staem5:一种用于准确预测m5C位点的新型计算方法。
Mol Ther Nucleic Acids. 2021 Oct 20;26:1027-1034. doi: 10.1016/j.omtn.2021.10.012. eCollection 2021 Dec 3.
5
m5CPred-SVM: a novel method for predicting m5C sites of RNA.m5CPred-SVM:一种预测 RNA m5C 位点的新方法。
BMC Bioinformatics. 2020 Oct 30;21(1):489. doi: 10.1186/s12859-020-03828-4.
6
m5C-Seq: Machine learning-enhanced profiling of RNA 5-methylcytosine modifications.m5C-Seq:基于机器学习的 RNA 5-甲基胞嘧啶修饰谱分析。
Comput Biol Med. 2024 Nov;182:109087. doi: 10.1016/j.compbiomed.2024.109087. Epub 2024 Sep 3.
7
RNAm5Cfinder: A Web-server for Predicting RNA 5-methylcytosine (m5C) Sites Based on Random Forest.RNAm5Cfinder:一个基于随机森林预测 RNA 5-甲基胞嘧啶(m5C)位点的网络服务器。
Sci Rep. 2018 Nov 23;8(1):17299. doi: 10.1038/s41598-018-35502-4.
8
Trans-m5C: A transformer-based model for predicting 5-methylcytosine (m5C) sites.Trans-m5C:一种基于Transformer的用于预测5-甲基胞嘧啶(m5C)位点的模型。
Methods. 2025 Feb;234:178-186. doi: 10.1016/j.ymeth.2024.12.010. Epub 2024 Dec 30.
9
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models.m1A-Ensem:通过集成模型准确识别1-甲基腺苷位点。
BioData Min. 2024 Feb 15;17(1):4. doi: 10.1186/s13040-023-00353-x.
10
Evaluation of different computational methods on 5-methylcytosine sites identification.不同计算方法在 5-甲基胞嘧啶位点识别中的评估。
Brief Bioinform. 2020 May 21;21(3):982-995. doi: 10.1093/bib/bbz048.

引用本文的文献

1
Artificial neural network-driven approaches to improved forecasting of disability care expenditures in an aging Kingdom of Saudi Arabia population.人工神经网络驱动的方法用于改善对沙特阿拉伯王国老龄化人口残疾护理支出的预测。
Sci Rep. 2025 Jul 1;15(1):20538. doi: 10.1038/s41598-025-05364-8.
2
2OM-Pred: prediction of 2-O-methylation sites in ribonucleic acid using diverse classifiers.2OM-Pred:使用多种分类器预测核糖核酸中的2-O-甲基化位点。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf282.
3
Dual level dengue diagnosis using lightweight multilayer perceptron with XAI in fog computing environment and rule based inference.

本文引用的文献

1
Vital roles of mC RNA modification in cancer and immune cell biology.mC RNA 修饰在癌症和免疫细胞生物学中的重要作用。
Front Immunol. 2023 May 31;14:1207371. doi: 10.3389/fimmu.2023.1207371. eCollection 2023.
2
iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models.iDHU-Ensem:通过集成学习模型识别二氢尿苷位点。
Digit Health. 2023 Mar 29;9:20552076231165963. doi: 10.1177/20552076231165963. eCollection 2023 Jan-Dec.
3
IGPred-HDnet: Prediction of Immunoglobulin Proteins Using Graphical Features and the Hierarchal Deep Learning-Based Approach.
在雾计算环境中使用具有可解释人工智能的轻量级多层感知器和基于规则的推理进行双级登革热诊断
Sci Rep. 2025 May 13;15(1):16548. doi: 10.1038/s41598-025-98365-6.
4
Advanced leukocyte classification using attention mechanisms and dual channel U-Net architecture.基于注意力机制和双通道U-Net架构的高级白细胞分类
Sci Rep. 2025 Apr 22;15(1):13825. doi: 10.1038/s41598-025-96918-3.
5
mRNA Transcript Variants Expressed in Mammalian Cells.在哺乳动物细胞中表达的信使核糖核酸转录变体
Int J Mol Sci. 2025 Jan 26;26(3):1052. doi: 10.3390/ijms26031052.
IGPred-HDnet:基于图特征和层次深度学习的免疫球蛋白蛋白预测方法。
Comput Intell Neurosci. 2023 Jan 25;2023:2465414. doi: 10.1155/2023/2465414. eCollection 2023.
4
DeepGpgs: a novel deep learning framework for predicting arginine methylation sites combined with Gaussian prior and gated self-attention mechanism.深度Gpgs:一种结合高斯先验和门控自注意力机制预测精氨酸甲基化位点的新型深度学习框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad018.
5
m5C regulator-mediated modification patterns and tumor microenvironment infiltration characterization in colorectal cancer: One step closer to precision medicine.m5C 调节剂介导的修饰模式与结直肠癌肿瘤微环境浸润特征:向精准医学迈进了一步。
Front Immunol. 2022 Dec 1;13:1049435. doi: 10.3389/fimmu.2022.1049435. eCollection 2022.
6
Convolutional ProteinUnetLM competitive with long short-term memory-based protein secondary structure predictors.卷积蛋白UnetLM与基于长短期记忆的蛋白质二级结构预测器具有竞争力。
Proteins. 2023 May;91(5):608-618. doi: 10.1002/prot.26452. Epub 2022 Dec 5.
7
DHU-Pred: accurate prediction of dihydrouridine sites using position and composition variant features on diverse classifiers.DHU-Pred:使用多种分类器上的位置和组成变体特征准确预测二氢尿嘧啶位点。
PeerJ. 2022 Oct 27;10:e14104. doi: 10.7717/peerj.14104. eCollection 2022.
8
A machine learning technique for identifying DNA enhancer regions utilizing CIS-regulatory element patterns.一种利用 CIS 调控元件模式识别 DNA 增强子区域的机器学习技术。
Sci Rep. 2022 Sep 7;12(1):15183. doi: 10.1038/s41598-022-19099-3.
9
Machine learning techniques for identification of carcinogenic mutations, which cause breast adenocarcinoma.机器学习技术用于鉴定致癌突变,这些突变导致乳腺腺癌。
Sci Rep. 2022 Jul 11;12(1):11738. doi: 10.1038/s41598-022-15533-8.
10
m1A-pred: Prediction of Modified 1-methyladenosine Sites in RNA Sequences through Artificial Intelligence.m1A-pred:通过人工智能预测 RNA 序列中的修饰 1-甲基腺苷位点。
Comb Chem High Throughput Screen. 2022;25(14):2473-2484. doi: 10.2174/1386207325666220617152743.