• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习对m⁶A RNA修饰进行全转录组注释

Transcriptome-Wide Annotation of mC RNA Modifications Using Machine Learning.

作者信息

Song Jie, Zhai Jingjing, Bian Enze, Song Yujia, Yu Jiantao, Ma Chuang

机构信息

State Key Laboratory of Crop Stress Biology for Arid Areas, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Shaanxi, China.

Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Shaanxi, China.

出版信息

Front Plant Sci. 2018 Apr 18;9:519. doi: 10.3389/fpls.2018.00519. eCollection 2018.

DOI:10.3389/fpls.2018.00519
PMID:29720995
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5915569/
Abstract

The emergence of epitranscriptome opened a new chapter in gene regulation. 5-methylcytosine (mC), as an important post-transcriptional modification, has been identified to be involved in a variety of biological processes such as subcellular localization and translational fidelity. Though high-throughput experimental technologies have been developed and applied to profile mC modifications under certain conditions, transcriptome-wide studies of mC modifications are still hindered by the dynamic and reversible nature of mC and the lack of computational prediction methods. In this study, we introduced PEA-m5C, a machine learning-based mC predictor trained with features extracted from the flanking sequence of mC modifications. PEA-m5C yielded an average AUC (area under the receiver operating characteristic) of 0.939 in 10-fold cross-validation experiments based on known mC modifications. A rigorous independent testing showed that PEA-m5C (Accuracy [Acc] = 0.835, Matthews correlation coefficient [MCC] = 0.688) is remarkably superior to the recently developed mC predictor iRNAm5C-PseDNC (Acc = 0.665, MCC = 0.332). PEA-m5C has been applied to predict candidate mC modifications in annotated transcripts. Further analysis of these mC candidates showed that 4nt downstream of the translational start site is the most frequently methylated position. PEA-m5C is freely available to academic users at: https://github.com/cma2015/PEA-m5C.

摘要

表观转录组的出现开启了基因调控的新篇章。5-甲基胞嘧啶(mC)作为一种重要的转录后修饰,已被证实参与多种生物学过程,如亚细胞定位和翻译保真度。尽管高通量实验技术已得到发展并应用于在特定条件下描绘mC修饰,但mC修饰的全转录组研究仍受到mC的动态可逆性质以及缺乏计算预测方法的阻碍。在本研究中,我们引入了PEA-m5C,这是一种基于机器学习的mC预测器,使用从mC修饰侧翼序列提取的特征进行训练。在基于已知mC修饰的10折交叉验证实验中,PEA-m5C的平均AUC(受试者工作特征曲线下面积)为0.939。严格的独立测试表明,PEA-m5C(准确率[Acc]=0.835,马修斯相关系数[MCC]=0.688)明显优于最近开发的mC预测器iRNAm5C-PseDNC(Acc = 0.665,MCC = 0.332)。PEA-m5C已被应用于预测注释转录本中的候选mC修饰。对这些mC候选物的进一步分析表明,翻译起始位点下游4nt是最常发生甲基化的位置。学术用户可通过以下链接免费获取PEA-m5C:https://github.com/cma2015/PEA-m5C 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/3add353e2c2c/fpls-09-00519-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/e5d7c2a49b11/fpls-09-00519-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/eec78455d7bd/fpls-09-00519-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/0fc3c74dc487/fpls-09-00519-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/5de2a799bef1/fpls-09-00519-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/919b0923b0a0/fpls-09-00519-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/b9e4f3b048e9/fpls-09-00519-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/aba4f917d367/fpls-09-00519-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/3add353e2c2c/fpls-09-00519-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/e5d7c2a49b11/fpls-09-00519-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/eec78455d7bd/fpls-09-00519-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/0fc3c74dc487/fpls-09-00519-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/5de2a799bef1/fpls-09-00519-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/919b0923b0a0/fpls-09-00519-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/b9e4f3b048e9/fpls-09-00519-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/aba4f917d367/fpls-09-00519-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61f7/5915569/3add353e2c2c/fpls-09-00519-g0008.jpg

相似文献

1
Transcriptome-Wide Annotation of mC RNA Modifications Using Machine Learning.使用机器学习对m⁶A RNA修饰进行全转录组注释
Front Plant Sci. 2018 Apr 18;9:519. doi: 10.3389/fpls.2018.00519. eCollection 2018.
2
PEA: an integrated R toolkit for plant epitranscriptome analysis.PEA:一个用于植物表观转录组分析的集成 R 工具包。
Bioinformatics. 2018 Nov 1;34(21):3747-3749. doi: 10.1093/bioinformatics/bty421.
3
Prediction of m5C Modifications in RNA Sequences by Combining Multiple Sequence Features.通过结合多种序列特征预测RNA序列中的m5C修饰
Mol Ther Nucleic Acids. 2020 Sep 4;21:332-342. doi: 10.1016/j.omtn.2020.06.004. Epub 2020 Jun 10.
4
m5C-Seq: Machine learning-enhanced profiling of RNA 5-methylcytosine modifications.m5C-Seq:基于机器学习的 RNA 5-甲基胞嘧啶修饰谱分析。
Comput Biol Med. 2024 Nov;182:109087. doi: 10.1016/j.compbiomed.2024.109087. Epub 2024 Sep 3.
5
im5C-DSCGA: A Proposed Hybrid Framework Based on Improved DenseNet and Attention Mechanisms for Identifying 5-methylcytosine Sites in Human RNA.im5C-DSCGA:一种基于改进的 DenseNet 和注意力机制的混合框架,用于识别人类 RNA 中的 5-甲基胞嘧啶位点。
Front Biosci (Landmark Ed). 2023 Dec 26;28(12):346. doi: 10.31083/j.fbl2812346.
6
Staem5: A novel computational approachfor accurate prediction of m5C site.Staem5:一种用于准确预测m5C位点的新型计算方法。
Mol Ther Nucleic Acids. 2021 Oct 20;26:1027-1034. doi: 10.1016/j.omtn.2021.10.012. eCollection 2021 Dec 3.
7
Episo: quantitative estimation of RNA 5-methylcytosine at isoform level by high-throughput sequencing of RNA treated with bisulfite.片段:通过 RNA 经亚硫酸氢盐处理后的高通量测序对 RNA 5-甲基胞嘧啶进行同型水平的定量估计。
Bioinformatics. 2020 Apr 1;36(7):2033-2039. doi: 10.1093/bioinformatics/btz900.
8
m5C-Atlas: a comprehensive database for decoding and annotating the 5-methylcytosine (m5C) epitranscriptome.m5C-Atlas:一个全面的数据库,用于解码和注释 5-甲基胞嘧啶(m5C)转录组内表观遗传学修饰。
Nucleic Acids Res. 2022 Jan 7;50(D1):D196-D203. doi: 10.1093/nar/gkab1075.
9
Accurate RNA 5-methylcytosine site prediction based on heuristic physical-chemical properties reduction and classifier ensemble.基于启发式理化性质约简和分类器集成的准确RNA 5-甲基胞嘧啶位点预测
Anal Biochem. 2018 Jun 1;550:41-48. doi: 10.1016/j.ab.2018.03.027. Epub 2018 Apr 9.
10
m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features.m5CRegpred:基于测序特征的 5-甲基胞嘧啶(m5C)调控因子的转录组靶标预测。
Genes (Basel). 2022 Apr 12;13(4):677. doi: 10.3390/genes13040677.

引用本文的文献

1
m5CStack: An integrated framework for m5C site prediction using multi-feature stacking.m5CStack:一种使用多特征堆叠进行m5C位点预测的集成框架。
Comput Struct Biotechnol J. 2025 May 12;27:1901-1912. doi: 10.1016/j.csbj.2025.05.004. eCollection 2025.
2
A tale of two parasites: a glimpse into the RNA methylome of patient-derived Plasmodium falciparum and Plasmodium vivax isolates.两种寄生虫的故事:深入了解源自患者的恶性疟原虫和间日疟原虫分离株的RNA甲基化组。
Malar J. 2025 May 2;24(1):139. doi: 10.1186/s12936-025-05376-9.
3
Detection, molecular function and mechanisms of m5C in cancer.

本文引用的文献

1
PEA: an integrated R toolkit for plant epitranscriptome analysis.PEA:一个用于植物表观转录组分析的集成 R 工具包。
Bioinformatics. 2018 Nov 1;34(21):3747-3749. doi: 10.1093/bioinformatics/bty421.
2
5-Methylcytosine RNA Methylation in Arabidopsis Thaliana.拟南芥中 5-甲基胞嘧啶 RNA 甲基化。
Mol Plant. 2017 Nov 6;10(11):1387-1399. doi: 10.1016/j.molp.2017.09.013. Epub 2017 Sep 28.
3
iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition.
癌症中m5C的检测、分子功能及机制
Clin Transl Med. 2025 Mar;15(3):e70239. doi: 10.1002/ctm2.70239.
4
Bioinformatics for Inosine: Tools and Approaches to Trace This Elusive RNA Modification.肌苷的生物信息学:追踪这种难以捉摸的 RNA 修饰的工具和方法。
Genes (Basel). 2024 Jul 29;15(8):996. doi: 10.3390/genes15080996.
5
CDK13 promotes lipid deposition and prostate cancer progression by stimulating NSUN5-mediated m5C modification of ACC1 mRNA.CDK13 通过促进 NSUN5 介导的 ACC1 mRNA 的 m5C 修饰促进脂质沉积和前列腺癌进展。
Cell Death Differ. 2023 Dec;30(12):2462-2476. doi: 10.1038/s41418-023-01223-z. Epub 2023 Oct 16.
6
Identification of mC-related molecular subtypes and prediction models in the prognosis and tumor microenvironment infiltration of soft tissue sarcoma.软组织肉瘤中与mC相关的分子亚型鉴定及其在预后和肿瘤微环境浸润中的预测模型
Heliyon. 2023 Sep 1;9(9):e19680. doi: 10.1016/j.heliyon.2023.e19680. eCollection 2023 Sep.
7
XGBoost framework with feature selection for the prediction of RNA N5-methylcytosine sites.XGBoost 框架与特征选择相结合,用于预测 RNA N5-甲基胞嘧啶位点。
Mol Ther. 2023 Aug 2;31(8):2543-2551. doi: 10.1016/j.ymthe.2023.05.016. Epub 2023 Jun 3.
8
Dynamic regulation and key roles of ribonucleic acid methylation.核糖核酸甲基化的动态调控及关键作用
Front Cell Neurosci. 2022 Dec 19;16:1058083. doi: 10.3389/fncel.2022.1058083. eCollection 2022.
9
Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.Deepm5C:一种基于深度学习的混合框架,使用堆叠策略识别人类 RNA N5-甲基胞嘧啶位点。
Mol Ther. 2022 Aug 3;30(8):2856-2867. doi: 10.1016/j.ymthe.2022.05.001. Epub 2022 May 6.
10
m5Cpred-XS: A New Method for Predicting RNA m5C Sites Based on XGBoost and SHAP.m5Cpred-XS:一种基于XGBoost和SHAP预测RNA m5C位点的新方法。
Front Genet. 2022 Mar 30;13:853258. doi: 10.3389/fgene.2022.853258. eCollection 2022.
iRNAm5C-PseDNC:通过将理化性质融入伪二核苷酸组成来识别RNA 5-甲基胞嘧啶位点
Oncotarget. 2017 Jun 20;8(25):41178-41188. doi: 10.18632/oncotarget.17104.
4
agriGO v2.0: a GO analysis toolkit for the agricultural community, 2017 update.agriGO v2.0:农业社区的 GO 分析工具包,2017 年更新。
Nucleic Acids Res. 2017 Jul 3;45(W1):W122-W129. doi: 10.1093/nar/gkx382.
5
5-methylcytosine promotes mRNA export - NSUN2 as the methyltransferase and ALYREF as an mC reader.5-甲基胞嘧啶促进mRNA输出——NSUN2作为甲基转移酶,ALYREF作为mC阅读器。
Cell Res. 2017 May;27(5):606-625. doi: 10.1038/cr.2017.55. Epub 2017 Apr 18.
6
Detecting RNA modifications in the epitranscriptome: predict and validate.检测外转录组中的 RNA 修饰:预测和验证。
Nat Rev Genet. 2017 May;18(5):275-291. doi: 10.1038/nrg.2016.169. Epub 2017 Feb 20.
7
Detecting N-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines.基于集成支持向量机从 RNA 转录组中检测 N6-甲基腺苷位点。
Sci Rep. 2017 Jan 12;7:40242. doi: 10.1038/srep40242.
8
Distinct 5-methylcytosine profiles in poly(A) RNA from mouse embryonic stem cells and brain.来自小鼠胚胎干细胞和大脑的聚腺苷酸RNA中独特的5-甲基胞嘧啶谱。
Genome Biol. 2017 Jan 5;18(1):1. doi: 10.1186/s13059-016-1139-1.
9
Transcriptome-Wide Mapping of RNA 5-Methylcytosine in Arabidopsis mRNAs and Noncoding RNAs.拟南芥信使核糖核酸和非编码核糖核酸中RNA 5-甲基胞嘧啶的全转录组图谱
Plant Cell. 2017 Mar;29(3):445-460. doi: 10.1105/tpc.16.00751. Epub 2017 Jan 6.
10
Epitranscriptome sequencing technologies: decoding RNA modifications.转录组测序技术:解码 RNA 修饰。
Nat Methods. 2016 Dec 29;14(1):23-31. doi: 10.1038/nmeth.4110.