广泛存在于细胞系和组织中的 m6A RNA 修饰的可解释预测模型。

Interpretable prediction models for widespread m6A RNA modification across cell lines and tissues.

机构信息

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.

出版信息

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad709.

DOI:10.1093/bioinformatics/btad709

PMID:37995291

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10697738/

Abstract

MOTIVATION

RNA N6-methyladenosine (m6A) in Homo sapiens plays vital roles in a variety of biological functions. Precise identification of m6A modifications is thus essential to elucidation of their biological functions and underlying molecular-level mechanisms. Currently available high-throughput single-nucleotide-resolution m6A modification data considerably accelerated the identification of RNA modification sites through the development of data-driven computational methods. Nevertheless, existing methods have limitations in terms of the coverage of single-nucleotide-resolution cell lines and have poor capability in model interpretations, thereby having limited applicability.

RESULTS

In this study, we present CLSM6A, comprising a set of deep learning-based models designed for predicting single-nucleotide-resolution m6A RNA modification sites across eight different cell lines and three tissues. Extensive benchmarking experiments are conducted on well-curated datasets and accordingly, CLSM6A achieves superior performance than current state-of-the-art methods. Furthermore, CLSM6A is capable of interpreting the prediction decision-making process by excavating critical motifs activated by filters and pinpointing highly concerned positions in both forward and backward propagations. CLSM6A exhibits better portability on similar cross-cell line/tissue datasets, reveals a strong association between highly activated motifs and high-impact motifs, and demonstrates complementary attributes of different interpretation strategies.

AVAILABILITY AND IMPLEMENTATION

The webserver is available at http://csbio.njust.edu.cn/bioinf/clsm6a. The datasets and code are available at https://github.com/zhangying-njust/CLSM6A/.

摘要

动机

人类的 RNA N6-甲基腺苷（m6A）在各种生物功能中起着至关重要的作用。因此，精确识别 m6A 修饰对于阐明其生物学功能和潜在的分子水平机制至关重要。目前可用的高通量单核苷酸分辨率 m6A 修饰数据通过开发数据驱动的计算方法极大地加速了 RNA 修饰位点的识别。然而，现有的方法在单核苷酸分辨率细胞系的覆盖范围方面存在局限性，并且在模型解释方面能力较差，因此适用性有限。

结果

在这项研究中，我们提出了 CLSM6A，它由一组基于深度学习的模型组成，旨在预测跨越八种不同细胞系和三种组织的单核苷酸分辨率 m6A RNA 修饰位点。在精心整理的数据集上进行了广泛的基准测试实验，结果表明 CLSM6A 的性能优于当前最先进的方法。此外，CLSM6A 能够通过挖掘由过滤器激活的关键基序并确定正向和反向传播中高度关注的位置，来解释预测决策过程。CLSM6A 在类似的跨细胞系/组织数据集上具有更好的可移植性，揭示了高度激活的基序和高影响基序之间的强相关性，并展示了不同解释策略的互补属性。

可用性和实现

该网络服务器可在 http://csbio.njust.edu.cn/bioinf/clsm6a 访问。数据集和代码可在 https://github.com/zhangying-njust/CLSM6A/ 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c63/10697738/35ce6c3a94b2/btad709f1.jpg

相似文献

Interpretable prediction models for widespread m6A RNA modification across cell lines and tissues.广泛存在于细胞系和组织中的 m6A RNA 修饰的可解释预测模型。

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad709.

Discovering Consensus Regions for Interpretable Identification of RNA N6-Methyladenosine Modification Sites via Graph Contrastive Clustering.通过图对比聚类发现可解释的 RNA N6-甲基腺苷修饰位点识别的共识区域。

IEEE J Biomed Health Inform. 2024 Apr;28(4):2362-2372. doi: 10.1109/JBHI.2024.3357979. Epub 2024 Apr 4.

Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。

Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.

Modeling multi-species RNA modification through multi-task curriculum learning.通过多任务课程学习对多物种 RNA 修饰进行建模。

Nucleic Acids Res. 2021 Apr 19;49(7):3719-3734. doi: 10.1093/nar/gkab124.

DeepM6ASeq: prediction and characterization of m6A-containing sequences using deep learning.DeepM6ASeq：使用深度学习预测和描述 m6A 序列

BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):524. doi: 10.1186/s12859-018-2516-4.

RNAMethPre: A Web Server for the Prediction and Query of mRNA m6A Sites.RNAMethPre：一个用于预测和查询mRNA m6A位点的网络服务器。

PLoS One. 2016 Oct 10;11(10):e0162707. doi: 10.1371/journal.pone.0162707. eCollection 2016.

ELMo4m6A: A Contextual Language Embedding-Based Predictor for Detecting RNA N6-Methyladenosine Sites.ELMo4m6A：一种基于上下文语言嵌入的RNA N6-甲基腺苷位点检测预测器。

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):944-954. doi: 10.1109/TCBB.2022.3173323. Epub 2023 Apr 3.

PEA-m6A: an ensemble learning framework for accurately predicting N6-methyladenosine modifications in plants.PEA-m6A：一种用于准确预测植物中 N6-甲基腺苷修饰的集成学习框架。

Plant Physiol. 2024 May 31;195(2):1200-1213. doi: 10.1093/plphys/kiae120.

Improving N(6)-methyladenosine site prediction with heuristic selection of nucleotide physical-chemical properties.通过启发式选择核苷酸物理化学性质改进N(6)-甲基腺苷位点预测

Anal Biochem. 2016 Sep 1;508:104-13. doi: 10.1016/j.ab.2016.06.001. Epub 2016 Jun 11.

RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘

BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.

引用本文的文献

Multimodal zero-shot learning of previously unseen epitranscriptomes from RNA-seq data.从RNA测序数据中对以前未见过的表观转录组进行多模态零样本学习。

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf332.

Interpretability-guided RNA N-methyladenosine modification site prediction with invertible neural networks.基于可逆神经网络的可解释性引导的RNA N-甲基腺苷修饰位点预测

Commun Biol. 2025 Jul 8;8(1):1022. doi: 10.1038/s42003-025-08265-8.

Methyl-GP: accurate generic DNA methylation prediction based on a language model and representation learning.甲基化基因组图谱（Methyl-GP）：基于语言模型和表征学习的准确通用DNA甲基化预测

Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf223.

Statistical modeling of single-cell epitranscriptomics enabled trajectory and regulatory inference of RNA methylation.单细胞表观转录组学的统计建模实现了RNA甲基化的轨迹和调控推断。

Cell Genom. 2025 Jan 8;5(1):100702. doi: 10.1016/j.xgen.2024.100702. Epub 2024 Dec 5.

Interpretable deep cross networks unveiled common signatures of dysregulated epitranscriptomes across 12 cancer types.可解释的深度交叉网络揭示了12种癌症类型中失调的表观转录组的共同特征。

Mol Ther Nucleic Acids. 2024 Oct 29;35(4):102376. doi: 10.1016/j.omtn.2024.102376. eCollection 2024 Dec 10.

本文引用的文献

TS-m6A-DL: Tissue-specific identification of N6-methyladenosine sites using a universal deep learning model.TS-m6A-DL：使用通用深度学习模型对N6-甲基腺嘌呤位点进行组织特异性识别。

Comput Struct Biotechnol J. 2021 Aug 10;19:4619-4625. doi: 10.1016/j.csbj.2021.08.014. eCollection 2021.

The functions and prognostic values of m6A RNA methylation regulators in thyroid carcinoma.m6A RNA甲基化调节剂在甲状腺癌中的功能及预后价值

Cancer Cell Int. 2021 Jul 19;21(1):385. doi: 10.1186/s12935-021-02090-9.

Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications.基于注意力的多标签神经网络，用于十二种广泛存在的 RNA 修饰的综合预测和解释。

Nat Commun. 2021 Jun 29;12(1):4011. doi: 10.1038/s41467-021-24313-3.

Splice site mA methylation prevents binding of U2AF35 to inhibit RNA splicing.剪接位点 mA 甲基化阻止 U2AF35 结合，从而抑制 RNA 剪接。

Cell. 2021 Jun 10;184(12):3125-3142.e25. doi: 10.1016/j.cell.2021.03.062. Epub 2021 Apr 29.

Modeling multi-species RNA modification through multi-task curriculum learning.通过多任务课程学习对多物种 RNA 修饰进行建模。

Nucleic Acids Res. 2021 Apr 19;49(7):3719-3734. doi: 10.1093/nar/gkab124.

im6A-TS-CNN: Identifying the N-Methyladenine Site in Multiple Tissues by Using the Convolutional Neural Network.im6A-TS-CNN：利用卷积神经网络识别多种组织中的N-甲基腺嘌呤位点

Mol Ther Nucleic Acids. 2020 Sep 4;21:1044-1049. doi: 10.1016/j.omtn.2020.07.034. Epub 2020 Jul 31.

m6A-Atlas: a comprehensive knowledgebase for unraveling the N6-methyladenosine (m6A) epitranscriptome.m6A-Atlas：一个全面的知识库，用于揭示 N6-甲基腺苷（m6A）转录组内的修饰信息。

Nucleic Acids Res. 2021 Jan 8;49(D1):D134-D143. doi: 10.1093/nar/gkaa692.

Computational identification of N6-methyladenosine sites in multiple tissues of mammals.哺乳动物多个组织中N6-甲基腺嘌呤位点的计算识别

Comput Struct Biotechnol J. 2020 Apr 30;18:1084-1091. doi: 10.1016/j.csbj.2020.04.015. eCollection 2020.

Writers, readers and erasers of RNA modifications in cancer.癌症中 RNA 修饰的书写者、读者和擦除器。

Cancer Lett. 2020 Apr 1;474:127-137. doi: 10.1016/j.canlet.2020.01.021. Epub 2020 Jan 25.

RNA mA Methyltransferase METTL3 Promotes The Growth Of Prostate Cancer By Regulating Hedgehog Pathway.RNA N⁶-甲基腺嘌呤甲基转移酶METTL3通过调控刺猬信号通路促进前列腺癌生长。

Onco Targets Ther. 2019 Nov 5;12:9143-9152. doi: 10.2147/OTT.S226796. eCollection 2019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

广泛存在于细胞系和组织中的 m6A RNA 修饰的可解释预测模型。

Interpretable prediction models for widespread m6A RNA modification across cell lines and tissues.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

动机

结果

可用性和实现

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献