• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

硫氧碳钡矿:使用集成预测器改进长链非编码RNA甲基化位点预测

LITHOPHONE: Improving lncRNA Methylation Site Prediction Using an Ensemble Predictor.

作者信息

Liu Lian, Lei Xiujuan, Fang Zengqiang, Tang Yujiao, Meng Jia, Wei Zhen

机构信息

School of Computer Sciences, Shannxi Normal University, Xi'an, China.

Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, China.

出版信息

Front Genet. 2020 Jun 9;11:545. doi: 10.3389/fgene.2020.00545. eCollection 2020.

DOI:10.3389/fgene.2020.00545
PMID:32582286
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7297269/
Abstract

-methyladenosine (mA) is one of the most widely studied epigenetic modifications, which plays an important role in many biological processes, such as splicing, RNA localization, and degradation. Studies have shown that mA on lncRNA has important functions, including regulating the expression and functions of lncRNA, regulating the synthesis of pre-mRNA, promoting the proliferation of cancer cells, and affecting cell differentiation and many others. Although a number of methods have been proposed to predict mA RNA methylation sites, most of these methods aimed at general mA sites prediction without noticing the uniqueness of the lncRNA methylation prediction problem. Since many lncRNAs do not have a polyA tail and cannot be captured in the polyA selection step of the most widely adopted RNA-seq library preparation protocol, lncRNA methylation sites cannot be effectively captured and are thus likely to be significantly underrepresented in existing experimental data affecting the accuracy of existing predictors. In this paper, we propose a new computational framework, , which stands for ong noncodng RNA meylatin sites rediction from sequence caracteristics and genmic iformation with an nsemble predictor. We show that the methylation sites of lncRNA and mRNA have different patterns exhibited in the extracted features and should be differently handled when making predictions. Due to the used experiment protocols, the number of known lncRNA mA sites is limited, and insufficient to train a reliable predictor; thus, the performance can be improved by combining both lncRNA and mRNA data using an ensemble predictor. We show that the newly developed LITHOPHONE approach achieved a reasonably good performance when tested on independent datasets (AUC: 0.966 and 0.835 under full transcript and mature mRNA modes, respectively), marking a substantial improvement compared with existing methods. Additionally, LITHOPHONE was applied to scan the entire human lncRNAome for all possible lncRNA mA sites, and the results are freely accessible at: http://180.208.58.19/lith/.

摘要

N6-甲基腺苷(mA)是研究最为广泛的表观遗传修饰之一,它在许多生物学过程中发挥着重要作用,如剪接、RNA定位和降解。研究表明,长链非编码RNA(lncRNA)上的mA具有重要功能,包括调节lncRNA的表达和功能、调控前体mRNA的合成、促进癌细胞增殖以及影响细胞分化等诸多方面。尽管已经提出了许多方法来预测mA RNA甲基化位点,但这些方法大多旨在预测一般的mA位点,而没有注意到lncRNA甲基化预测问题的独特性。由于许多lncRNA没有polyA尾,无法在最广泛采用的RNA测序文库制备方案的polyA选择步骤中被捕获,lncRNA甲基化位点无法被有效捕获,因此在现有实验数据中可能显著代表性不足,影响了现有预测器的准确性。在本文中,我们提出了一种新的计算框架LITHOPHONE,它代表基于序列特征和基因组信息并结合集成预测器的长链非编码RNA甲基化位点预测。我们表明,lncRNA和mRNA的甲基化位点在提取的特征中表现出不同的模式,在进行预测时应区别对待。由于所使用的实验方案,已知的lncRNA mA位点数量有限,不足以训练出一个可靠的预测器;因此,通过使用集成预测器结合lncRNA和mRNA数据可以提高性能。我们表明,新开发的LITHOPHONE方法在独立数据集上测试时取得了相当不错的性能(在完整转录本和成熟mRNA模式下,AUC分别为0.966和0.835),与现有方法相比有了显著改进。此外,LITHOPHONE被应用于扫描整个人类lncRNA组以寻找所有可能的lncRNA mA位点,结果可在以下网址免费获取:http://180.208.58.19/lith/ 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/36e1e2921778/fgene-11-00545-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/30ae92768cd3/fgene-11-00545-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/3e3b5169d272/fgene-11-00545-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/36e1e2921778/fgene-11-00545-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/30ae92768cd3/fgene-11-00545-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/3e3b5169d272/fgene-11-00545-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c30/7297269/36e1e2921778/fgene-11-00545-g0003.jpg

相似文献

1
LITHOPHONE: Improving lncRNA Methylation Site Prediction Using an Ensemble Predictor.硫氧碳钡矿:使用集成预测器改进长链非编码RNA甲基化位点预测
Front Genet. 2020 Jun 9;11:545. doi: 10.3389/fgene.2020.00545. eCollection 2020.
2
WITMSG: Large-scale Prediction of Human Intronic mA RNA Methylation Sites from Sequence and Genomic Features.WITMSG:基于序列和基因组特征对人类内含子m⁶A RNA甲基化位点进行大规模预测
Curr Genomics. 2020 Jan;21(1):67-76. doi: 10.2174/1389202921666200211104140.
3
EDLmAPred: ensemble deep learning approach for mRNA mA site prediction.EDLmAPred:用于预测 mRNA mA 位点的集成深度学习方法。
BMC Bioinformatics. 2021 May 29;22(1):288. doi: 10.1186/s12859-021-04206-4.
4
WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach.WHISTLE:一种使用机器学习方法预测的人类 N6-甲基腺苷(m6A)转录组表观遗传学图谱。
Nucleic Acids Res. 2019 Apr 23;47(7):e41. doi: 10.1093/nar/gkz074.
5
mA Reader: Epitranscriptome Target Prediction and Functional Characterization of -Methyladenosine (mA) Readers.mA 阅读器:N6-甲基腺苷(mA)阅读器的表观转录组靶点预测与功能表征
Front Cell Dev Biol. 2020 Aug 11;8:741. doi: 10.3389/fcell.2020.00741. eCollection 2020.
6
EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction.EMDLP:用于 RNA 甲基化位点预测的集成多尺度深度学习模型。
BMC Bioinformatics. 2022 Jun 8;23(1):221. doi: 10.1186/s12859-022-04756-1.
7
WHISTLE: A Functionally Annotated High-Accuracy Map of Human mA Epitranscriptome.WHISTLE:人类 mA 转录组功能注释的高精度图谱。
Methods Mol Biol. 2021;2284:519-529. doi: 10.1007/978-1-0716-1307-8_28.
8
N6-methyladenosine methylation analysis of long noncoding RNAs and mRNAs in 5-FU-resistant colon cancer cells.5-FU 耐药结肠癌细胞中长非编码 RNA 和 mRNAs 的 N6-甲基腺苷甲基化分析。
Epigenetics. 2024 Dec;19(1):2298058. doi: 10.1080/15592294.2023.2298058. Epub 2023 Dec 25.
9
DRUM: Inference of Disease-Associated mA RNA Methylation Sites From a Multi-Layer Heterogeneous Network.DRUM:从多层异构网络推断疾病相关的mA RNA甲基化位点
Front Genet. 2019 Apr 3;10:266. doi: 10.3389/fgene.2019.00266. eCollection 2019.
10
M6A-GSMS: Computational identification of N-methyladenosine sites with GBDT and stacking learning in multiple species.M6A-GSMS:利用梯度提升决策树(GBDT)和堆叠学习在多个物种中对N-甲基腺苷位点进行计算识别
J Biomol Struct Dyn. 2022;40(22):12380-12391. doi: 10.1080/07391102.2021.1970628. Epub 2021 Aug 30.

引用本文的文献

1
Dynamic m6A-ncRNAs association and their impact on cancer pathogenesis, immune regulation and therapeutic response.动态m6A修饰的非编码RNA关联及其对癌症发病机制、免疫调节和治疗反应的影响。
Genes Dis. 2021 Nov 10;10(1):135-150. doi: 10.1016/j.gendis.2021.10.004. eCollection 2023 Jan.
2
Dynamic regulation and key roles of ribonucleic acid methylation.核糖核酸甲基化的动态调控及关键作用
Front Cell Neurosci. 2022 Dec 19;16:1058083. doi: 10.3389/fncel.2022.1058083. eCollection 2022.
3
MTDeepM6A-2S: A two-stage multi-task deep learning method for predicting RNA N6-methyladenosine sites of .

本文引用的文献

1
WITMSG: Large-scale Prediction of Human Intronic mA RNA Methylation Sites from Sequence and Genomic Features.WITMSG:基于序列和基因组特征对人类内含子m⁶A RNA甲基化位点进行大规模预测
Curr Genomics. 2020 Jan;21(1):67-76. doi: 10.2174/1389202921666200211104140.
2
PIANO: A Web Server for Pseudouridine-Site (Ψ) Identification and Functional Annotation.PIANO:用于假尿苷位点(Ψ)识别和功能注释的网络服务器。
Front Genet. 2020 Mar 12;11:88. doi: 10.3389/fgene.2020.00088. eCollection 2020.
3
WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach.
MTDeepM6A - 2S:一种用于预测……的RNA N6 - 甲基腺嘌呤位点的两阶段多任务深度学习方法。 (原文中“of”后面内容缺失)
Front Microbiol. 2022 Oct 5;13:999506. doi: 10.3389/fmicb.2022.999506. eCollection 2022.
4
Novel insight into the functions of N‑methyladenosine modified lncRNAs in cancers (Review).新型洞察 N6-甲基腺苷修饰的长链非编码 RNA 在癌症中的功能(综述)。
Int J Oncol. 2022 Dec;61(6). doi: 10.3892/ijo.2022.5442. Epub 2022 Oct 20.
5
The integration of single-cell sequencing, TCGA, and GEO data analysis revealed that PRRT3-AS1 is a biomarker and therapeutic target of SKCM.单细胞测序、TCGA 和 GEO 数据分析的整合表明,PRRT3-AS1 是 SKCM 的一个生物标志物和治疗靶点。
Front Immunol. 2022 Sep 23;13:919145. doi: 10.3389/fimmu.2022.919145. eCollection 2022.
6
Long noncoding RNA lnc-H2AFV-1 promotes cell growth by regulating aberrant m6A RNA modification in head and neck squamous cell carcinoma.长链非编码 RNA lnc-H2AFV-1 通过调节头颈部鳞状细胞癌中异常的 m6A RNA 修饰促进细胞生长。
Cancer Sci. 2022 Jun;113(6):2071-2084. doi: 10.1111/cas.15366. Epub 2022 Apr 22.
7
Construction of Prognostic Risk Model of 5-Methylcytosine-Related Long Non-Coding RNAs and Evaluation of the Characteristics of Tumor-Infiltrating Immune Cells in Breast Cancer.5-甲基胞嘧啶相关长链非编码RNA的乳腺癌预后风险模型构建及肿瘤浸润免疫细胞特征评估
Front Genet. 2021 Oct 29;12:748279. doi: 10.3389/fgene.2021.748279. eCollection 2021.
8
Comprehensive analysis of differences of N-methyladenosine of lncRNAs between atrazine-induced and normal Xenopus laevis testis.阿特拉津诱导的非洲爪蟾睾丸与正常非洲爪蟾睾丸lncRNAs的N-甲基腺苷差异的综合分析
Genes Environ. 2021 Nov 6;43(1):49. doi: 10.1186/s41021-021-00223-0.
9
Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications.基于注意力的多标签神经网络,用于十二种广泛存在的 RNA 修饰的综合预测和解释。
Nat Commun. 2021 Jun 29;12(1):4011. doi: 10.1038/s41467-021-24313-3.
10
Recent advances in functional annotation and prediction of the epitranscriptome.表观转录组功能注释与预测的最新进展。
Comput Struct Biotechnol J. 2021 May 21;19:3015-3026. doi: 10.1016/j.csbj.2021.05.030. eCollection 2021.
WHISTLE:一种使用机器学习方法预测的人类 N6-甲基腺苷(m6A)转录组表观遗传学图谱。
Nucleic Acids Res. 2019 Apr 23;47(7):e41. doi: 10.1093/nar/gkz074.
4
Global analysis of N6-methyladenosine functions and its disease association using deep learning and network-based methods.基于深度学习和网络方法的 N6-甲基腺苷功能的全局分析及其与疾病的关联
PLoS Comput Biol. 2019 Jan 2;15(1):e1006663. doi: 10.1371/journal.pcbi.1006663. eCollection 2019 Jan.
5
DeepM6ASeq: prediction and characterization of m6A-containing sequences using deep learning.DeepM6ASeq:使用深度学习预测和描述 m6A 序列
BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):524. doi: 10.1186/s12859-018-2516-4.
6
Gene2vec: gene subsequence embedding for prediction of mammalian -methyladenosine sites from mRNA.Gene2vec:基于基因子序列的嵌体模型,用于从 mRNA 预测哺乳动物 m6A 修饰位点。
RNA. 2019 Feb;25(2):205-218. doi: 10.1261/rna.069112.118. Epub 2018 Nov 13.
7
BERMP: a cross-species classifier for predicting mA sites by integrating a deep learning algorithm and a random forest approach.BERMP:一种跨物种的 mA 位点预测分类器,它集成了深度学习算法和随机森林方法。
Int J Biol Sci. 2018 Sep 7;14(12):1669-1677. doi: 10.7150/ijbs.27819. eCollection 2018.
8
M6AMRFS: Robust Prediction of N6-Methyladenosine Sites With Sequence-Based Features in Multiple Species.M6AMRFS:利用多物种基于序列的特征对N6-甲基腺苷位点进行稳健预测
Front Genet. 2018 Oct 25;9:495. doi: 10.3389/fgene.2018.00495. eCollection 2018.
9
Imbalance learning for the prediction of N-Methylation sites in mRNAs.不平衡学习在预测 mRNAs 中的 N-甲基化位点中的应用。
BMC Genomics. 2018 Aug 1;19(1):574. doi: 10.1186/s12864-018-4928-y.
10
iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.iProt-Sub:一个全面的软件包,用于准确地映射和预测蛋白酶特异性底物和切割位点。
Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028.