• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合多种特征来鉴定 N7-甲基鸟苷位点。

Identifying N7-methylguanosine sites by integrating multiple features.

机构信息

School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China.

出版信息

Biopolymers. 2022 Feb;113(2):e23480. doi: 10.1002/bip.23480. Epub 2021 Oct 28.

DOI:10.1002/bip.23480
PMID:34709657
Abstract

Recent studies reported that N7-methylguanosine (m7G) plays a vital role in gene expression regulation. As a consequence, determining the distribution of m7G is a crucial step towards further understanding its biological functions. Although biological experimental approaches are capable of accurately locating m7G sites, they are labor-intensive, costly, and time-consuming. Therefore, it is necessary to develop more effective and robust computational methods to replace, or at least complement current experimental methods. In this study, we developed a novel sequence-based computational tool to identify RNA m7G sites. In this model, 22 kinds of dinucleotide physicochemical (PC) properties were employed to encode the RNA sequence. Three types of descriptors, including auto-covariance, cross-covariance, and discrete wavelet transform were adopted to extract effective features from the PC matrix. The least absolute shrinkage and selection operator (LASSO) algorithm was utilized to reduce the influence of irrelevant or redundant features. Finally, these selected features were fed into a support vector machine (SVM) for distinguishing m7G from non-m7G sites. The proposed method significantly outperforms existing predictors across all evaluation metrics. It indicates that the approach is effective in identifying RNA m7G sites.

摘要

最近的研究表明,N7-甲基鸟苷(m7G)在基因表达调控中起着至关重要的作用。因此,确定 m7G 的分布是进一步了解其生物学功能的关键步骤。虽然生物实验方法能够准确地定位 m7G 位点,但它们费时、费力且成本高昂。因此,有必要开发更有效、更强大的计算方法来替代,或者至少补充现有的实验方法。在这项研究中,我们开发了一种新的基于序列的计算工具来识别 RNA m7G 位点。在这个模型中,使用了 22 种二核苷酸物理化学(PC)特性来编码 RNA 序列。采用了三种描述符,包括自协方差、互协方差和离散小波变换,从 PC 矩阵中提取有效特征。最小绝对收缩和选择算子(LASSO)算法用于减少不相关或冗余特征的影响。最后,这些选定的特征被输入支持向量机(SVM)中,以区分 m7G 和非 m7G 位点。该方法在所有评估指标上均显著优于现有预测器。这表明该方法在识别 RNA m7G 位点方面是有效的。

相似文献

1
Identifying N7-methylguanosine sites by integrating multiple features.通过整合多种特征来鉴定 N7-甲基鸟苷位点。
Biopolymers. 2022 Feb;113(2):e23480. doi: 10.1002/bip.23480. Epub 2021 Oct 28.
2
m7G-DPP: Identifying N7-methylguanosine sites based on dinucleotide physicochemical properties of RNA.m7G-DPP:基于RNA二核苷酸理化性质识别N7-甲基鸟苷位点。
Biophys Chem. 2021 Dec;279:106697. doi: 10.1016/j.bpc.2021.106697. Epub 2021 Oct 5.
3
Iterative feature representation algorithm to improve the predictive performance of N7-methylguanosine sites.迭代特征表示算法提高 N7-甲基鸟苷位点的预测性能。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa278.
4
Prediction of N7-methylguanosine sites in human RNA based on optimal sequence features.基于最优序列特征预测人 RNA 中的 N7-甲基鸟苷位点。
Genomics. 2020 Nov;112(6):4342-4347. doi: 10.1016/j.ygeno.2020.07.035. Epub 2020 Jul 25.
5
iDHS-DT: Identifying DNase I hypersensitive sites by integrating DNA dinucleotide and trinucleotide information.iDHS-DT:通过整合DNA二核苷酸和三核苷酸信息识别DNase I超敏位点。
Biophys Chem. 2022 Feb;281:106717. doi: 10.1016/j.bpc.2021.106717. Epub 2021 Nov 14.
6
BERT-m7G: A Transformer Architecture Based on BERT and Stacking Ensemble to Identify RNA N7-Methylguanosine Sites from Sequence Information.BERT-m7G:一种基于 BERT 和堆叠集成的转换器架构,用于从序列信息中识别 RNA N7-甲基鸟苷位点。
Comput Math Methods Med. 2021 Aug 25;2021:7764764. doi: 10.1155/2021/7764764. eCollection 2021.
7
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites.THRONE:一种准确预测人类 RNA N7-甲基鸟苷位点的新方法。
J Mol Biol. 2022 Jun 15;434(11):167549. doi: 10.1016/j.jmb.2022.167549. Epub 2022 Mar 16.
8
m7GPredictor: An improved machine learning-based model for predicting internal m7G modifications using sequence properties.m7GPredictor:一种基于机器学习的改进模型,用于使用序列特性预测内部 m7G 修饰。
Anal Biochem. 2020 Nov 15;609:113905. doi: 10.1016/j.ab.2020.113905. Epub 2020 Aug 14.
9
An Interpretable Prediction Model for Identifying N-Methylguanosine Sites Based on XGBoost and SHAP.一种基于XGBoost和SHAP的用于识别N-甲基鸟苷位点的可解释预测模型。
Mol Ther Nucleic Acids. 2020 Aug 25;22:362-372. doi: 10.1016/j.omtn.2020.08.022. eCollection 2020 Dec 4.
10
iRNA5hmC-HOC: High-order correlation information for identifying RNA 5-hydroxymethylcytosine modification.iRNA5hmC-HOC:用于鉴定 RNA 5-羟甲基胞嘧啶修饰的高阶相关信息。
J Bioinform Comput Biol. 2022 Aug;20(4):2250017. doi: 10.1142/S0219720022500172. Epub 2022 Aug 3.

引用本文的文献

1
m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features.m5CRegpred:基于测序特征的 5-甲基胞嘧啶(m5C)调控因子的转录组靶标预测。
Genes (Basel). 2022 Apr 12;13(4):677. doi: 10.3390/genes13040677.