• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GenoM7GNet:一种基于核苷酸语言模型的高效N-甲基鸟苷位点预测方法。

GenoM7GNet: An Efficient N-Methylguanosine Site Prediction Approach Based on a Nucleotide Language Model.

作者信息

Li Chuang, Wang Heshi, Wen Yanhua, Yin Rui, Zeng Xiangxiang, Li Keqin

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2258-2268. doi: 10.1109/TCBB.2024.3459870. Epub 2024 Dec 10.

DOI:10.1109/TCBB.2024.3459870
PMID:39302806
Abstract

N-methylguanosine (m7G), one of the mainstream post-transcriptional RNA modifications, occupies an exceedingly significant place in medical treatments. However, classic approaches for identifying m7G sites are costly both in time and equipment. Meanwhile, the existing machine learning methods extract limited hidden information from RNA sequences, thus making it difficult to improve the accuracy. Therefore, we put forward to a deep learning network, called "GenoM7GNet," for m7G site identification. This model utilizes a Bidirectional Encoder Representation from Transformers (BERT) and is pretrained on nucleotide sequences data to capture hidden patterns from RNA sequences for m7G site prediction. Moreover, through detailed comparative experiments with various deep learning models, we discovered that the one-dimensional convolutional neural network (CNN) exhibits outstanding performance in sequence feature learning and classification. The proposed GenoM7GNet model achieved 0.953in accuracy, 0.932in sensitivity, 0.976in specificity, 0.907in Matthews Correlation Coefficient and 0.984in Area Under the receiver operating characteristic Curve on performance evaluation. Extensive experimental results further prove that our GenoM7GNet model markedly surpasses other state-of-the-art models in predicting m7G sites, exhibiting high computing performance.

摘要

N-甲基鸟苷(m7G)是转录后RNA修饰的主流类型之一,在医学治疗中占据极其重要的地位。然而,传统的m7G位点识别方法在时间和设备方面成本都很高。同时,现有的机器学习方法从RNA序列中提取的隐藏信息有限,因此难以提高准确性。因此,我们提出了一种名为“GenoM7GNet”的深度学习网络用于m7G位点识别。该模型利用来自变换器的双向编码器表示(BERT),并在核苷酸序列数据上进行预训练,以捕获RNA序列中的隐藏模式用于m7G位点预测。此外,通过与各种深度学习模型进行详细的对比实验,我们发现一维卷积神经网络(CNN)在序列特征学习和分类方面表现出色。所提出的GenoM7GNet模型在性能评估中的准确率为0.953,灵敏度为0.932,特异性为0.976,马修斯相关系数为0.907,受试者工作特征曲线下面积为0.984。大量实验结果进一步证明,我们的GenoM7GNet模型在预测m7G位点方面明显优于其他现有最先进的模型,展现出较高的计算性能。

相似文献

1
GenoM7GNet: An Efficient N-Methylguanosine Site Prediction Approach Based on a Nucleotide Language Model.GenoM7GNet:一种基于核苷酸语言模型的高效N-甲基鸟苷位点预测方法。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2258-2268. doi: 10.1109/TCBB.2024.3459870. Epub 2024 Dec 10.
2
Deep-m7G: A contrastive learning-based deep biological language model for identifying RNA N7-methylguanosine sites.深度m7G:一种基于对比学习的用于识别RNA N7-甲基鸟苷位点的深度生物语言模型。
Int J Biol Macromol. 2025 Jul;318(Pt 4):145341. doi: 10.1016/j.ijbiomac.2025.145341. Epub 2025 Jun 18.
3
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
4
A deep learning model for predicting systemic lupus erythematosus-associated epitopes.一种用于预测系统性红斑狼疮相关表位的深度学习模型。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):230. doi: 10.1186/s12911-025-03056-x.
5
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
6
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
7
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
8
Short-Term Memory Impairment短期记忆障碍
9
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
10
EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。
Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.

引用本文的文献

1
A comprehensive in silico and invitro analysis revealed the diagnostic, prognostic and therapeutic potential of GNAI family genes in colon adenocarcinoma (COAD).一项全面的计算机模拟和体外分析揭示了GNAI家族基因在结肠腺癌(COAD)中的诊断、预后和治疗潜力。
Hereditas. 2025 Aug 16;162(1):162. doi: 10.1186/s41065-025-00523-3.
2
EDNTOM: An Ensemble Learning and Weight Mechanism-Based Nanopore Methylation Detection Tool.EDNTOM:一种基于集成学习和权重机制的纳米孔甲基化检测工具。
ACS Omega. 2025 Jul 23;10(30):33031-33044. doi: 10.1021/acsomega.5c01924. eCollection 2025 Aug 5.
3
AttnW2V-Enhancer: Leveraging attention and Word2Vec for enhanced enhancer prediction.
注意力加权词向量增强器:利用注意力机制和词向量进行增强的增强子预测。
Comput Struct Biotechnol J. 2025 Jul 23;27:3275-3284. doi: 10.1016/j.csbj.2025.07.008. eCollection 2025.
4
Exploring the Role of Artificial Intelligence in Smart Healthcare: A Capability and Function-Oriented Review.探索人工智能在智能医疗中的作用:一项基于能力和功能的综述。
Healthcare (Basel). 2025 Jul 8;13(14):1642. doi: 10.3390/healthcare13141642.
5
EnsembleNPPred: A Robust Approach to Neuropeptide Prediction and Recognition Using Ensemble Machine Learning and Deep Learning Methods.集成神经肽预测:一种使用集成机器学习和深度学习方法进行神经肽预测与识别的稳健方法。
Life (Basel). 2025 Jun 25;15(7):1010. doi: 10.3390/life15071010.
6
Transforming Hemophilia Management: Lessons from Gene Therapy Clinical Trials.血友病治疗变革:基因治疗临床试验的经验教训
Mol Biotechnol. 2025 Jun 30. doi: 10.1007/s12033-025-01464-y.
7
Optimized deep learning approach for lung cancer detection using flying fox optimization and bidirectional generative adversarial networks.使用狐蝠优化算法和双向生成对抗网络的肺癌检测优化深度学习方法。
PeerJ Comput Sci. 2025 May 27;11:e2853. doi: 10.7717/peerj-cs.2853. eCollection 2025.
8
2OM-Pred: prediction of 2-O-methylation sites in ribonucleic acid using diverse classifiers.2OM-Pred:使用多种分类器预测核糖核酸中的2-O-甲基化位点。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf282.
9
A novel clustered-based binary grey wolf optimizer to solve the feature selection problem for uncovering the genetic links between non-Hodgkin lymphomas and rheumatologic diseases.一种基于聚类的新型二进制灰狼优化器,用于解决特征选择问题,以揭示非霍奇金淋巴瘤与风湿性疾病之间的遗传联系。
Health Inf Sci Syst. 2025 May 2;13(1):34. doi: 10.1007/s13755-025-00350-w. eCollection 2025 Dec.
10
Integrative bioinformatics analysis of high-throughput sequencing and in vitro functional analysis leads to uncovering key hub genes in esophageal squamous cell carcinoma.高通量测序的整合生物信息学分析与体外功能分析有助于揭示食管鳞状细胞癌中的关键枢纽基因。
Hereditas. 2025 Mar 14;162(1):38. doi: 10.1186/s41065-025-00398-4.