• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度学习和判别特征整合进行N6-甲基腺嘌呤识别

N6-methyladenine identification using deep learning and discriminative feature integration.

作者信息

Khan Salman, Uddin Islam, Noor Sumaiya, AlQahtani Salman A, Ahmad Nijad

机构信息

Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan.

Business and Management Sciences Department, Purdue University, West Lafayette, IN, USA.

出版信息

BMC Med Genomics. 2025 Mar 29;18(1):58. doi: 10.1186/s12920-025-02131-6.

DOI:10.1186/s12920-025-02131-6
PMID:40158097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11955129/
Abstract

N6-methyladenine (6 mA) is a pivotal DNA modification that plays a crucial role in epigenetic regulation, gene expression, and various biological processes. With advancements in sequencing technologies and computational biology, there is an increasing focus on developing accurate methods for 6 mA site identification to enhance early detection and understand its biological significance. Despite the rapid progress of machine learning in bioinformatics, accurately detecting 6 mA sites remains a challenge due to the limited generalizability and efficiency of existing approaches. In this study, we present Deep-N6mA, a novel Deep Neural Network (DNN) model incorporating optimal hybrid features for precise 6 mA site identification. The proposed framework captures complex patterns from DNA sequences through a comprehensive feature extraction process, leveraging k-mer, Dinucleotide-based Cross Covariance (DCC), Trinucleotide-based Auto Covariance (TAC), Pseudo Single Nucleotide Composition (PseSNC), Pseudo Dinucleotide Composition (PseDNC), and Pseudo Trinucleotide Composition (PseTNC). To optimize computational efficiency and eliminate irrelevant or noisy features, an unsupervised Principal Component Analysis (PCA) algorithm is employed, ensuring the selection of the most informative features. A multilayer DNN serves as the classification algorithm to identify N6-methyladenine sites accurately. The robustness and generalizability of Deep-N6mA were rigorously validated using fivefold cross-validation on two benchmark datasets. Experimental results reveal that Deep-N6mA achieves an average accuracy of 97.70% on the F. vesca dataset and 95.75% on the R. chinensis dataset, outperforming existing methods by 4.12% and 4.55%, respectively. These findings underscore the effectiveness of Deep-N6mA as a reliable tool for early 6 mA site detection, contributing to epigenetic research and advancing the field of computational biology.

摘要

N6-甲基腺嘌呤(6mA)是一种关键的DNA修饰,在表观遗传调控、基因表达和各种生物过程中发挥着至关重要的作用。随着测序技术和计算生物学的进步,人们越来越关注开发准确的6mA位点识别方法,以加强早期检测并了解其生物学意义。尽管机器学习在生物信息学方面取得了快速进展,但由于现有方法的通用性和效率有限,准确检测6mA位点仍然是一项挑战。在本研究中,我们提出了Deep-N6mA,这是一种新型的深度神经网络(DNN)模型,它结合了最优混合特征以精确识别6mA位点。所提出的框架通过全面的特征提取过程从DNA序列中捕获复杂模式,利用k-mer、基于二核苷酸的交叉协方差(DCC)、基于三核苷酸的自协方差(TAC)、伪单核苷酸组成(PseSNC)、伪二核苷酸组成(PseDNC)和伪三核苷酸组成(PseTNC)。为了优化计算效率并消除不相关或有噪声的特征,采用了无监督主成分分析(PCA)算法,确保选择最具信息性的特征。多层DNN用作分类算法以准确识别N6-甲基腺嘌呤位点。使用两个基准数据集进行五折交叉验证,对Deep-N6mA的稳健性和通用性进行了严格验证。实验结果表明,Deep-N6mA在野草莓数据集上的平均准确率达到97.70%,在中华猕猴桃数据集上达到95.75%,分别比现有方法高出4.12%和4.55%。这些发现强调了Deep-N6mA作为早期6mA位点检测可靠工具的有效性,有助于表观遗传学研究并推动计算生物学领域的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/940a1a4dcdc6/12920_2025_2131_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/37b3f1ca9e32/12920_2025_2131_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/a429c7ce7288/12920_2025_2131_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/b110a57e9790/12920_2025_2131_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/c921c8c0ee60/12920_2025_2131_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/6820a1cf2866/12920_2025_2131_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/44a2778dad89/12920_2025_2131_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/940a1a4dcdc6/12920_2025_2131_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/37b3f1ca9e32/12920_2025_2131_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/a429c7ce7288/12920_2025_2131_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/b110a57e9790/12920_2025_2131_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/c921c8c0ee60/12920_2025_2131_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/6820a1cf2866/12920_2025_2131_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/44a2778dad89/12920_2025_2131_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f10/11955129/940a1a4dcdc6/12920_2025_2131_Fig7_HTML.jpg

相似文献

1
N6-methyladenine identification using deep learning and discriminative feature integration.利用深度学习和判别特征整合进行N6-甲基腺嘌呤识别
BMC Med Genomics. 2025 Mar 29;18(1):58. doi: 10.1186/s12920-025-02131-6.
2
Using k-mer embeddings learned from a Skip-gram based neural network for building a cross-species DNA N6-methyladenine site prediction model.利用基于 Skip-gram 的神经网络学习的 k-mer 嵌入来构建跨物种 DNA N6-甲基腺嘌呤位点预测模型。
Plant Mol Biol. 2021 Dec;107(6):533-542. doi: 10.1007/s11103-021-01204-1. Epub 2021 Nov 29.
3
Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species.Deep6mA:一个用于探索不同物种中 DNA N6-甲基腺嘌呤位点相似模式的深度学习框架。
PLoS Comput Biol. 2021 Feb 18;17(2):e1008767. doi: 10.1371/journal.pcbi.1008767. eCollection 2021 Feb.
4
A convolution based computational approach towards DNA N6-methyladenine site identification and motif extraction in rice genome.基于卷积的计算方法,用于鉴定水稻基因组中的 DNA N6-甲基腺嘌呤位点并提取其基序。
Sci Rep. 2021 May 14;11(1):10357. doi: 10.1038/s41598-021-89850-9.
5
PSSM-Sumo: deep learning based intelligent model for prediction of sumoylation sites using discriminative features.PSSM-Sumo:基于深度学习的智能模型,用于使用判别特征预测类泛素化位点。
BMC Bioinformatics. 2024 Aug 30;25(1):284. doi: 10.1186/s12859-024-05917-0.
6
i6mA-VC: A Multi-Classifier Voting Method for the Computational Identification of DNA N6-methyladenine Sites.i6mA-VC:一种用于计算鉴定 DNA N6-甲基腺嘌呤位点的多分类器投票方法。
Interdiscip Sci. 2021 Sep;13(3):413-425. doi: 10.1007/s12539-021-00429-4. Epub 2021 Apr 8.
7
i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature representation.i6mA-Fuse:通过融合多种特征表示来改进和增强蔷薇科基因组中 DNA 6mA 位点的预测
Plant Mol Biol. 2020 May;103(1-2):225-234. doi: 10.1007/s11103-020-00988-y. Epub 2020 Mar 5.
8
Deep transformers and convolutional neural network in identifying DNA N6-methyladenine sites in cross-species genomes.深度转换器和卷积神经网络在跨物种基因组中识别 DNA N6-甲基腺嘌呤位点。
Methods. 2022 Aug;204:199-206. doi: 10.1016/j.ymeth.2021.12.004. Epub 2021 Dec 13.
9
i6mA-stack: A stacking ensemble-based computational prediction of DNA N6-methyladenine (6mA) sites in the Rosaceae genome.i6mA-stack:基于堆叠集成法对蔷薇科基因组中DNA N6-甲基腺嘌呤(6mA)位点的计算预测。
Genomics. 2021 Jan;113(1 Pt 2):582-592. doi: 10.1016/j.ygeno.2020.09.054. Epub 2020 Oct 1.
10
DNN-m6A: A Cross-Species Method for Identifying RNA N6-Methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion.基于多信息融合的深度神经网络的跨物种 RNA N6-甲基腺苷位点识别方法 DNN-m6A
Genes (Basel). 2021 Feb 28;12(3):354. doi: 10.3390/genes12030354.

引用本文的文献

1
An intelligent identification for pest and disease detection in wheat leaf based on environmental data using multimodal data fusion.基于环境数据的多模态数据融合在小麦叶片病虫害检测中的智能识别
Front Plant Sci. 2025 Aug 26;16:1608515. doi: 10.3389/fpls.2025.1608515. eCollection 2025.

本文引用的文献

1
Deep-ProBind: binding protein prediction with transformer-based deep learning model.深度蛋白质结合预测:基于Transformer的深度学习模型进行结合蛋白预测。
BMC Bioinformatics. 2025 Mar 22;26(1):88. doi: 10.1186/s12859-025-06101-8.
2
XGBoost-enhanced ensemble model using discriminative hybrid features for the prediction of sumoylation sites.使用判别性混合特征的XGBoost增强集成模型用于预测SUMO化位点。
BioData Min. 2025 Feb 3;18(1):12. doi: 10.1186/s13040-024-00415-8.
3
Deep-m5U: a deep learning-based approach for RNA 5-methyluridine modification prediction using optimized feature integration.
Deep-m5U:一种基于深度学习的方法,用于使用优化的特征集成进行 RNA 5-甲基尿嘧啶修饰预测。
BMC Bioinformatics. 2024 Nov 19;25(1):360. doi: 10.1186/s12859-024-05978-1.
4
A hybrid residue based sequential encoding mechanism with XGBoost improved ensemble model for identifying 5-hydroxymethylcytosine modifications.基于残基的混合序贯编码机制与 XGBoost 改进的集成模型用于识别 5-羟甲基胞嘧啶修饰。
Sci Rep. 2024 Sep 6;14(1):20819. doi: 10.1038/s41598-024-71568-z.
5
Sequence based model using deep neural network and hybrid features for identification of 5-hydroxymethylcytosine modification.基于序列的深度学习神经网络模型和混合特征用于 5-羟甲基胞嘧啶修饰的识别。
Sci Rep. 2024 Apr 20;14(1):9116. doi: 10.1038/s41598-024-59777-y.
6
ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks.ENet-6mA:使用弹性网络和神经网络鉴定植物基因组中的 6mA 修饰位点。
Int J Mol Sci. 2022 Jul 27;23(15):8314. doi: 10.3390/ijms23158314.
7
Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species.Deep6mA:一个用于探索不同物种中 DNA N6-甲基腺嘌呤位点相似模式的深度学习框架。
PLoS Comput Biol. 2021 Feb 18;17(2):e1008767. doi: 10.1371/journal.pcbi.1008767. eCollection 2021 Feb.
8
Prediction of Recombination Spots Using Novel Hybrid Feature Extraction Method via Deep Learning Approach.通过深度学习方法使用新型混合特征提取方法预测重组位点
Front Genet. 2020 Sep 17;11:539227. doi: 10.3389/fgene.2020.539227. eCollection 2020.
9
i6mA-stack: A stacking ensemble-based computational prediction of DNA N6-methyladenine (6mA) sites in the Rosaceae genome.i6mA-stack:基于堆叠集成法对蔷薇科基因组中DNA N6-甲基腺嘌呤(6mA)位点的计算预测。
Genomics. 2021 Jan;113(1 Pt 2):582-592. doi: 10.1016/j.ygeno.2020.09.054. Epub 2020 Oct 1.
10
DNA6mA-MINT: DNA-6mA Modification Identification Neural Tool.DNA6mA-MINT:DNA-6mA 修饰识别神经工具。
Genes (Basel). 2020 Aug 5;11(8):898. doi: 10.3390/genes11080898.