• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DCNN-4mC:基于密集连接神经网络的多物种N4-甲基胞嘧啶位点预测

DCNN-4mC: Densely connected neural network based N4-methylcytosine site prediction in multiple species.

作者信息

Rehman Mobeen Ur, Tayara Hilal, Chong Kil To

机构信息

Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea.

Department of Avionics Engineering, Air University, Islamabad 44000, Pakistan.

出版信息

Comput Struct Biotechnol J. 2021 Nov 1;19:6009-6019. doi: 10.1016/j.csbj.2021.10.034. eCollection 2021.

DOI:10.1016/j.csbj.2021.10.034
PMID:34849205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8605313/
Abstract

DNA N4-methylcytosine (4mC) being a significant genetic modification holds a dominant role in controlling different biological functions, i.e., DNA replication, DNA repair, gene regulations and gene expression levels. The identification of 4mC sites is important to get insight information regarding different organics mechanisms. However, getting modification prediction from experimental methods is a challenging task due to high expenses and time-consuming techniques. Therefore, computational tools can be a great option for modification identification. Various computational tools are proposed in literature but their generalization and prediction performance require improvement. For this motive, we have proposed a neural network based tool named DCNN-4mC for identifying 4mC sites. The proposed model involves a set of neural network layers with a skip connection which allows to share the shallow features with dense layers. Skip connection have allowed to gather crucial information regarding 4mC sites. In literature, different models are employed on different species hence in many cases different datasets are available for a single species. In this research, we have combined all available datasets to create a single benchmark dataset for every species. To the best of our knowledge, no model in literature is employed on more than six different species. To ensure the generalizability of DCNN-4mC we have used 12 different species for performance evaluation. The DCNN-4mC tool has attained 2% to 14% higher accuracy than state-of-the-art tools on all available datasets of different species. Furthermore, independent test datasets are also engaged and DCNN-4mC have overall yielded high performance in them as well.

摘要

DNA N4-甲基胞嘧啶(4mC)作为一种重要的基因修饰,在控制不同生物学功能(即DNA复制、DNA修复、基因调控和基因表达水平)中起着主导作用。识别4mC位点对于深入了解不同的有机机制很重要。然而,由于实验方法成本高且技术耗时,通过实验方法进行修饰预测是一项具有挑战性的任务。因此,计算工具可能是修饰识别的一个很好的选择。文献中提出了各种计算工具,但其泛化能力和预测性能仍需改进。出于这个目的,我们提出了一种基于神经网络的工具DCNN-4mC来识别4mC位点。所提出的模型包含一组带有跳跃连接的神经网络层,这允许与密集层共享浅层特征。跳跃连接有助于收集有关4mC位点的关键信息。在文献中,不同的模型应用于不同的物种,因此在许多情况下,单个物种有不同的数据集。在本研究中,我们将所有可用数据集合并,为每个物种创建了一个单一的基准数据集。据我们所知,文献中没有模型应用于超过六个不同的物种。为确保DCNN-4mC的泛化能力,我们使用了12个不同的物种进行性能评估。在不同物种的所有可用数据集上,DCNN-4mC工具比现有工具的准确率高出2%至14%。此外,我们还使用了独立测试数据集,DCNN-4mC在这些数据集中也总体表现出高性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/7e3bcc26e6fd/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/0649b5d07f0e/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/962043e28b69/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/b5a6dc7127a5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/12a7487338eb/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/cff6b2eb4a4e/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/7e3bcc26e6fd/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/0649b5d07f0e/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/962043e28b69/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/b5a6dc7127a5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/12a7487338eb/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/cff6b2eb4a4e/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/627a/8605313/7e3bcc26e6fd/gr5.jpg

相似文献

1
DCNN-4mC: Densely connected neural network based N4-methylcytosine site prediction in multiple species.DCNN-4mC:基于密集连接神经网络的多物种N4-甲基胞嘧啶位点预测
Comput Struct Biotechnol J. 2021 Nov 1;19:6009-6019. doi: 10.1016/j.csbj.2021.10.034. eCollection 2021.
2
Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.Deep4mC:通过深度学习对 DNA N4-甲基胞嘧啶位点进行系统评估和计算预测。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa099.
3
Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation.使用基于分布式特征表示的深度学习模型识别蔷薇科基因组中的DNA N4-甲基胞嘧啶位点。
Comput Struct Biotechnol J. 2021 Mar 19;19:1612-1619. doi: 10.1016/j.csbj.2021.03.015. eCollection 2021.
4
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction.Hyb4mC:一种基于 DNA2vec 的混合模型,用于预测 DNA N4-甲基胞嘧啶位点。
BMC Bioinformatics. 2022 Jun 29;23(1):258. doi: 10.1186/s12859-022-04789-6.
5
A Deep Neural Network for Identifying DNA N4-Methylcytosine Sites.用于识别DNA N4-甲基胞嘧啶位点的深度神经网络
Front Genet. 2020 Mar 6;11:209. doi: 10.3389/fgene.2020.00209. eCollection 2020.
6
Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.Meta-4mCpred:一种基于序列的元预测器,用于通过有效特征表示准确预测DNA 4mC位点。
Mol Ther Nucleic Acids. 2019 Jun 7;16:733-744. doi: 10.1016/j.omtn.2019.04.019. Epub 2019 Apr 30.
7
DeepTorrent: a deep learning-based approach for predicting DNA N4-methylcytosine sites.DeepTorrent:一种基于深度学习的方法,用于预测 DNA N4-甲基胞嘧啶位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa124.
8
Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.基于深度学习的DNA N4-甲基胞嘧啶位点的系统分析与准确识别
Front Microbiol. 2022 Mar 15;13:843425. doi: 10.3389/fmicb.2022.843425. eCollection 2022.
9
4mC-CGRU: Identification of N4-Methylcytosine (4mC) sites using convolution gated recurrent unit in Rosaceae genome.利用卷积门控循环单元在蔷薇科基因组中识别 N4-甲基胞嘧啶(4mC)位点。
Comput Biol Chem. 2023 Dec;107:107974. doi: 10.1016/j.compbiolchem.2023.107974. Epub 2023 Oct 30.
10
DNA4mC-LIP: a linear integration method to identify N4-methylcytosine site in multiple species.DNA4mC-LIP:一种在多个物种中鉴定 N4-甲基胞嘧啶位点的线性整合方法。
Bioinformatics. 2020 Jun 1;36(11):3327-3335. doi: 10.1093/bioinformatics/btaa143.

引用本文的文献

1
CpGFuse: a holistic approach for accurate identification of methylation states of DNA CpG sites.CpGFuse:一种用于准确识别DNA CpG位点甲基化状态的整体方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf063.
2
Composite Ensemble Learning Framework for Passive Drone Radio Frequency Fingerprinting in Sixth-Generation Networks.用于第六代网络中无源无人机射频指纹识别的复合集成学习框架
Sensors (Basel). 2024 Aug 29;24(17):5618. doi: 10.3390/s24175618.
3
DRANetSplicer: A Splice Site Prediction Model Based on Deep Residual Attention Networks.

本文引用的文献

1
The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation.在二分类混淆矩阵评估中,马修斯相关系数(MCC)比平衡准确率、庄家知情度和标记度更可靠。
BioData Min. 2021 Feb 4;14(1):13. doi: 10.1186/s13040-021-00244-z.
2
DNA6mA-MINT: DNA-6mA Modification Identification Neural Tool.DNA6mA-MINT:DNA-6mA 修饰识别神经工具。
Genes (Basel). 2020 Aug 5;11(8):898. doi: 10.3390/genes11080898.
3
DeepTorrent: a deep learning-based approach for predicting DNA N4-methylcytosine sites.
DRANetSplicer:一种基于深度残差注意力网络的剪接位点预测模型。
Genes (Basel). 2024 Mar 26;15(4):404. doi: 10.3390/genes15040404.
4
DLC-ac4C: A Prediction Model for N4-acetylcytidine Sites in Human mRNA Based on DenseNet and Bidirectional LSTM Methods.DLC-ac4C:一种基于密集连接网络(DenseNet)和双向长短期记忆网络(Bidirectional LSTM)方法的人类mRNA中N4-乙酰胞苷位点预测模型
Curr Genomics. 2023 Nov 22;24(3):171-186. doi: 10.2174/0113892029270191231013111911.
5
iCpG-Pos: an accurate computational approach for identification of CpG sites using positional features on single-cell whole genome sequence data.iCpG-Pos:一种使用单细胞全基因组序列数据上的位置特征来识别 CpG 位点的准确计算方法。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad474.
6
EMDL-ac4C: identifying N4-acetylcytidine based on ensemble two-branch residual connection DenseNet and attention.EMDL-ac4C:基于集成双分支残差连接密集网络和注意力机制识别N4-乙酰胞苷
Front Genet. 2023 Jul 13;14:1232038. doi: 10.3389/fgene.2023.1232038. eCollection 2023.
7
A Grid Search-Based Multilayer Dynamic Ensemble System to Identify DNA N4-Methylcytosine Using Deep Learning Approach.基于网格搜索的多层动态集成系统,利用深度学习方法识别 DNA N4-甲基胞嘧啶。
Genes (Basel). 2023 Feb 25;14(3):582. doi: 10.3390/genes14030582.
8
Attention-Based Graph Neural Network for Molecular Solubility Prediction.基于注意力机制的图神经网络用于分子溶解度预测。
ACS Omega. 2023 Jan 12;8(3):3236-3244. doi: 10.1021/acsomega.2c06702. eCollection 2023 Jan 24.
9
Dynamic regulation and key roles of ribonucleic acid methylation.核糖核酸甲基化的动态调控及关键作用
Front Cell Neurosci. 2022 Dec 19;16:1058083. doi: 10.3389/fncel.2022.1058083. eCollection 2022.
10
XML-CIMT: Explainable Machine Learning (XML) Model for Predicting Chemical-Induced Mitochondrial Toxicity.XML-CIMT:用于预测化学诱导的线粒体毒性的可解释机器学习(XML)模型。
Int J Mol Sci. 2022 Dec 9;23(24):15655. doi: 10.3390/ijms232415655.
DeepTorrent:一种基于深度学习的方法,用于预测 DNA N4-甲基胞嘧啶位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa124.
4
Developing a Multi-Layer Deep Learning Based Predictive Model to Identify DNA N4-Methylcytosine Modifications.开发一种基于多层深度学习的预测模型以识别DNA N4-甲基胞嘧啶修饰。
Front Bioeng Biotechnol. 2020 Apr 21;8:274. doi: 10.3389/fbioe.2020.00274. eCollection 2020.
5
iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.iDNA-MS:一种用于检测多个基因组中DNA修饰位点的综合计算工具。
iScience. 2020 Apr 24;23(4):100991. doi: 10.1016/j.isci.2020.100991. Epub 2020 Mar 19.
6
A Deep Neural Network for Identifying DNA N4-Methylcytosine Sites.用于识别DNA N4-甲基胞嘧啶位点的深度神经网络
Front Genet. 2020 Mar 6;11:209. doi: 10.3389/fgene.2020.00209. eCollection 2020.
7
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
8
i4mC-ROSE, a bioinformatics tool for the identification of DNA N4-methylcytosine sites in the Rosaceae genome.i4mC-ROSE,一种用于鉴定蔷薇科基因组中 DNA N4-甲基胞嘧啶位点的生物信息学工具。
Int J Biol Macromol. 2020 Aug 15;157:752-758. doi: 10.1016/j.ijbiomac.2019.12.009. Epub 2019 Dec 2.
9
4mCpred-EL: An Ensemble Learning Framework for Identification of DNA -methylcytosine Sites in the Mouse Genome.4mCpred-EL:用于鉴定小鼠基因组中 DNA-甲基胞嘧啶位点的集成学习框架。
Cells. 2019 Oct 28;8(11):1332. doi: 10.3390/cells8111332.
10
-Cytosine DNA Methylation Is Involved in the Maintenance of Genomic Stability in .- 胞嘧啶DNA甲基化参与维持……中的基因组稳定性
Front Microbiol. 2019 Aug 21;10:1905. doi: 10.3389/fmicb.2019.01905. eCollection 2019.