• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别DNA N4-甲基胞嘧啶位点的深度神经网络

A Deep Neural Network for Identifying DNA N4-Methylcytosine Sites.

作者信息

Zeng Feng, Fang Guanyun, Yao Lan

机构信息

School of Computer Science and Engineering, Central South University, Changsha, China.

College of Mathematics and Econometrics, Hunan University, Changsha, China.

出版信息

Front Genet. 2020 Mar 6;11:209. doi: 10.3389/fgene.2020.00209. eCollection 2020.

DOI:10.3389/fgene.2020.00209
PMID:32211035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7067889/
Abstract

N4-methylcytosine (4mC) plays an important role in host defense and transcriptional regulation. Accurate identification of 4mc sites provides a more comprehensive understanding of its biological effects. At present, the traditional machine learning algorithms are used in the research on 4mC sites prediction, but the complexity of the algorithms is relatively high, which is not suitable for the processing of large data sets, and the accuracy of prediction needs to be improved. Therefore, it is necessary to develop a new and effective method to accurately identify 4mC sites. In this work, we found a large number of 4mC sites and non 4mC sites of () from the latest MethSMRT website, which greatly expanded the dataset of , and developed a hybrid deep neural network framework named 4mcDeep-CBI, aiming to identify 4mC sites. In order to obtain the high latitude information of the feature, we input the preliminary extracted features into the Convolutional Neural Network (CNN) and Bidirectional Long Short Term Memory network (BLSTM) to generate advanced features. Taking the advanced features as algorithm input, we have proposed an integrated algorithm to improve feature representation. Experimental results on large new dataset show that the proposed predictor is able to achieve generally better performance in identifying 4mC sites as compared to the state-of-art predictor. Notably, this is the first study of identifying 4mC sites using deep neural network. Moreover, our model runs much faster than the state-of-art predictor.

摘要

N4-甲基胞嘧啶(4mC)在宿主防御和转录调控中发挥着重要作用。准确识别4mC位点有助于更全面地了解其生物学效应。目前,传统机器学习算法被用于4mC位点预测研究,但算法复杂度较高,不适用于大数据集处理,预测准确性有待提高。因此,有必要开发一种新的有效方法来准确识别4mC位点。在这项工作中,我们从最新的MethSMRT网站上找到了大量()的4mC位点和非4mC位点,极大地扩展了()的数据集,并开发了一种名为4mcDeep-CBI的混合深度神经网络框架,旨在识别4mC位点。为了获取特征的高纬度信息,我们将初步提取的特征输入到卷积神经网络(CNN)和双向长短期记忆网络(BLSTM)中以生成高级特征。以高级特征作为算法输入,我们提出了一种集成算法来改进特征表示。在大型新数据集上的实验结果表明,与现有最佳预测器相比,所提出的预测器在识别4mC位点方面通常能够取得更好的性能。值得注意的是,这是首次使用深度神经网络识别4mC位点的研究。此外,我们的模型运行速度比现有最佳预测器快得多。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/22ede3786304/fgene-11-00209-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/d740edd80073/fgene-11-00209-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/f1c94cc21a19/fgene-11-00209-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/4922ef52327b/fgene-11-00209-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/8a418e0cba40/fgene-11-00209-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/2d256a3bf53b/fgene-11-00209-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/de33c3af1a5a/fgene-11-00209-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/22ede3786304/fgene-11-00209-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/d740edd80073/fgene-11-00209-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/f1c94cc21a19/fgene-11-00209-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/4922ef52327b/fgene-11-00209-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/8a418e0cba40/fgene-11-00209-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/2d256a3bf53b/fgene-11-00209-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/de33c3af1a5a/fgene-11-00209-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f41c/7067889/22ede3786304/fgene-11-00209-g0007.jpg

相似文献

1
A Deep Neural Network for Identifying DNA N4-Methylcytosine Sites.用于识别DNA N4-甲基胞嘧啶位点的深度神经网络
Front Genet. 2020 Mar 6;11:209. doi: 10.3389/fgene.2020.00209. eCollection 2020.
2
Developing a Multi-Layer Deep Learning Based Predictive Model to Identify DNA N4-Methylcytosine Modifications.开发一种基于多层深度学习的预测模型以识别DNA N4-甲基胞嘧啶修饰。
Front Bioeng Biotechnol. 2020 Apr 21;8:274. doi: 10.3389/fbioe.2020.00274. eCollection 2020.
3
Iterative feature representations improve N4-methylcytosine site prediction.迭代特征表示可提高 N4-甲基胞嘧啶位点预测的准确性。
Bioinformatics. 2019 Dec 1;35(23):4930-4937. doi: 10.1093/bioinformatics/btz408.
4
DeepTorrent: a deep learning-based approach for predicting DNA N4-methylcytosine sites.DeepTorrent:一种基于深度学习的方法,用于预测 DNA N4-甲基胞嘧啶位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa124.
5
Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation.使用基于分布式特征表示的深度学习模型识别蔷薇科基因组中的DNA N4-甲基胞嘧啶位点。
Comput Struct Biotechnol J. 2021 Mar 19;19:1612-1619. doi: 10.1016/j.csbj.2021.03.015. eCollection 2021.
6
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.探索基于序列的特征,以提高在多个物种中预测 DNA N4-甲基胞嘧啶位点的能力。
Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824.
7
Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.Deep4mC:通过深度学习对 DNA N4-甲基胞嘧啶位点进行系统评估和计算预测。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa099.
8
i4mC-Deep: An Intelligent Predictor of N4-Methylcytosine Sites Using a Deep Learning Approach with Chemical Properties.i4mC-Deep:一种基于深度学习方法并结合化学性质预测 N4-甲基胞嘧啶位点的智能预测器。
Genes (Basel). 2021 Jul 23;12(8):1117. doi: 10.3390/genes12081117.
9
DRSN4mCPred: accurately predicting sites of DNA N4-methylcytosine using deep residual shrinkage network for diagnosis and treatment of gastrointestinal cancer in the precision medicine era.DRSN4mCPred:在精准医学时代,使用深度残差收缩网络准确预测DNA N4-甲基胞嘧啶位点以用于胃肠道癌的诊断和治疗。
Front Med (Lausanne). 2023 May 4;10:1187430. doi: 10.3389/fmed.2023.1187430. eCollection 2023.
10
4mC-CGRU: Identification of N4-Methylcytosine (4mC) sites using convolution gated recurrent unit in Rosaceae genome.利用卷积门控循环单元在蔷薇科基因组中识别 N4-甲基胞嘧啶(4mC)位点。
Comput Biol Chem. 2023 Dec;107:107974. doi: 10.1016/j.compbiolchem.2023.107974. Epub 2023 Oct 30.

引用本文的文献

1
DRSN4mCPred: accurately predicting sites of DNA N4-methylcytosine using deep residual shrinkage network for diagnosis and treatment of gastrointestinal cancer in the precision medicine era.DRSN4mCPred:在精准医学时代,使用深度残差收缩网络准确预测DNA N4-甲基胞嘧啶位点以用于胃肠道癌的诊断和治疗。
Front Med (Lausanne). 2023 May 4;10:1187430. doi: 10.3389/fmed.2023.1187430. eCollection 2023.
2
Systematic Analysis and Accurate Identification of DNA N4-Methylcytosine Sites by Deep Learning.基于深度学习的DNA N4-甲基胞嘧啶位点的系统分析与准确识别
Front Microbiol. 2022 Mar 15;13:843425. doi: 10.3389/fmicb.2022.843425. eCollection 2022.
3

本文引用的文献

1
LDAH2V: Exploring Meta-Paths Across Multiple Networks for lncRNA-Disease Association Prediction.LDAH2V:跨多个网络探索元路径用于长链非编码RNA-疾病关联预测
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1572-1581. doi: 10.1109/TCBB.2019.2946257. Epub 2021 Aug 6.
2
DrugCombDB: a comprehensive database of drug combinations toward the discovery of combinatorial therapy.DrugCombDB:一个综合性的药物组合数据库,旨在发现组合疗法。
Nucleic Acids Res. 2020 Jan 8;48(D1):D871-D881. doi: 10.1093/nar/gkz1007.
3
Iterative feature representations improve N4-methylcytosine site prediction.
DCNN-4mC: Densely connected neural network based N4-methylcytosine site prediction in multiple species.
DCNN-4mC:基于密集连接神经网络的多物种N4-甲基胞嘧啶位点预测
Comput Struct Biotechnol J. 2021 Nov 1;19:6009-6019. doi: 10.1016/j.csbj.2021.10.034. eCollection 2021.
4
Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation.使用基于分布式特征表示的深度学习模型识别蔷薇科基因组中的DNA N4-甲基胞嘧啶位点。
Comput Struct Biotechnol J. 2021 Mar 19;19:1612-1619. doi: 10.1016/j.csbj.2021.03.015. eCollection 2021.
5
DNA sequences performs as natural language processing by exploiting deep learning algorithm for the identification of N4-methylcytosine.DNA 序列通过利用深度学习算法进行自然语言处理,用于识别 N4-甲基胞嘧啶。
Sci Rep. 2021 Jan 8;11(1):212. doi: 10.1038/s41598-020-80430-x.
6
Empirical Comparison and Analysis of Web-Based DNA -Methylcytosine Site Prediction Tools.基于网络的DNA甲基胞嘧啶位点预测工具的实证比较与分析
Mol Ther Nucleic Acids. 2020 Sep 16;22:406-420. doi: 10.1016/j.omtn.2020.09.010. eCollection 2020 Dec 4.
7
DNC4mC-Deep: Identification and Analysis of DNA N4-Methylcytosine Sites Based on Different Encoding Schemes By Using Deep Learning.DNC4mC-Deep:基于深度学习的不同编码方案识别和分析 DNA N4-甲基胞嘧啶位点。
Cells. 2020 Jul 22;9(8):1756. doi: 10.3390/cells9081756.
8
iMethyl-Deep: N6 Methyladenosine Identification of Yeast Genome with Automatic Feature Extraction Technique by Using Deep Learning Algorithm.iMethyl-Deep:利用深度学习算法通过自动特征提取技术鉴定酵母基因组中的 N6 甲基腺苷。
Genes (Basel). 2020 May 9;11(5):529. doi: 10.3390/genes11050529.
9
i4mC-Mouse: Improved identification of DNA N4-methylcytosine sites in the mouse genome using multiple encoding schemes.i4mC-Mouse:使用多种编码方案改进对小鼠基因组中DNA N4-甲基胞嘧啶位点的识别。
Comput Struct Biotechnol J. 2020 Apr 8;18:906-912. doi: 10.1016/j.csbj.2020.04.001. eCollection 2020.
迭代特征表示可提高 N4-甲基胞嘧啶位点预测的准确性。
Bioinformatics. 2019 Dec 1;35(23):4930-4937. doi: 10.1093/bioinformatics/btz408.
4
Enhanced Prediction of Hot Spots at Protein-Protein Interfaces Using Extreme Gradient Boosting.利用极端梯度提升增强蛋白质-蛋白质界面热点预测。
Sci Rep. 2018 Sep 24;8(1):14285. doi: 10.1038/s41598-018-32511-1.
5
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.探索基于序列的特征,以提高在多个物种中预测 DNA N4-甲基胞嘧啶位点的能力。
Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824.
6
4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction.4mCPred:用于 DNA N4-甲基胞嘧啶位点预测的机器学习方法。
Bioinformatics. 2019 Feb 15;35(4):593-601. doi: 10.1093/bioinformatics/bty668.
7
Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.通过结合局部和全局深度卷积神经网络预测 RNA 与蛋白质的结合位点和基序。
Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.
8
Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.使用集成方法计算蛋白质 - RNA 复合物中结合能热点。
Bioinformatics. 2018 May 1;34(9):1473-1480. doi: 10.1093/bioinformatics/btx822.
9
iDNA4mC: identifying DNA N4-methylcytosine sites based on nucleotide chemical properties.iDNA4mC:基于核苷酸化学性质鉴定 DNA N4-甲基胞嘧啶位点。
Bioinformatics. 2017 Nov 15;33(22):3518-3523. doi: 10.1093/bioinformatics/btx479.
10
MethSMRT: an integrative database for DNA N6-methyladenine and N4-methylcytosine generated by single-molecular real-time sequencing.MethSMRT:一个通过单分子实时测序生成的用于DNA N6-甲基腺嘌呤和N4-甲基胞嘧啶的综合数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D85-D89. doi: 10.1093/nar/gkw950. Epub 2016 Oct 18.