• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多维特征编码和双卷积全连接卷积神经网络的 DNA 甲基化预测。

Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network.

机构信息

College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, China.

出版信息

PLoS Comput Biol. 2023 Aug 28;19(8):e1011370. doi: 10.1371/journal.pcbi.1011370. eCollection 2023 Aug.

DOI:10.1371/journal.pcbi.1011370
PMID:37639434
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10461834/
Abstract

DNA methylation takes on critical significance to the regulation of gene expression by affecting the stability of DNA and changing the structure of chromosomes. DNA methylation modification sites should be identified, which lays a solid basis for gaining more insights into their biological functions. Existing machine learning-based methods of predicting DNA methylation have not fully exploited the hidden multidimensional information in DNA gene sequences, such that the prediction accuracy of models is significantly limited. Besides, most models have been built in terms of a single methylation type. To address the above-mentioned issues, a deep learning-based method was proposed in this study for DNA methylation site prediction, termed the MEDCNN model. The MEDCNN model is capable of extracting feature information from gene sequences in three dimensions (i.e., positional information, biological information, and chemical information). Moreover, the proposed method employs a convolutional neural network model with double convolutional layers and double fully connected layers while iteratively updating the gradient descent algorithm using the cross-entropy loss function to increase the prediction accuracy of the model. Besides, the MEDCNN model can predict different types of DNA methylation sites. As indicated by the experimental results,the deep learning method based on coding from multiple dimensions outperformed single coding methods, and the MEDCNN model was highly applicable and outperformed existing models in predicting DNA methylation between different species. As revealed by the above-described findings, the MEDCNN model can be effective in predicting DNA methylation sites.

摘要

DNA 甲基化通过影响 DNA 的稳定性和改变染色体的结构,对基因表达的调控起着至关重要的作用。应识别 DNA 甲基化修饰位点,为深入了解其生物学功能奠定坚实基础。现有的基于机器学习的 DNA 甲基化预测方法尚未充分利用 DNA 基因序列中隐藏的多维信息,从而显著限制了模型的预测精度。此外,大多数模型都是基于单一的甲基化类型构建的。针对上述问题,本研究提出了一种基于深度学习的 DNA 甲基化位点预测方法,称为 MEDCNN 模型。MEDCNN 模型能够从基因序列中提取三维(即位置信息、生物信息和化学信息)的特征信息。此外,所提出的方法采用了具有双卷积层和双全连接层的卷积神经网络模型,同时使用交叉熵损失函数迭代更新梯度下降算法,以提高模型的预测精度。此外,MEDCNN 模型可以预测不同类型的 DNA 甲基化位点。实验结果表明,基于多维编码的深度学习方法优于单一编码方法,并且 MEDCNN 模型在预测不同物种之间的 DNA 甲基化方面具有高度的适用性和优于现有模型。综上所述,MEDCNN 模型可有效预测 DNA 甲基化位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/9bbd1bcaf08a/pcbi.1011370.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/f9425afe2c33/pcbi.1011370.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/62691d2625fd/pcbi.1011370.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/e9f063c00de4/pcbi.1011370.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/1bcae7348805/pcbi.1011370.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/ae63968b5968/pcbi.1011370.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/4d369f8f0611/pcbi.1011370.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/38c1b1c1a395/pcbi.1011370.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/9bbd1bcaf08a/pcbi.1011370.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/f9425afe2c33/pcbi.1011370.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/62691d2625fd/pcbi.1011370.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/e9f063c00de4/pcbi.1011370.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/1bcae7348805/pcbi.1011370.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/ae63968b5968/pcbi.1011370.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/4d369f8f0611/pcbi.1011370.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/38c1b1c1a395/pcbi.1011370.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b4/10461834/9bbd1bcaf08a/pcbi.1011370.g008.jpg

相似文献

1
Prediction of DNA Methylation based on Multi-dimensional feature encoding and double convolutional fully connected convolutional neural network.基于多维特征编码和双卷积全连接卷积神经网络的 DNA 甲基化预测。
PLoS Comput Biol. 2023 Aug 28;19(8):e1011370. doi: 10.1371/journal.pcbi.1011370. eCollection 2023 Aug.
2
Time series-based hybrid ensemble learning model with multivariate multidimensional feature coding for DNA methylation prediction.基于时间序列的混合集成学习模型,具有多维多维特征编码,用于 DNA 甲基化预测。
BMC Genomics. 2023 Dec 11;24(1):758. doi: 10.1186/s12864-023-09866-5.
3
A deep learning model for DNA enhancer prediction based on nucleotide position aware feature encoding.一种基于核苷酸位置感知特征编码的DNA增强子预测深度学习模型。
iScience. 2024 May 19;27(6):110030. doi: 10.1016/j.isci.2024.110030. eCollection 2024 Jun 21.
4
Essential genes identification model based on sequence feature map and graph convolutional neural network.基于序列特征图和图卷积神经网络的必需基因识别模型。
BMC Genomics. 2024 Jan 10;25(1):47. doi: 10.1186/s12864-024-09958-w.
5
EpiTEAmDNA: Sequence feature representation via transfer learning and ensemble learning for identifying multiple DNA epigenetic modification types across species.EpiTEAmDNA:通过迁移学习和集成学习进行序列特征表示,以跨物种识别多种 DNA 表观遗传修饰类型。
Comput Biol Med. 2023 Jun;160:107030. doi: 10.1016/j.compbiomed.2023.107030. Epub 2023 May 11.
6
Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in by Using Correlation-Based Feature Selection Technique.深度4mCGP:一种基于相关性特征选择技术预测4mC位点的深度学习方法。
Int J Mol Sci. 2022 Jan 23;23(3):1251. doi: 10.3390/ijms23031251.
7
Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction.使用卷积神经网络从患者多变量时间序列数据中学习隐藏模式:以医疗保健成本预测为例。
J Biomed Inform. 2020 Nov;111:103565. doi: 10.1016/j.jbi.2020.103565. Epub 2020 Sep 25.
8
Low-Rank Deep Convolutional Neural Network for Multitask Learning.低秩深度卷积神经网络的多任务学习
Comput Intell Neurosci. 2019 May 20;2019:7410701. doi: 10.1155/2019/7410701. eCollection 2019.
9
Deep-2'-O-Me: Predicting 2'-O-methylation sites by Convolutional Neural Networks.深度2'-O-甲基化:通过卷积神经网络预测2'-O-甲基化位点
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:2394-2397. doi: 10.1109/EMBC.2018.8512780.
10
Using multiple convolutional window scanning of convolutional neural network for an efficient prediction of ATP-binding sites in transport proteins.利用卷积神经网络的多卷积窗口扫描实现运输蛋白中 ATP 结合位点的高效预测。
Proteins. 2022 Jul;90(7):1486-1492. doi: 10.1002/prot.26329. Epub 2022 Mar 12.

引用本文的文献

1
Multi-kernel feature extraction with dynamic fusion and downsampled residual feature embedding for predicting rice RNA N6-methyladenine sites.用于预测水稻RNA N6-甲基腺嘌呤位点的具有动态融合和下采样残差特征嵌入的多核特征提取
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae647.
2
Early Detection, Precision Treatment, Recurrence Monitoring: Liquid Biopsy Transforms Colorectal Cancer Therapy.早期检测、精准治疗、复发监测:液体活检改变结直肠癌治疗方式
Curr Cancer Drug Targets. 2025;25(6):586-619. doi: 10.2174/0115680096295070240318075023.

本文引用的文献

1
Metheor: Ultrafast DNA methylation heterogeneity calculation from bisulfite read alignments.Metheor:从亚硫酸氢盐读取比对中计算超快 DNA 甲基化异质性。
PLoS Comput Biol. 2023 Mar 20;19(3):e1010946. doi: 10.1371/journal.pcbi.1010946. eCollection 2023 Mar.
2
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction.Hyb4mC:一种基于 DNA2vec 的混合模型,用于预测 DNA N4-甲基胞嘧啶位点。
BMC Bioinformatics. 2022 Jun 29;23(1):258. doi: 10.1186/s12859-022-04789-6.
3
EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction.
EMDLP:用于 RNA 甲基化位点预测的集成多尺度深度学习模型。
BMC Bioinformatics. 2022 Jun 8;23(1):221. doi: 10.1186/s12859-022-04756-1.
4
Deep6mAPred: A CNN and Bi-LSTM-based deep learning method for predicting DNA N6-methyladenosine sites across plant species.Deep6mAPred:一种基于 CNN 和 Bi-LSTM 的深度学习方法,用于预测跨植物物种的 DNA N6-甲基腺苷位点。
Methods. 2022 Aug;204:142-150. doi: 10.1016/j.ymeth.2022.04.011. Epub 2022 Apr 25.
5
DNAcycP: a deep learning tool for DNA cyclizability prediction.DNAcycP:一种用于 DNA 环化能力预测的深度学习工具。
Nucleic Acids Res. 2022 Apr 8;50(6):3142-3154. doi: 10.1093/nar/gkac162.
6
BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches.BERT6mA:基于深度学习的方法预测 DNA N6-甲基腺嘌呤位点。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac053.
7
Mouse4mC-BGRU: Deep learning for predicting DNA N4-methylcytosine sites in mouse genome.Mouse4mC-BGRU:用于预测小鼠基因组中 DNA N4-甲基胞嘧啶位点的深度学习方法。
Methods. 2022 Aug;204:258-262. doi: 10.1016/j.ymeth.2022.01.009. Epub 2022 Jan 31.
8
BiLSTM-5mC: A Bidirectional Long Short-Term Memory-Based Approach for Predicting 5-Methylcytosine Sites in Genome-Wide DNA Promoters.基于双向长短时记忆网络(BiLSTM)的 5-甲基胞嘧啶(5mC)位点预测方法:全基因组 DNA 启动子研究
Molecules. 2021 Dec 7;26(24):7414. doi: 10.3390/molecules26247414.
9
Deep transformers and convolutional neural network in identifying DNA N6-methyladenine sites in cross-species genomes.深度转换器和卷积神经网络在跨物种基因组中识别 DNA N6-甲基腺嘌呤位点。
Methods. 2022 Aug;204:199-206. doi: 10.1016/j.ymeth.2021.12.004. Epub 2021 Dec 13.
10
DNA methylation-based classifier and gene expression signatures detect BRCAness in osteosarcoma.基于 DNA 甲基化的分类器和基因表达特征可检测骨肉瘤中的 BRCA 样特征。
PLoS Comput Biol. 2021 Nov 11;17(11):e1009562. doi: 10.1371/journal.pcbi.1009562. eCollection 2021 Nov.