A deep learning model for DNA enhancer prediction based on nucleotide position aware feature encoding.

Authors

Hu Wenxing, Li Yelin, Wu Yan, Guan Lixin, Li Mengshan

Affiliation

College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.

Publication information

iScience. 2024 May 19;27(6):110030. doi: 10.1016/j.isci.2024.110030. eCollection 2024 Jun 21.

DOI:10.1016/j.isci.2024.110030

PMID:38868182

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11167433/

Abstract

Enhancers are genomic DNA elements that regulate the expression of neighboring genes, which is crucial for biological processes such as cell differentiation and stress response. However, current machine learning methods for predicting DNA enhancers often underutilize features hidden in gene sequences, which limits model accuracy. This article therefore proposes PDCNN, a deep learning-based enhancer prediction method. PDCNN extracts statistical nucleotide representations from gene sequences, capturing the positional distribution of nucleotides in modifier-like DNA sequences. Built on a convolutional neural network structure, PDCNN employs dual convolutional and fully connected layers. The network is trained by iteratively minimizing a cross-entropy loss function with a gradient descent algorithm, which improves prediction accuracy. Model parameters are tuned to select the best combination for training, achieving over 95% accuracy. Comparative analysis with traditional methods and existing models demonstrates PDCNN's robust feature extraction capability. It outperforms advanced machine learning methods in identifying DNA enhancers, offering an effective method with broad implications for genomics, biology, and medical research.
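
To make the idea of a position-aware nucleotide encoding concrete, the following is a minimal sketch and not the encoding used by PDCNN, which the abstract does not specify. It assumes each base is represented by a one-hot vector plus a running density (the share of the prefix ending at that position occupied by the same base). The function name `position_aware_encoding` and the five-column layout are hypothetical choices made for illustration.

```python
from collections import Counter

NUCLEOTIDES = "ACGT"


def position_aware_encoding(seq):
    """Encode each base as a one-hot vector (4 values) plus a running density.

    The running density at position i is the number of occurrences of
    seq[i] in seq[:i + 1] divided by (i + 1). This is an illustrative
    assumption, not necessarily the feature encoding used by PDCNN.
    """
    counts = Counter()
    rows = []
    for i, base in enumerate(seq.upper()):
        if base not in NUCLEOTIDES:
            raise ValueError(f"unexpected symbol {base!r} at position {i}")
        counts[base] += 1
        one_hot = [1.0 if base == n else 0.0 for n in NUCLEOTIDES]
        density = counts[base] / (i + 1)
        rows.append(one_hot + [density])
    return rows


# Each row holds the A, C, G, T indicators followed by the running density.
features = position_aware_encoding("ACGA")
for row in features:
    print(row)
```

A matrix of this shape (sequence length by 5) is the kind of input a one-dimensional convolutional network could consume. The same base receives a different density value depending on where it occurs, which is what makes the representation sensitive to position.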

Graphical abstract: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4181/11167433/0e1ecdf5e068/fx1.jpg
