• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LGC-DBP:基于位置特异性得分矩阵(PSSM)和深度学习的DNA结合蛋白识别方法。

LGC-DBP: the method of DNA-binding protein identification based on PSSM and deep learning.

作者信息

Zhu Yiqi, Sun Ailun

机构信息

Department of Computer Science and Technology, College of Computer and Control Engineering, Northeast Forestry University, Harbin, China.

出版信息

Front Genet. 2024 Jun 5;15:1411847. doi: 10.3389/fgene.2024.1411847. eCollection 2024.

DOI:10.3389/fgene.2024.1411847
PMID:38903752
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11188361/
Abstract

The recognition of DNA Binding Proteins (DBPs) plays a crucial role in understanding biological functions such as replication, transcription, and repair. Although current sequence-based methods have shown some effectiveness, they often fail to fully utilize the potential of deep learning in capturing complex patterns. This study introduces a novel model, LGC-DBP, which integrates Long Short-Term Memory (LSTM), Gated Inception Convolution, and Improved Channel Attention mechanisms to enhance the prediction of DBPs. Initially, the model transforms protein sequences into Position Specific Scoring Matrices (PSSM), then processed through our deep learning framework. Within this framework, Gated Inception Convolution merges the concepts of gating units with the advantages of Graph Convolutional Network (GCN) and Dilated Convolution, significantly surpassing traditional convolution methods. The Improved Channel Attention mechanism substantially enhances the model's responsiveness and accuracy by shifting from a single input to three inputs and integrating three sigmoid functions along with an additional layer output. These innovative combinations have significantly improved model performance, enabling LGC-DBP to recognize and interpret the complex relationships within DBP features more accurately. The evaluation results show that LGC-DBP achieves an accuracy of 88.26% and a Matthews correlation coefficient of 0.701, both surpassing existing methods. These achievements demonstrate the model's strong capability in integrating and analyzing multi-dimensional data and mark a significant advancement over traditional methods by capturing deeper, nonlinear interactions within the data.

摘要

DNA结合蛋白(DBP)的识别在理解诸如复制、转录和修复等生物学功能方面起着至关重要的作用。尽管当前基于序列的方法已显示出一定的有效性,但它们往往未能充分利用深度学习在捕捉复杂模式方面的潜力。本研究引入了一种新型模型LGC-DBP,该模型集成了长短期记忆(LSTM)、门控 inception 卷积和改进的通道注意力机制,以增强对DBP的预测。最初,该模型将蛋白质序列转换为位置特异性评分矩阵(PSSM),然后通过我们的深度学习框架进行处理。在此框架内,门控 inception 卷积将门控单元的概念与图卷积网络(GCN)和扩张卷积的优势相结合,显著超越了传统卷积方法。改进的通道注意力机制通过从单个输入转变为三个输入,并集成三个 sigmoid 函数以及一个额外的层输出,大幅提高了模型的响应能力和准确性。这些创新组合显著提升了模型性能,使LGC-DBP能够更准确地识别和解释DBP特征中的复杂关系。评估结果表明,LGC-DBP的准确率达到88.26%,马修斯相关系数为0.701,均超过了现有方法。这些成果证明了该模型在整合和分析多维度数据方面的强大能力,并通过捕捉数据中更深层次的非线性相互作用,标志着相对于传统方法的重大进步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/a1706ec8e847/fgene-15-1411847-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/ab3fd0c0be54/fgene-15-1411847-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/4b3d0783081c/fgene-15-1411847-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/9b08912c9a4e/fgene-15-1411847-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/6adcc3c262e0/fgene-15-1411847-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/a78cfd664860/fgene-15-1411847-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/a1706ec8e847/fgene-15-1411847-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/ab3fd0c0be54/fgene-15-1411847-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/4b3d0783081c/fgene-15-1411847-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/9b08912c9a4e/fgene-15-1411847-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/6adcc3c262e0/fgene-15-1411847-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/a78cfd664860/fgene-15-1411847-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b19/11188361/a1706ec8e847/fgene-15-1411847-g006.jpg

相似文献

1
LGC-DBP: the method of DNA-binding protein identification based on PSSM and deep learning.LGC-DBP:基于位置特异性得分矩阵(PSSM)和深度学习的DNA结合蛋白识别方法。
Front Genet. 2024 Jun 5;15:1411847. doi: 10.3389/fgene.2024.1411847. eCollection 2024.
2
Improving DNA-Binding Protein Prediction Using Three-Part Sequence-Order Feature Extraction and a Deep Neural Network Algorithm.利用三部分序列顺序特征提取和深度神经网络算法提高 DNA 结合蛋白预测。
J Chem Inf Model. 2023 Feb 13;63(3):1044-1057. doi: 10.1021/acs.jcim.2c00943. Epub 2023 Jan 31.
3
BiCaps-DBP: Predicting DNA-binding proteins from protein sequences using Bi-LSTM and a 1D-capsule network.BiCaps-DBP:使用 Bi-LSTM 和 1D-capsule 网络从蛋白质序列预测 DNA 结合蛋白。
Comput Biol Med. 2023 Sep;163:107241. doi: 10.1016/j.compbiomed.2023.107241. Epub 2023 Jul 8.
4
DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins.DeepDNAbP:一种基于深度学习的混合方法,用于提高脱氧核糖核酸结合蛋白的识别能力。
Comput Biol Med. 2022 Jun;145:105433. doi: 10.1016/j.compbiomed.2022.105433. Epub 2022 Mar 30.
5
STM-ac4C: a hybrid model for identification of N4-acetylcytidine (ac4C) in human mRNA based on selective kernel convolution, temporal convolutional network, and multi-head self-attention.STM-ac4C:一种基于选择性核卷积、时间卷积网络和多头自注意力机制的用于识别人类mRNA中N4-乙酰胞嘧啶(ac4C)的混合模型。
Front Genet. 2024 May 30;15:1408688. doi: 10.3389/fgene.2024.1408688. eCollection 2024.
6
MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.MAMF-GCN:用于预测精神障碍的多尺度自适应多通道融合深度图卷积网络。
Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.
7
DBP-iDWT: Improving DNA-Binding Proteins Prediction Using Multi-Perspective Evolutionary Profile and Discrete Wavelet Transform.DBP-iDWT:利用多视角进化特征和离散小波变换提高 DNA 结合蛋白预测
Comput Intell Neurosci. 2022 Sep 28;2022:2987407. doi: 10.1155/2022/2987407. eCollection 2022.
8
Identification of adaptor proteins by incorporating deep learning and PSSM profiles.通过整合深度学习和 PSSM 特征来鉴定衔接蛋白。
Methods. 2023 Jan;209:10-17. doi: 10.1016/j.ymeth.2022.11.001. Epub 2022 Nov 22.
9
Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning.基于深度学习利用局部特征和与一级序列的长期依赖性预测DNA结合蛋白。
PeerJ. 2021 May 3;9:e11262. doi: 10.7717/peerj.11262. eCollection 2021.
10
Comprehensive Study on Enhancing Low-Quality Position-Specific Scoring Matrix with Deep Learning for Accurate Protein Structure Property Prediction: Using Bagging Multiple Sequence Alignment Learning.利用Bagging多序列比对学习,通过深度学习增强低质量位置特异性评分矩阵以进行准确蛋白质结构特性预测的综合研究
J Comput Biol. 2021 Apr;28(4):346-361. doi: 10.1089/cmb.2020.0416. Epub 2021 Feb 22.

本文引用的文献

1
DBPboost:A method of classification of DNA-binding proteins based on improved differential evolution algorithm and feature extraction.DBPboost:一种基于改进差分进化算法和特征提取的 DNA 结合蛋白分类方法。
Methods. 2024 Mar;223:56-64. doi: 10.1016/j.ymeth.2024.01.005. Epub 2024 Jan 17.
2
KS-CMI: A circRNA-miRNA interaction prediction method based on the signed graph neural network and denoising autoencoder.KS-CMI:一种基于带符号图神经网络和去噪自动编码器的环状RNA-微RNA相互作用预测方法。
iScience. 2023 Jul 26;26(8):107478. doi: 10.1016/j.isci.2023.107478. eCollection 2023 Aug 18.
3
A feature extraction method based on noise reduction for circRNA-miRNA interaction prediction combining multi-structure features in the association networks.
基于关联网络中多结构特征的降噪的 circRNA-miRNA 相互作用预测的特征提取方法。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad111.
4
RF-SVM: Identification of DNA-binding proteins based on comprehensive feature representation methods and support vector machine.RF-SVM:基于综合特征表示方法和支持向量机的 DNA 结合蛋白识别。
Proteins. 2022 Feb;90(2):395-404. doi: 10.1002/prot.26229. Epub 2021 Sep 9.
5
TargetDBP+: Enhancing the Performance of Identifying DNA-Binding Proteins via Weighted Convolutional Features.TargetDBP+:通过加权卷积特征提高 DNA 结合蛋白识别性能。
J Chem Inf Model. 2021 Jan 25;61(1):505-515. doi: 10.1021/acs.jcim.0c00735. Epub 2021 Jan 7.
6
Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。
Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
7
MsDBP: Exploring DNA-Binding Proteins by Integrating Multiscale Sequence Information via Chou's Five-Step Rule.MsDBP:通过整合多尺度序列信息和周的五步法则探索 DNA 结合蛋白
J Proteome Res. 2019 Aug 2;18(8):3119-3132. doi: 10.1021/acs.jproteome.9b00226. Epub 2019 Jul 17.
8
TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning.目标 DBP:基于序列的多视图特征学习的准确 DNA 结合蛋白预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1419-1429. doi: 10.1109/TCBB.2019.2893634. Epub 2019 Jan 18.
9
DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.DPP-PseAAC:一种基于 Chou 的通用 PseAAC 的 DNA 结合蛋白预测模型。
J Theor Biol. 2018 Sep 7;452:22-34. doi: 10.1016/j.jtbi.2018.05.006. Epub 2018 May 16.
10
iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features.iDNAProt-ES:利用进化和结构特征鉴定 DNA 结合蛋白。
Sci Rep. 2017 Nov 2;7(1):14938. doi: 10.1038/s41598-017-14945-1.