• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepDNAbP:一种基于深度学习的混合方法,用于提高脱氧核糖核酸结合蛋白的识别能力。

DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins.

机构信息

Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh.

Department of Computer Science, American International University-Bangladesh (AIUB), Kuratoli, Dhaka, 1229, Bangladesh.

出版信息

Comput Biol Med. 2022 Jun;145:105433. doi: 10.1016/j.compbiomed.2022.105433. Epub 2022 Mar 30.

DOI:10.1016/j.compbiomed.2022.105433
PMID:35378437
Abstract

Accurate identification of DNA-binding proteins (DBPs) is critical for both understanding protein function and drug design. DBPs also play essential roles in different kinds of biological activities such as DNA replication, repair, transcription, and splicing. As experimental identification of DBPs is time-consuming and sometimes biased toward prediction, constructing an effective DBP model represents an urgent need, and computational methods that can accurately predict potential DBPs based on sequence information are highly desirable. In this paper, a novel predictor called DeepDNAbP has been developed to accurately predict DBPs from sequences using a convolutional neural network (CNN) model. First, we perform three feature extraction methods, namely position-specific scoring matrix (PSSM), pseudo-amino acid composition (PseAAC) and tripeptide composition (TPC), to represent protein sequence patterns. Secondly, SHapley Additive exPlanations (SHAP) are employed to remove the redundant and irrelevant features for predicting DBPs. Finally, the best features are provided to the CNN classifier to construct the DeepDNAbP model for identifying DBPs. The final DeepDNAbP predictor achieves superior prediction performance in K-fold cross-validation tests and outperforms other existing predictors of DNA-protein binding methods. DeepDNAbP is poised to be a powerful computational resource for the prediction of DBPs. The web application and curated datasets in this study are freely available at: http://deepdbp.sblog360.blog/.

摘要

准确识别 DNA 结合蛋白 (DBP) 对于理解蛋白质功能和药物设计都至关重要。DBP 还在不同的生物活性中发挥着重要作用,如 DNA 复制、修复、转录和剪接。由于实验鉴定 DBP 既耗时又有时偏向于预测,因此构建有效的 DBP 模型迫在眉睫,而能够基于序列信息准确预测潜在 DBP 的计算方法是非常需要的。在本文中,我们开发了一种名为 DeepDNAbP 的新型预测器,该预测器使用卷积神经网络 (CNN) 模型从序列中准确预测 DBP。首先,我们使用三种特征提取方法,即位置特异性评分矩阵 (PSSM)、伪氨基酸组成 (PseAAC) 和三肽组成 (TPC),来表示蛋白质序列模式。其次,我们采用 SHapley Additive exPlanations (SHAP) 来去除预测 DBP 时冗余和不相关的特征。最后,将最佳特征提供给 CNN 分类器,以构建用于识别 DBP 的 DeepDNAbP 模型。最终的 DeepDNAbP 预测器在 K 折交叉验证测试中表现出优异的预测性能,优于其他现有的 DNA-蛋白质结合方法预测器。DeepDNAbP 有望成为预测 DBP 的强大计算资源。本研究中的网络应用程序和经过整理的数据集可免费在:http://deepdbp.sblog360.blog/ 获取。

相似文献

1
DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins.DeepDNAbP:一种基于深度学习的混合方法,用于提高脱氧核糖核酸结合蛋白的识别能力。
Comput Biol Med. 2022 Jun;145:105433. doi: 10.1016/j.compbiomed.2022.105433. Epub 2022 Mar 30.
2
Deep-WET: a deep learning-based approach for predicting DNA-binding proteins using word embedding techniques with weighted features.深度WET:一种基于深度学习的方法,利用带加权特征的词嵌入技术预测DNA结合蛋白。
Sci Rep. 2024 Feb 5;14(1):2961. doi: 10.1038/s41598-024-52653-9.
3
PreDBP-PLMs: Prediction of DNA-binding proteins based on pre-trained protein language models and convolutional neural networks.PreDBP-PLMs:基于预训练蛋白质语言模型和卷积神经网络的DNA结合蛋白预测
Anal Biochem. 2024 Nov;694:115603. doi: 10.1016/j.ab.2024.115603. Epub 2024 Jul 8.
4
Target-DBPPred: An intelligent model for prediction of DNA-binding proteins using discrete wavelet transform based compression and light eXtreme gradient boosting.目标-DBPPred:一种使用基于离散小波变换的压缩和轻极限梯度提升的智能 DNA 结合蛋白预测模型。
Comput Biol Med. 2022 Jun;145:105533. doi: 10.1016/j.compbiomed.2022.105533. Epub 2022 Apr 16.
5
iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network.iDRBP_MMC:基于多标签学习模型和基于模体的卷积神经网络的 DNA 结合蛋白和 RNA 结合蛋白的鉴定。
J Mol Biol. 2020 Nov 6;432(22):5860-5875. doi: 10.1016/j.jmb.2020.09.008. Epub 2020 Sep 11.
6
DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.DP-BINDER:一种通过融合进化和物理化学信息来预测 DNA 结合蛋白的机器学习模型。
J Comput Aided Mol Des. 2019 Jul;33(7):645-658. doi: 10.1007/s10822-019-00207-x. Epub 2019 May 23.
7
TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning.目标 DBP:基于序列的多视图特征学习的准确 DNA 结合蛋白预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1419-1429. doi: 10.1109/TCBB.2019.2893634. Epub 2019 Jan 18.
8
DBP-GAPred: An intelligent method for prediction of DNA-binding proteins types by enhanced evolutionary profile features with ensemble learning.DBP-GAPred:一种通过增强进化轮廓特征与集成学习预测 DNA 结合蛋白类型的智能方法。
J Bioinform Comput Biol. 2021 Aug;19(4):2150018. doi: 10.1142/S0219720021500189. Epub 2021 Jul 21.
9
PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method.PredDBP-Stack:基于堆叠集成方法的使用 HMM 轮廓预测 DNA 结合蛋白
Biomed Res Int. 2020 Apr 13;2020:7297631. doi: 10.1155/2020/7297631. eCollection 2020.
10
Improved prediction of DNA and RNA binding proteins with deep learning models.深度学习模型提高 DNA 和 RNA 结合蛋白的预测能力。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae285.

引用本文的文献

1
Efficient prediction of anticancer peptides through deep learning.通过深度学习高效预测抗癌肽
PeerJ Comput Sci. 2024 Jul 19;10:e2171. doi: 10.7717/peerj-cs.2171. eCollection 2024.
2
A deep learning method to predict bacterial ADP-ribosyltransferase toxins.一种预测细菌 ADP-ribosyltransferase 毒素的深度学习方法。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae378.
3
Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition.基于两亲性伪氨基酸组成预测凋亡蛋白的亚细胞定位
Front Genet. 2023 Feb 28;14:1157021. doi: 10.3389/fgene.2023.1157021. eCollection 2023.