
BiCaps-DBP: Predicting DNA-binding proteins from protein sequences using Bi-LSTM and a 1D-capsule network.

Affiliations

School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia; Department of Informatics Engineering, Universal University, Batam, Indonesia.

School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia.

Publication information

Comput Biol Med. 2023 Sep;163:107241. doi: 10.1016/j.compbiomed.2023.107241. Epub 2023 Jul 8.

DOI:10.1016/j.compbiomed.2023.107241
PMID:37437362
Abstract

Predicting DNA-binding proteins (DBPs) based solely on primary sequences is one of the most challenging problems in genome annotation. DBPs play a crucial role in various biological processes, including DNA replication, transcription, repair, and splicing. Some DBPs are essential in pharmaceutical research on various human cancers and autoimmune diseases. Existing experimental methods for identifying DBPs are time-consuming and costly. Therefore, developing a rapid and accurate computational technique is necessary to address the issue. This study introduces BiCaps-DBP, a deep learning-based method that improves DBP prediction performance by combining bidirectional long short-term memory with a 1D-capsule network. This study uses three training and independent datasets to evaluate the proposed model's generalizability and robustness. Based on three independent datasets, BiCaps-DBP achieved 1.05%, 5.79% and 0.40% higher accuracies than an existing predictor for PDB2272, PDB186 and PDB20000, respectively. These outcomes indicate that the proposed method is a promising DBP predictor.
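
The abstract names the two building blocks (a Bi-LSTM and a 1D-capsule network) but gives no architectural details. As a rough illustration only, the sketch below shows two steps that sequence-based capsule models commonly rely on: turning a primary sequence into fixed-length integer indices, and the capsule "squash" nonlinearity, which keeps a vector's direction while mapping its length into [0, 1). The function names, the 20-letter alphabet ordering, the zero-padding scheme, and the use of squash itself are assumptions made for this example and are not taken from the paper.

```python
import math

# Assumed alphabet: the 20 standard amino acids.
# Index 0 is reserved for padding and unknown residues (an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_TO_INDEX = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def encode_sequence(sequence, max_len):
    """Map a protein sequence to a fixed-length list of integer indices."""
    indices = [AA_TO_INDEX.get(aa, 0) for aa in sequence.upper()[:max_len]]
    # Right-pad short sequences with zeros up to max_len.
    return indices + [0] * (max_len - len(indices))


def squash(vector, eps=1e-9):
    """Capsule squash: preserve direction, shrink length into [0, 1)."""
    squared_norm = sum(x * x for x in vector)
    norm = math.sqrt(squared_norm + eps)  # eps avoids division by zero
    scale = squared_norm / (1.0 + squared_norm)
    return [scale * x / norm for x in vector]


def capsule_length(vector):
    """Euclidean length of a capsule output, often read as a class score."""
    return math.sqrt(sum(x * x for x in vector))
```

In a full model, the encoded indices would typically pass through an embedding layer and a bidirectional LSTM before reaching the capsule layer; those parts are omitted here because the abstract does not describe them.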


Similar articles

1. BiCaps-DBP: Predicting DNA-binding proteins from protein sequences using Bi-LSTM and a 1D-capsule network.
Comput Biol Med. 2023 Sep;163:107241. doi: 10.1016/j.compbiomed.2023.107241. Epub 2023 Jul 8.
2. DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins.
Comput Biol Med. 2022 Jun;145:105433. doi: 10.1016/j.compbiomed.2022.105433. Epub 2022 Mar 30.
3. Improving DNA-Binding Protein Prediction Using Three-Part Sequence-Order Feature Extraction and a Deep Neural Network Algorithm.
J Chem Inf Model. 2023 Feb 13;63(3):1044-1057. doi: 10.1021/acs.jcim.2c00943. Epub 2023 Jan 31.
4. DeepDRBP-2L: A New Genome Annotation Predictor for Identifying DNA-Binding Proteins and RNA-Binding Proteins Using Convolutional Neural Network and Long Short-Term Memory.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1451-1463. doi: 10.1109/TCBB.2019.2952338. Epub 2021 Aug 6.
5. Improved prediction of DNA and RNA binding proteins with deep learning models.
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae285.
6. Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning.
PeerJ. 2021 May 3;9:e11262. doi: 10.7717/peerj.11262. eCollection 2021.
7. PreDBP-PLMs: Prediction of DNA-binding proteins based on pre-trained protein language models and convolutional neural networks.
Anal Biochem. 2024 Nov;694:115603. doi: 10.1016/j.ab.2024.115603. Epub 2024 Jul 8.
8. DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence.
Int J Mol Sci. 2021 May 24;22(11):5521. doi: 10.3390/ijms22115521.
9. LGC-DBP: the method of DNA-binding protein identification based on PSSM and deep learning.
Front Genet. 2024 Jun 5;15:1411847. doi: 10.3389/fgene.2024.1411847. eCollection 2024.
10. Deep-WET: a deep learning-based approach for predicting DNA-binding proteins using word embedding techniques with weighted features.
Sci Rep. 2024 Feb 5;14(1):2961. doi: 10.1038/s41598-024-52653-9.

Cited by

1. Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2. Using the TSA-LSTM two-stage model to predict cancer incidence and mortality.
PLoS One. 2025 Feb 20;20(2):e0317148. doi: 10.1371/journal.pone.0317148. eCollection 2025.