• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pACP-HybDeep:基于二叉树生长的变压器和深度混合学习的结构特征编码预测抗癌肽

pACP-HybDeep: predicting anticancer peptides using binary tree growth based transformer and structural feature encoding with deep-hybrid learning.

作者信息

Hayat Maqsood, Alghamdi Wajdi, Akbar Shahid, Raza Ali, Kadir Rabiah Abdul, Sarker Mahidur R

机构信息

Department of Computer Science, Abdul Wali Khan University Mardan, Mardan, 23200, KP, Pakistan.

Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia.

出版信息

Sci Rep. 2025 Jan 2;15(1):565. doi: 10.1038/s41598-024-84146-0.

DOI:10.1038/s41598-024-84146-0
PMID:39747941
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11695694/
Abstract

Worldwide, Cancer remains a significant health concern due to its high mortality rates. Despite numerous traditional therapies and wet-laboratory methods for treating cancer-affected cells, these approaches often face limitations, including high costs and substantial side effects. Recently the high selectivity of peptides has garnered significant attention from scientists due to their reliable targeted actions and minimal adverse effects. Furthermore, keeping the significant outcomes of the existing computational models, we propose a highly reliable and effective model namely, pACP-HybDeep for the accurate prediction of anticancer peptides. In this model, training peptides are numerically encoded using an attention-based ProtBERT-BFD encoder to extract semantic features along with CTDT-based structural information. Furthermore, a k-nearest neighbor-based binary tree growth (BTG) algorithm is employed to select an optimal feature set from the multi-perspective vector. The selected feature vector is subsequently trained using a CNN + RNN-based deep learning model. Our proposed pACP-HybDeep model demonstrated a high training accuracy of 95.33%, and an AUC of 0.97. To validate the generalization capabilities of the model, our pACP-HybDeep model achieved accuracies of 94.92%, 92.26%, and 91.16% on independent datasets Ind-S1, Ind-S2, and Ind-S3, respectively. The demonstrated efficacy, and reliability of the pACP-HybDeep model using test datasets establish it as a valuable tool for researchers in academia and pharmaceutical drug design.

摘要

在全球范围内,癌症因其高死亡率仍然是一个重大的健康问题。尽管有许多传统疗法和用于治疗癌细胞的湿实验室方法,但这些方法往往面临局限性,包括高成本和严重的副作用。最近,肽的高选择性因其可靠的靶向作用和最小的副作用而受到科学家的广泛关注。此外,考虑到现有计算模型的显著成果,我们提出了一种高度可靠且有效的模型,即pACP-HybDeep,用于准确预测抗癌肽。在该模型中,训练肽使用基于注意力的ProtBERT-BFD编码器进行数值编码,以提取语义特征以及基于CTDT的结构信息。此外,采用基于k近邻的二叉树生长(BTG)算法从多视角向量中选择最优特征集。随后,使用基于CNN + RNN的深度学习模型对所选特征向量进行训练。我们提出的pACP-HybDeep模型展示了95.33%的高训练准确率和0.97的AUC。为了验证该模型的泛化能力,我们的pACP-HybDeep模型在独立数据集Ind-S1、Ind-S2和Ind-S3上分别达到了94.92%、92.26%和91.16%的准确率。使用测试数据集所展示的pACP-HybDeep模型的有效性和可靠性使其成为学术界研究人员和药物设计的宝贵工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/35b1a057e634/41598_2024_84146_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/dd4c2b81280e/41598_2024_84146_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/2d393711ad39/41598_2024_84146_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/4835352639da/41598_2024_84146_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/052add7232a3/41598_2024_84146_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/1c03d7ad7376/41598_2024_84146_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/07dcbde14f65/41598_2024_84146_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/35b1a057e634/41598_2024_84146_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/dd4c2b81280e/41598_2024_84146_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/2d393711ad39/41598_2024_84146_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/4835352639da/41598_2024_84146_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/052add7232a3/41598_2024_84146_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/1c03d7ad7376/41598_2024_84146_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/07dcbde14f65/41598_2024_84146_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bbf/11695694/35b1a057e634/41598_2024_84146_Fig6_HTML.jpg

相似文献

1
pACP-HybDeep: predicting anticancer peptides using binary tree growth based transformer and structural feature encoding with deep-hybrid learning.pACP-HybDeep:基于二叉树生长的变压器和深度混合学习的结构特征编码预测抗癌肽
Sci Rep. 2025 Jan 2;15(1):565. doi: 10.1038/s41598-024-84146-0.
2
pACPs-DNN: Predicting anticancer peptides using novel peptide transformation into evolutionary and structure matrix-based images with self-attention deep learning model.pACPs-DNN:利用新型肽转化为基于进化和结构矩阵的图像以及自注意力深度学习模型预测抗癌肽。
Comput Biol Chem. 2025 Aug;117:108441. doi: 10.1016/j.compbiolchem.2025.108441. Epub 2025 Mar 28.
3
DeepAIPs-Pred: Predicting Anti-Inflammatory Peptides Using Local Evolutionary Transformation Images and Structural Embedding-Based Optimal Descriptors with Self-Normalized BiTCNs.深度人工智能预测:使用局部进化变换图像和基于结构嵌入的最优描述符与自归一化双向卷积网络预测抗炎肽
J Chem Inf Model. 2024 Dec 23;64(24):9609-9625. doi: 10.1021/acs.jcim.4c01758. Epub 2024 Dec 3.
4
Deepstacked-AVPs: predicting antiviral peptides using tri-segment evolutionary profile and word embedding based multi-perspective features with deep stacking model.深度堆叠 AVPs:使用三片段进化特征和基于单词嵌入的多视角特征与深度堆叠模型预测抗病毒肽。
BMC Bioinformatics. 2024 Mar 7;25(1):102. doi: 10.1186/s12859-024-05726-5.
5
iAFPs-Mv-BiTCN: Predicting antifungal peptides using self-attention transformer embedding and transform evolutionary based multi-view features with bidirectional temporal convolutional networks.iAFPs-Mv-BiTCN:使用自注意力转换器嵌入和基于进化的多视图特征与双向时间卷积网络预测抗真菌肽。
Artif Intell Med. 2024 May;151:102860. doi: 10.1016/j.artmed.2024.102860. Epub 2024 Mar 26.
6
Deep-ProBind: binding protein prediction with transformer-based deep learning model.深度蛋白质结合预测:基于Transformer的深度学习模型进行结合蛋白预测。
BMC Bioinformatics. 2025 Mar 22;26(1):88. doi: 10.1186/s12859-025-06101-8.
7
pNPs-CapsNet: Predicting Neuropeptides Using Protein Language Models and FastText Encoding-Based Weighted Multi-View Feature Integration with Deep Capsule Neural Network.pNPs-CapsNet:使用蛋白质语言模型和基于FastText编码的加权多视图特征集成与深度胶囊神经网络预测神经肽
ACS Omega. 2025 Mar 18;10(12):12403-12416. doi: 10.1021/acsomega.4c11449. eCollection 2025 Apr 1.
8
ACP-MHCNN: an accurate multi-headed deep-convolutional neural network to predict anticancer peptides.ACP-MHCNN:一种准确的多头深度卷积神经网络,用于预测抗癌肽。
Sci Rep. 2021 Dec 8;11(1):23676. doi: 10.1038/s41598-021-02703-3.
9
iACP-DFSRA: Identification of Anticancer Peptides Based on a Dual-channel Fusion Strategy of ResCNN and Attention.iACP-DFSRA:基于 ResCNN 和注意力的双通道融合策略的抗癌肽鉴定。
J Mol Biol. 2024 Nov 15;436(22):168810. doi: 10.1016/j.jmb.2024.168810. Epub 2024 Oct 1.
10
AIPs-SnTCN: Predicting Anti-Inflammatory Peptides Using fastText and Transformer Encoder-Based Hybrid Word Embedding with Self-Normalized Temporal Convolutional Networks.AIPs-SnTCN:使用基于fastText和基于Transformer编码器的混合词嵌入与自归一化时间卷积网络预测抗炎肽
J Chem Inf Model. 2023 Nov 13;63(21):6537-6554. doi: 10.1021/acs.jcim.3c01563. Epub 2023 Oct 31.

引用本文的文献

1
NFEmbed: modeling nitrogenase activity via classification and regression with pretrained protein embeddings.NFEmbed:通过使用预训练蛋白质嵌入进行分类和回归来模拟固氮酶活性。
Bioinform Adv. 2025 Aug 23;5(1):vbaf204. doi: 10.1093/bioadv/vbaf204. eCollection 2025.
2
Long sequence temporal knowledge tracing for student performance prediction via integrating LSTM and informer.通过整合长短期记忆网络(LSTM)和Informer进行学生成绩预测的长序列时间知识追踪
PLoS One. 2025 Sep 9;20(9):e0330433. doi: 10.1371/journal.pone.0330433. eCollection 2025.
3
An efficient machine-learning framework for predicting protein post-translational modification sites.

本文引用的文献

1
DeepAIPs-Pred: Predicting Anti-Inflammatory Peptides Using Local Evolutionary Transformation Images and Structural Embedding-Based Optimal Descriptors with Self-Normalized BiTCNs.深度人工智能预测:使用局部进化变换图像和基于结构嵌入的最优描述符与自归一化双向卷积网络预测抗炎肽
J Chem Inf Model. 2024 Dec 23;64(24):9609-9625. doi: 10.1021/acs.jcim.4c01758. Epub 2024 Dec 3.
2
Deep-m5U: a deep learning-based approach for RNA 5-methyluridine modification prediction using optimized feature integration.Deep-m5U:一种基于深度学习的方法,用于使用优化的特征集成进行 RNA 5-甲基尿嘧啶修饰预测。
BMC Bioinformatics. 2024 Nov 19;25(1):360. doi: 10.1186/s12859-024-05978-1.
3
一种用于预测蛋白质翻译后修饰位点的高效机器学习框架。
Sci Rep. 2025 Aug 25;15(1):31179. doi: 10.1038/s41598-025-13178-x.
4
Targeting neurodegeneration: three machine learning methods for G9a inhibitors discovery using PubChem and scikit-learn.靶向神经退行性变:使用PubChem和scikit-learn发现G9a抑制剂的三种机器学习方法。
J Comput Aided Mol Des. 2025 Aug 6;39(1):61. doi: 10.1007/s10822-025-00642-z.
5
Deciphering the MHC immunopeptidome of human cancers with Ligand.MHC atlas.利用配体-MHC图谱解析人类癌症的MHC免疫肽组
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf314.
6
BGATT-GR: accurate identification of glucocorticoid receptor antagonists based on data augmentation combined with BiGRU-attention.BGATT-GR:基于数据增强结合双向门控循环单元-注意力机制的糖皮质激素受体拮抗剂准确识别
Sci Rep. 2025 Jul 1;15(1):21402. doi: 10.1038/s41598-025-05839-8.
7
An optimized transformer model for efficient detection of thoracic diseases in chest X-rays with multi-scale feature fusion.一种用于通过多尺度特征融合高效检测胸部X光片中胸部疾病的优化变压器模型。
PLoS One. 2025 May 7;20(5):e0323239. doi: 10.1371/journal.pone.0323239. eCollection 2025.
8
Towards precision oncology: a multi-level cancer classification system integrating liquid biopsy and machine learning.迈向精准肿瘤学:一种整合液体活检和机器学习的多层次癌症分类系统。
BioData Min. 2025 Apr 11;18(1):29. doi: 10.1186/s13040-025-00439-8.
9
pNPs-CapsNet: Predicting Neuropeptides Using Protein Language Models and FastText Encoding-Based Weighted Multi-View Feature Integration with Deep Capsule Neural Network.pNPs-CapsNet:使用蛋白质语言模型和基于FastText编码的加权多视图特征集成与深度胶囊神经网络预测神经肽
ACS Omega. 2025 Mar 18;10(12):12403-12416. doi: 10.1021/acsomega.4c11449. eCollection 2025 Apr 1.
A hybrid residue based sequential encoding mechanism with XGBoost improved ensemble model for identifying 5-hydroxymethylcytosine modifications.
基于残基的混合序贯编码机制与 XGBoost 改进的集成模型用于识别 5-羟甲基胞嘧啶修饰。
Sci Rep. 2024 Sep 6;14(1):20819. doi: 10.1038/s41598-024-71568-z.
4
PSSM-Sumo: deep learning based intelligent model for prediction of sumoylation sites using discriminative features.PSSM-Sumo:基于深度学习的智能模型,用于使用判别特征预测类泛素化位点。
BMC Bioinformatics. 2024 Aug 30;25(1):284. doi: 10.1186/s12859-024-05917-0.
5
StackedEnC-AOP: prediction of antioxidant proteins using transform evolutionary and sequential features based multi-scale vector with stacked ensemble learning.StackedEnC-AOP:基于多尺度向量的转换进化和序列特征与堆叠集成学习预测抗氧化蛋白。
BMC Bioinformatics. 2024 Aug 4;25(1):256. doi: 10.1186/s12859-024-05884-6.
6
DeepAVP-TPPred: identification of antiviral peptides using transformed image-based localized descriptors and binary tree growth algorithm.DeepAVP-TPPred:使用变换图像的局部描述符和二叉树生长算法鉴定抗病毒肽。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae305.
7
iAFPs-Mv-BiTCN: Predicting antifungal peptides using self-attention transformer embedding and transform evolutionary based multi-view features with bidirectional temporal convolutional networks.iAFPs-Mv-BiTCN:使用自注意力转换器嵌入和基于进化的多视图特征与双向时间卷积网络预测抗真菌肽。
Artif Intell Med. 2024 May;151:102860. doi: 10.1016/j.artmed.2024.102860. Epub 2024 Mar 26.
8
MA-PEP: A novel anticancer peptide prediction framework with multimodal feature fusion based on attention mechanism.MA-PEP:一种基于注意力机制的新型多模态特征融合抗癌肽预测框架。
Protein Sci. 2024 Apr;33(4):e4966. doi: 10.1002/pro.4966.
9
Deepstacked-AVPs: predicting antiviral peptides using tri-segment evolutionary profile and word embedding based multi-perspective features with deep stacking model.深度堆叠 AVPs:使用三片段进化特征和基于单词嵌入的多视角特征与深度堆叠模型预测抗病毒肽。
BMC Bioinformatics. 2024 Mar 7;25(1):102. doi: 10.1186/s12859-024-05726-5.
10
ACP-ML: A sequence-based method for anticancer peptide prediction.ACP-ML:一种基于序列的抗癌肽预测方法。
Comput Biol Med. 2024 Mar;170:108063. doi: 10.1016/j.compbiomed.2024.108063. Epub 2024 Jan 28.