• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多尺度注意力架构学习嵌入特征,以提高抗癌肽的预测性能。

Learning embedding features based on multisense-scaled attention architecture to improve the predictive performance of anticancer peptides.

机构信息

School of Software, Shandong University, Jinan, China.

Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.

出版信息

Bioinformatics. 2021 Dec 11;37(24):4684-4693. doi: 10.1093/bioinformatics/btab560.

DOI:10.1093/bioinformatics/btab560
PMID:34323948
Abstract

MOTIVATION

Anticancer peptides (ACPs) have recently emerged as effective anticancer drugs in cancer therapy. Machine learning-based predictors have been developed to identify ACPs and achieve satisfactory performance. However, existing methods suffer from experience-based feature engineering, which not only restricts the representation ability of the models to a certain extent but also lacks adaptivity for different data, limiting the further improvement of the predictive performance and impacting the robustness of the predictive models. To alleviate the above problems, we propose a novel deep-learning-based predictor named ACPred-LAF, in which we propose a novel multisense and multiscaled embedding algorithm to automatically learn and extract context sequential characteristics of ACPs.

RESULTS

Through the feature comparative analysis, we demonstrate that our learnable and self-adaptive embedding features are better than hand-crafted features in capturing discriminative information, which can effectively benefit the performance improvement for ACP prediction. In addition, benchmarking comparison results demonstrate that our ACPred-LAF outperforms the state-of-the-art methods both on existing benchmark datasets and our newly constructed dataset. Furthermore, we also prove and validate the robustness of the model via the data interference experiment. To avoid potential evaluation bias, here, we construct a new ACP benchmark dataset named ACP-Mixed by integrating existing datasets. We expect our newly constructed dataset to be a golden standard benchmark dataset in this field. To facilitate the use of our model, we develop a web server as the implementation of ACPred-LAF.

AVAILABILITY AND IMPLEMENTATION

Our proposed ACPred-LAF, newly constructed benchmark dataset ACP-Mixed are open source collaborative initiatives available in the GitHub repository (https://github.com/TearsWaiting/ACPred-LAF). Besides, a webserver as the implementation of ACPred-LAF that can be accessed via: http://server.malab.cn/ACPred-LAF.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

抗癌肽 (ACPs) 最近作为癌症治疗中的有效抗癌药物出现。已经开发了基于机器学习的预测器来识别 ACP 并取得令人满意的性能。然而,现有的方法存在基于经验的特征工程问题,这不仅在一定程度上限制了模型的表示能力,而且缺乏对不同数据的适应性,限制了预测性能的进一步提高,并影响了预测模型的稳健性。为了解决上述问题,我们提出了一种名为 ACPred-LAF 的新型基于深度学习的预测器,其中我们提出了一种新的多义多尺度嵌入算法,用于自动学习和提取 ACP 的上下文序列特征。

结果

通过特征比较分析,我们证明了我们可学习的自适应嵌入特征在捕获判别信息方面优于手工制作的特征,这可以有效地提高 ACP 预测的性能。此外,基准比较结果表明,我们的 ACPred-LAF 在现有的基准数据集和我们新构建的数据集上均优于最先进的方法。此外,我们还通过数据干扰实验证明和验证了模型的稳健性。为避免潜在的评估偏差,我们在此构建了一个新的 ACP 基准数据集 ACP-Mixed,通过整合现有数据集。我们希望我们新构建的数据集成为该领域的黄金标准基准数据集。为了方便模型的使用,我们开发了一个作为 ACPred-LAF 实现的网络服务器。

可用性和实现

我们提出的 ACPred-LAF 和新构建的基准数据集 ACP-Mixed 是开源的协作倡议,可在 GitHub 存储库(https://github.com/TearsWaiting/ACPred-LAF)中获得。此外,还可以通过:http://server.malab.cn/ACPred-LAF 访问作为 ACPred-LAF 实现的网络服务器。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Learning embedding features based on multisense-scaled attention architecture to improve the predictive performance of anticancer peptides.基于多尺度注意力架构学习嵌入特征,以提高抗癌肽的预测性能。
Bioinformatics. 2021 Dec 11;37(24):4684-4693. doi: 10.1093/bioinformatics/btab560.
2
ACPred-Fuse: fusing multi-view information improves the prediction of anticancer peptides.ACPred-Fuse:融合多视图信息可改善抗癌肽的预测。
Brief Bioinform. 2020 Sep 25;21(5):1846-1855. doi: 10.1093/bib/bbz088.
3
ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides.ACPred-FL:一种基于序列的预测器,使用有效的特征表示来提高抗癌肽的预测能力。
Bioinformatics. 2018 Dec 1;34(23):4007-4016. doi: 10.1093/bioinformatics/bty451.
4
ACPred: A Computational Tool for the Prediction and Analysis of Anticancer Peptides.ACPred:一种用于抗癌肽预测和分析的计算工具。
Molecules. 2019 May 22;24(10):1973. doi: 10.3390/molecules24101973.
5
Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.大规模比较综述与评估抗癌肽鉴定的计算方法。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa312.
6
ACPred-BMF: bidirectional LSTM with multiple feature representations for explainable anticancer peptide prediction.ACPred-BMF:用于可解释抗癌肽预测的具有多种特征表示的双向 LSTM。
Sci Rep. 2022 Dec 19;12(1):21915. doi: 10.1038/s41598-022-24404-1.
7
ACP-PDAFF: Pretrained model and dual-channel attentional feature fusion for anticancer peptides prediction.ACP-PDAFF:用于抗癌肽预测的预训练模型和双通道注意力特征融合。
Comput Biol Chem. 2024 Oct;112:108141. doi: 10.1016/j.compbiolchem.2024.108141. Epub 2024 Jul 3.
8
CAPTURE: Comprehensive anti-cancer peptide predictor with a unique amino acid sequence encoder.CAPTURE:具有独特氨基酸序列编码器的综合抗癌肽预测器。
Comput Biol Med. 2024 Jun;176:108538. doi: 10.1016/j.compbiomed.2024.108538. Epub 2024 May 3.
9
PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.基于蛋白质语言模型和小波去噪变换的抗癌肽 PLMACPred 预测。
Sci Rep. 2024 Jul 23;14(1):16992. doi: 10.1038/s41598-024-67433-8.
10
ACP-MHCNN: an accurate multi-headed deep-convolutional neural network to predict anticancer peptides.ACP-MHCNN:一种准确的多头深度卷积神经网络,用于预测抗癌肽。
Sci Rep. 2021 Dec 8;11(1):23676. doi: 10.1038/s41598-021-02703-3.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
pACP-HybDeep: predicting anticancer peptides using binary tree growth based transformer and structural feature encoding with deep-hybrid learning.pACP-HybDeep:基于二叉树生长的变压器和深度混合学习的结构特征编码预测抗癌肽
Sci Rep. 2025 Jan 2;15(1):565. doi: 10.1038/s41598-024-84146-0.
3
Contrastive learning for enhancing feature extraction in anticancer peptides.
基于对比学习的抗癌肽特征提取增强方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae220.
4
MA-PEP: A novel anticancer peptide prediction framework with multimodal feature fusion based on attention mechanism.MA-PEP:一种基于注意力机制的新型多模态特征融合抗癌肽预测框架。
Protein Sci. 2024 Apr;33(4):e4966. doi: 10.1002/pro.4966.
5
MHCSeqNet2-improved peptide-class I MHC binding prediction for alleles with low data.MHCSeqNet2 改进了低数据等位基因的肽类 I MHC 结合预测。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btad780.
6
TMSC-m7G: A transformer architecture based on multi-sense-scaled embedding features and convolutional neural network to identify RNA N7-methylguanosine sites.TMSC-m7G:一种基于多感官尺度嵌入特征和卷积神经网络的变压器架构,用于识别RNA N7-甲基鸟苷位点。
Comput Struct Biotechnol J. 2023 Dec 1;23:129-139. doi: 10.1016/j.csbj.2023.11.052. eCollection 2024 Dec.
7
TPpred-LE: therapeutic peptide function prediction based on label embedding.TPpred-LE:基于标签嵌入的治疗性肽功能预测。
BMC Biol. 2023 Oct 31;21(1):238. doi: 10.1186/s12915-023-01740-w.
8
Explainable Deep Hypergraph Learning Modeling the Peptide Secondary Structure Prediction.可解释的深度超图学习模型在肽二级结构预测中的应用。
Adv Sci (Weinh). 2023 Apr;10(11):e2206151. doi: 10.1002/advs.202206151. Epub 2023 Feb 15.
9
Anti-Cancer Peptides: Status and Future Prospects.抗癌肽:现状与未来展望。
Molecules. 2023 Jan 23;28(3):1148. doi: 10.3390/molecules28031148.
10
ACPred-BMF: bidirectional LSTM with multiple feature representations for explainable anticancer peptide prediction.ACPred-BMF:用于可解释抗癌肽预测的具有多种特征表示的双向 LSTM。
Sci Rep. 2022 Dec 19;12(1):21915. doi: 10.1038/s41598-022-24404-1.