StackCPPred：基于堆叠和成对能量含量的细胞穿透肽预测及其摄取效率。

StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency.

机构信息

College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China.

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China.

出版信息

Bioinformatics. 2020 May 1;36(10):3028-3034. doi: 10.1093/bioinformatics/btaa131.

DOI:10.1093/bioinformatics/btaa131

PMID:32105326

Abstract

MOTIVATION

Cell-penetrating peptides (CPPs) are a vehicle for transporting into living cells pharmacologically active molecules, such as short interfering RNAs, nanoparticles, plasmid DNAs and small peptides, thus offering great potential as future therapeutics. Existing experimental techniques for identifying CPPs are time-consuming and expensive. Thus, the prediction of CPPs from peptide sequences by using computational methods can be useful to annotate and guide the experimental process quickly. Many machine learning-based methods have recently emerged for identifying CPPs. Although considerable progress has been made, existing methods still have low feature representation capabilities, thereby limiting further performance improvements.

RESULTS

We propose a method called StackCPPred, which proposes three feature methods on the basis of the pairwise energy content of the residue as follows: RECM-composition, PseRECM and RECM-DWT. These features are used to train stacking-based machine learning methods to effectively predict CPPs. On the basis of the CPP924 and CPPsite3 datasets with jackknife validation, StackDPPred achieved 94.5% and 78.3% accuracy, which was 2.9% and 5.8% higher than the state-of-the-art CPP predictors, respectively. StackCPPred can be a powerful tool for predicting CPPs and their uptake efficiency, facilitating hypothesis-driven experimental design and accelerating their applications in clinical therapy.

AVAILABILITY AND IMPLEMENTATION

Source code and data can be downloaded from https://github.com/Excelsior511/StackCPPred.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

细胞穿透肽（CPPs）是一种将药理活性分子（如短干扰 RNA、纳米颗粒、质粒 DNA 和小肽）输送到活细胞中的载体，因此具有成为未来治疗剂的巨大潜力。现有的鉴定 CPPs 的实验技术既耗时又昂贵。因此，通过计算方法从肽序列预测 CPPs 可以快速注释和指导实验过程。最近出现了许多基于机器学习的方法来识别 CPPs。尽管已经取得了相当大的进展，但现有的方法仍然具有较低的特征表示能力，从而限制了进一步的性能提高。

结果

我们提出了一种名为 StackCPPred 的方法，该方法基于残基的成对能量含量提出了三种特征方法，即 RECM-组成、PseRECM 和 RECM-DWT。这些特征用于训练基于堆叠的机器学习方法，以有效地预测 CPPs。在使用 jackknife 验证的 CPP924 和 CPPsite3 数据集上，StackDPPred 实现了 94.5%和 78.3%的准确率，分别比最先进的 CPP 预测器高 2.9%和 5.8%。StackCPPred 可以成为预测 CPPs 及其摄取效率的有力工具，有助于驱动假设的实验设计并加速其在临床治疗中的应用。

可用性和实现

源代码和数据可从 https://github.com/Excelsior511/StackCPPred 下载。

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency.StackCPPred：基于堆叠和成对能量含量的细胞穿透肽预测及其摄取效率。

Bioinformatics. 2020 May 1;36(10):3028-3034. doi: 10.1093/bioinformatics/btaa131.

KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides.KELM-CPPpred：基于核极限学习机的细胞穿透肽预测模型。

J Proteome Res. 2018 Sep 7;17(9):3214-3222. doi: 10.1021/acs.jproteome.8b00322. Epub 2018 Aug 13.

DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.DeepCPPred：一种用于区分细胞穿透肽及其摄取效率的深度学习框架。

IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2749-2759. doi: 10.1109/TCBB.2021.3102133. Epub 2022 Oct 10.

Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.基于机器学习的细胞穿透肽预测及其摄取效率的改进准确性。

J Proteome Res. 2018 Aug 3;17(8):2715-2726. doi: 10.1021/acs.jproteome.8b00148. Epub 2018 Jul 2.

StackDPPred: a stacking based prediction of DNA-binding protein from sequence.StackDPPred：一种基于堆叠的 DNA 结合蛋白序列预测方法。

Bioinformatics. 2019 Feb 1;35(3):433-441. doi: 10.1093/bioinformatics/bty653.

MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.MLCPP 2.0：更新的细胞穿透肽及其摄取效率预测器。

J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28.

CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency.CPPred-RF：一种基于序列的用于识别细胞穿透肽及其摄取效率的预测工具。

J Proteome Res. 2017 May 5;16(5):2044-2053. doi: 10.1021/acs.jproteome.7b00019. Epub 2017 Apr 26.

TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree.目标 CPP：使用梯度提升决策树从优化的多尺度特征中准确预测细胞穿透肽。

J Comput Aided Mol Des. 2020 Aug;34(8):841-856. doi: 10.1007/s10822-020-00307-z. Epub 2020 Mar 16.

The Development of Machine Learning Methods in Cell-Penetrating Peptides Identification: A Brief Review.机器学习方法在细胞穿透肽鉴定中的发展：简要综述。

Curr Drug Metab. 2019;20(3):217-223. doi: 10.2174/1389200219666181010114750.

SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.SkipCPP-Pred：一种改进的、有前途的基于序列的细胞穿透肽预测器。

BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.

引用本文的文献

TAL-SRX: an intelligent typing evaluation method for KASP primers based on multi-model fusion.TAL-SRX：一种基于多模型融合的KASP引物智能分型评估方法。

Front Plant Sci. 2025 Feb 18;16:1539068. doi: 10.3389/fpls.2025.1539068. eCollection 2025.

Machine learning for antimicrobial peptide identification and design.用于抗菌肽鉴定与设计的机器学习

Nat Rev Bioeng. 2024 May;2(5):392-407. doi: 10.1038/s44222-024-00152-x. Epub 2024 Feb 26.

A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides.细胞穿透肽的生物学机制及机器学习预测方法概述。

Front Artif Intell. 2025 Jan 7;7:1497307. doi: 10.3389/frai.2024.1497307. eCollection 2024.

Exploring the Chemical Features and Biomedical Relevance of Cell-Penetrating Peptides.探索细胞穿透肽的化学特性及生物医学相关性。

Int J Mol Sci. 2024 Dec 25;26(1):59. doi: 10.3390/ijms26010059.

EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP：一种基于深度学习和机器学习的多视图可解释框架，用于使用Transformer识别细胞穿透肽并分析序列信息。

Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.

Biological Sequence Classification: A Review on Data and General Methods.生物序列分类：数据与通用方法综述

Research (Wash D C). 2022 Dec 19;2022:0011. doi: 10.34133/research.0011. eCollection 2022.

MuCoCP: a priori chemical knowledge-based multimodal contrastive learning pre-trained neural network for the prediction of cyclic peptide membrane penetration ability.MuCoCP：基于先验化学知识的多模态对比学习预训练神经网络，用于预测环状肽的膜穿透能力。

Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae473.

Investigating molecular descriptors in cell-penetrating peptides prediction with deep learning: Employing N, O, and hydrophobicity according to the Eisenberg scale.利用深度学习研究细胞穿透肽预测中的分子描述符：根据艾森伯格标度采用氮、氧和疏水性。

PLoS One. 2024 Jun 13;19(6):e0305253. doi: 10.1371/journal.pone.0305253. eCollection 2024.

Artificial Intelligence and Computational Biology in Gene Therapy: A Review.基因治疗中的人工智能与计算生物学：综述

Biochem Genet. 2025 Apr;63(2):960-983. doi: 10.1007/s10528-024-10799-1. Epub 2024 Apr 18.

POSEIDON: Peptidic Objects SEquence-based Interaction with cellular DOmaiNs: a new database and predictor.波塞冬：基于肽段对象序列与细胞结构域的相互作用：一个新的数据库和预测工具。

J Cheminform. 2024 Feb 16;16(1):18. doi: 10.1186/s13321-024-00810-7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

StackCPPred：基于堆叠和成对能量含量的细胞穿透肽预测及其摄取效率。

StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献