• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

目标 CPP:使用梯度提升决策树从优化的多尺度特征中准确预测细胞穿透肽。

TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree.

机构信息

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China.

出版信息

J Comput Aided Mol Des. 2020 Aug;34(8):841-856. doi: 10.1007/s10822-020-00307-z. Epub 2020 Mar 16.

DOI:10.1007/s10822-020-00307-z
PMID:32180124
Abstract

Cell-penetrating peptides (CPPs) are short length permeable proteins have emerged as drugs delivery tool of therapeutic agents including genetic materials and macromolecules into cells. Recently, CPP has become a hotspot avenue for life science research and paved a new way of disease treatment without harmful impact on cell viability due to nontoxic characteristic. Therefore, the correct identification of CPPs will provide hints for medical applications. Considering the shortcomings of traditional experimental CPPs identification, it is urgently needed to design intelligent predictor for accurate identification of CPPs for the large scale uncharacterized sequences. We develop a novel computational method, called TargetCPP, to discriminate CPPs from Non-CPPs with improved accuracy. In TargetCPP, first the peptide sequences are formulated with four distinct encoding methods i.e., composite protein sequence representation, composition transition and distribution, split amino acid composition, and information theory features. These dominant feature vectors were fused and applied intelligent minimum redundancy and maximum relevancy feature selection method to choose an optimal subset of features. Finally, the predictive model is learned through different classification algorithms on the optimized features. Among these classifiers, gradient boost decision tree algorithm achieved excellent performance throughout the experiments. Notably, the TargetCPP tool attained high prediction Accuracy of 93.54% and 88.28% using jackknife and independent test, respectively. Empirical outcomes prove the superiority and potency of proposed bioinformatics method over state-of-the-art methods. It is highly anticipated that the outcomes of this study will provide a strong background for large scale prediction of CPPs and instructive guidance in clinical therapy and medical applications.

摘要

细胞穿透肽(CPPs)是一种短长度的可渗透蛋白质,已成为将治疗剂(包括遗传物质和大分子)递送到细胞内的药物输送工具。最近,CPP 已成为生命科学研究的热点领域,由于其无毒特性,为疾病治疗开辟了新途径,而不会对细胞活力产生有害影响。因此,正确识别 CPP 将为医学应用提供线索。考虑到传统实验 CPP 识别的缺点,迫切需要设计智能预测器,以便对大规模未表征的序列进行准确的 CPP 识别。我们开发了一种新的计算方法,称为 TargetCPP,可提高准确性来区分 CPP 和非 CPP。在 TargetCPP 中,首先使用四种不同的编码方法对肽序列进行了描述,即复合蛋白质序列表示、组成转换和分布、分裂氨基酸组成和信息理论特征。将这些主要特征向量融合,并应用智能最小冗余和最大相关性特征选择方法来选择最佳特征子集。最后,通过不同的分类算法在优化的特征上学习预测模型。在这些分类器中,梯度提升决策树算法在整个实验中表现出优异的性能。值得注意的是,TargetCPP 工具在 jackknife 和独立测试中分别实现了 93.54%和 88.28%的高预测精度。实验结果证明了所提出的生物信息学方法优于最先进方法的优越性和有效性。预计这项研究的结果将为 CPP 的大规模预测提供强有力的背景,并为临床治疗和医学应用提供有指导意义的建议。

相似文献

1
TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree.目标 CPP:使用梯度提升决策树从优化的多尺度特征中准确预测细胞穿透肽。
J Comput Aided Mol Des. 2020 Aug;34(8):841-856. doi: 10.1007/s10822-020-00307-z. Epub 2020 Mar 16.
2
SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.SkipCPP-Pred:一种改进的、有前途的基于序列的细胞穿透肽预测器。
BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.
3
Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.基于机器学习的细胞穿透肽预测及其摄取效率的改进准确性。
J Proteome Res. 2018 Aug 3;17(8):2715-2726. doi: 10.1021/acs.jproteome.8b00148. Epub 2018 Jul 2.
4
KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides.KELM-CPPpred:基于核极限学习机的细胞穿透肽预测模型。
J Proteome Res. 2018 Sep 7;17(9):3214-3222. doi: 10.1021/acs.jproteome.8b00322. Epub 2018 Aug 13.
5
DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.DeepCPPred:一种用于区分细胞穿透肽及其摄取效率的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2749-2759. doi: 10.1109/TCBB.2021.3102133. Epub 2022 Oct 10.
6
G-DipC: An Improved Feature Representation Method for Short Sequences to Predict the Type of Cargo in Cell-Penetrating Peptides.G-DipC:一种改进的短序列特征表示方法,用于预测细胞穿透肽中货物的类型。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):739-747. doi: 10.1109/TCBB.2019.2930993. Epub 2019 Jul 25.
7
Prediction of cell penetrating peptides by support vector machines.基于支持向量机的细胞穿透肽预测。
PLoS Comput Biol. 2011 Jul;7(7):e1002101. doi: 10.1371/journal.pcbi.1002101. Epub 2011 Jul 14.
8
A Novel Amino Acid Sequence-based Computational Approach to Predicting Cell-penetrating Peptides.一种基于氨基酸序列的预测细胞穿透肽的新型计算方法。
Curr Comput Aided Drug Des. 2019;15(3):206-211. doi: 10.2174/1573409914666180925100355.
9
Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.基于网络的细胞穿透肽预测工具的实证比较和分析。
Brief Bioinform. 2020 Mar 23;21(2):408-420. doi: 10.1093/bib/bby124.
10
MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.MLCPP 2.0:更新的细胞穿透肽及其摄取效率预测器。
J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28.

引用本文的文献

1
Improved in Silico Identification of Protein-Protein Interactions Using Deep Learning Approach.使用深度学习方法改进蛋白质-蛋白质相互作用的计算机模拟识别
IET Syst Biol. 2025 Jan-Dec;19(1):e70008. doi: 10.1049/syb2.70008.
2
A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides.细胞穿透肽的生物学机制及机器学习预测方法概述。
Front Artif Intell. 2025 Jan 7;7:1497307. doi: 10.3389/frai.2024.1497307. eCollection 2024.
3
Exploring the Chemical Features and Biomedical Relevance of Cell-Penetrating Peptides.
探索细胞穿透肽的化学特性及生物医学相关性。
Int J Mol Sci. 2024 Dec 25;26(1):59. doi: 10.3390/ijms26010059.
4
EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。
Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.
5
Deep-GB: A novel deep learning model for globular protein prediction using CNN-BiLSTM architecture and enhanced PSSM with trisection strategy.深度GB:一种使用CNN-BiLSTM架构和采用三等分策略增强的PSSM进行球状蛋白质预测的新型深度学习模型。
IET Syst Biol. 2024 Dec;18(6):208-217. doi: 10.1049/syb2.12108. Epub 2024 Nov 8.
6
Biological Sequence Classification: A Review on Data and General Methods.生物序列分类:数据与通用方法综述
Research (Wash D C). 2022 Dec 19;2022:0011. doi: 10.34133/research.0011. eCollection 2022.
7
PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.基于蛋白质语言模型和小波去噪变换的抗癌肽 PLMACPred 预测。
Sci Rep. 2024 Jul 23;14(1):16992. doi: 10.1038/s41598-024-67433-8.
8
Investigating molecular descriptors in cell-penetrating peptides prediction with deep learning: Employing N, O, and hydrophobicity according to the Eisenberg scale.利用深度学习研究细胞穿透肽预测中的分子描述符:根据艾森伯格标度采用氮、氧和疏水性。
PLoS One. 2024 Jun 13;19(6):e0305253. doi: 10.1371/journal.pone.0305253. eCollection 2024.
9
DPI_CDF: druggable protein identifier using cascade deep forest.DPI_CDF:基于级联深度森林的可成药性蛋白识别方法。
BMC Bioinformatics. 2024 Apr 5;25(1):145. doi: 10.1186/s12859-024-05744-3.
10
PractiCPP: a deep learning approach tailored for extremely imbalanced datasets in cell-penetrating peptide prediction.PractiCPP:一种针对细胞穿透肽预测中极度不平衡数据集的深度学习方法。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae058.