• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SkipCPP-Pred:一种改进的、有前途的基于序列的细胞穿透肽预测器。

SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.

机构信息

School of Computer Science and Technology, Tianjin University, Tianjin, 30050, China.

State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300074, China.

出版信息

BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.

DOI:10.1186/s12864-017-4128-1
PMID:29513192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5657092/
Abstract

BACKGROUND

Cell-penetrating peptides (CPPs) are short peptides (5-30 amino acids) that can enter almost any cell without significant damage. On account of their high delivery efficiency, CPPs are promising candidates for gene therapy and cancer treatment. Accordingly, techniques that correctly predict CPPs are anticipated to accelerate CPP applications in future therapeutics. Recently, computational methods have been reportedly successful in predicting CPPs. Unfortunately, the predictive performance of existing methods is not satisfactory and reliable so as to accurately identify CPPs.

RESULTS

In this study, we propose a novel computational predictor called SkipCPP-Pred to further improve the predictive performance. The novelty of the proposed predictor is that we present a sequence-based feature representation algorithm called adaptive k-skip-n-gram that sufficiently captures the intrinsic correlation information of residues. By fusing the proposed adaptive skip features with a random forest (RF) classifier, we successfully construct the prediction model of SkipCPP-Pred. The various jackknife results demonstrate that the proposed SkipCPP-Pred is 3.6% higher than state-of-the-art CPP predictors in terms of accuracy. Moreover, we construct a high-quality benchmark dataset by reducing the data redundancy and enhancing the similarity between the positive and negative classes. Using this dataset to build prediction models, we can successfully avoid the performance bias lying in existing methods and yield a promising predictive model.

CONCLUSIONS

The proposed SkipCPP-Pred is a simple and fast sequence-based predictor featured with the adaptive k-skip-n-gram model for the improved prediction of CPPs. Currently, SkipCPP-Pred is publicly available from an online webserver ( http://server.malab.cn/SkipCPP-Pred/Index.html ).

摘要

背景

细胞穿透肽(CPPs)是可以进入几乎任何细胞而不造成显著损伤的短肽(5-30 个氨基酸)。由于其高效的传递效率,CPPs 是基因治疗和癌症治疗的有前途的候选者。因此,正确预测 CPPs 的技术有望加速 CPP 在未来治疗中的应用。最近,据报道,计算方法在预测 CPPs 方面取得了成功。不幸的是,现有方法的预测性能并不令人满意和可靠,无法准确识别 CPPs。

结果

在这项研究中,我们提出了一种名为 SkipCPP-Pred 的新型计算预测器,以进一步提高预测性能。所提出的预测器的新颖之处在于,我们提出了一种称为自适应 k-跳 n-gram 的基于序列的特征表示算法,该算法充分捕获了残基的内在相关信息。通过将所提出的自适应跳过特征与随机森林(RF)分类器融合,我们成功构建了 SkipCPP-Pred 的预测模型。各种折刀结果表明,在所提出的预测器在准确性方面比最先进的 CPP 预测器高出 3.6%。此外,我们通过减少数据冗余和增强正负类之间的相似性来构建一个高质量的基准数据集。使用该数据集构建预测模型,可以成功避免现有方法中的性能偏差,并产生有前途的预测模型。

结论

所提出的 SkipCPP-Pred 是一种简单快速的基于序列的预测器,具有自适应 k-跳 n-gram 模型,用于改进 CPP 的预测。目前,SkipCPP-Pred 可从在线网络服务器(http://server.malab.cn/SkipCPP-Pred/Index.html)获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/74ff3c453999/12864_2017_4128_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/4a72907d7a59/12864_2017_4128_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/b5704bf1971a/12864_2017_4128_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/3808487dfc79/12864_2017_4128_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/b3a4aad43865/12864_2017_4128_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/781e81771e91/12864_2017_4128_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/74ff3c453999/12864_2017_4128_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/4a72907d7a59/12864_2017_4128_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/b5704bf1971a/12864_2017_4128_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/3808487dfc79/12864_2017_4128_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/b3a4aad43865/12864_2017_4128_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/781e81771e91/12864_2017_4128_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1622/5657092/74ff3c453999/12864_2017_4128_Fig6_HTML.jpg

相似文献

1
SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.SkipCPP-Pred:一种改进的、有前途的基于序列的细胞穿透肽预测器。
BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.
2
CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency.CPPred-RF:一种基于序列的用于识别细胞穿透肽及其摄取效率的预测工具。
J Proteome Res. 2017 May 5;16(5):2044-2053. doi: 10.1021/acs.jproteome.7b00019. Epub 2017 Apr 26.
3
TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree.目标 CPP:使用梯度提升决策树从优化的多尺度特征中准确预测细胞穿透肽。
J Comput Aided Mol Des. 2020 Aug;34(8):841-856. doi: 10.1007/s10822-020-00307-z. Epub 2020 Mar 16.
4
KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides.KELM-CPPpred:基于核极限学习机的细胞穿透肽预测模型。
J Proteome Res. 2018 Sep 7;17(9):3214-3222. doi: 10.1021/acs.jproteome.8b00322. Epub 2018 Aug 13.
5
Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.基于机器学习的细胞穿透肽预测及其摄取效率的改进准确性。
J Proteome Res. 2018 Aug 3;17(8):2715-2726. doi: 10.1021/acs.jproteome.8b00148. Epub 2018 Jul 2.
6
A Novel Amino Acid Sequence-based Computational Approach to Predicting Cell-penetrating Peptides.一种基于氨基酸序列的预测细胞穿透肽的新型计算方法。
Curr Comput Aided Drug Des. 2019;15(3):206-211. doi: 10.2174/1573409914666180925100355.
7
DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.DeepCPPred:一种用于区分细胞穿透肽及其摄取效率的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2749-2759. doi: 10.1109/TCBB.2021.3102133. Epub 2022 Oct 10.
8
Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.基于网络的细胞穿透肽预测工具的实证比较和分析。
Brief Bioinform. 2020 Mar 23;21(2):408-420. doi: 10.1093/bib/bby124.
9
MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.MLCPP 2.0:更新的细胞穿透肽及其摄取效率预测器。
J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28.
10
StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency.StackCPPred:基于堆叠和成对能量含量的细胞穿透肽预测及其摄取效率。
Bioinformatics. 2020 May 1;36(10):3028-3034. doi: 10.1093/bioinformatics/btaa131.

引用本文的文献

1
AOPxSVM: A Support Vector Machine for Identifying Antioxidant Peptides Using a Block Substitution Matrix and Amino Acid Composition, Transformation, and Distribution Embeddings.AOPxSVM:一种使用块替换矩阵以及氨基酸组成、转化和分布嵌入来识别抗氧化肽的支持向量机。
Foods. 2025 Jun 6;14(12):2014. doi: 10.3390/foods14122014.
2
A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides.细胞穿透肽的生物学机制及机器学习预测方法概述。
Front Artif Intell. 2025 Jan 7;7:1497307. doi: 10.3389/frai.2024.1497307. eCollection 2024.
3
Exploring the Chemical Features and Biomedical Relevance of Cell-Penetrating Peptides.

本文引用的文献

1
Identifying N-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine.利用多区间核苷酸对位置特异性和支持向量机鉴定 N6-甲基腺苷位点。
Sci Rep. 2017 Apr 25;7:46757. doi: 10.1038/srep46757.
2
Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier.利用新型负样本、特征和集成分类器提高蛋白质-蛋白质相互作用预测。
Artif Intell Med. 2017 Nov;83:67-74. doi: 10.1016/j.artmed.2017.03.001. Epub 2017 Mar 4.
3
A novel hierarchical selective ensemble classifier with bioinformatics application.
探索细胞穿透肽的化学特性及生物医学相关性。
Int J Mol Sci. 2024 Dec 25;26(1):59. doi: 10.3390/ijms26010059.
4
EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。
Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.
5
Advancements in mitochondrial-targeted nanotherapeutics: overcoming biological obstacles and optimizing drug delivery.线粒体靶向纳米治疗学的进展:克服生物学障碍和优化药物传递。
Front Immunol. 2024 Oct 17;15:1451989. doi: 10.3389/fimmu.2024.1451989. eCollection 2024.
6
A hybrid residue based sequential encoding mechanism with XGBoost improved ensemble model for identifying 5-hydroxymethylcytosine modifications.基于残基的混合序贯编码机制与 XGBoost 改进的集成模型用于识别 5-羟甲基胞嘧啶修饰。
Sci Rep. 2024 Sep 6;14(1):20819. doi: 10.1038/s41598-024-71568-z.
7
deepAMPNet: a novel antimicrobial peptide predictor employing AlphaFold2 predicted structures and a bi-directional long short-term memory protein language model.深度 AMP 网络:一种新颖的抗菌肽预测器,采用 AlphaFold2 预测结构和双向长短期记忆蛋白质语言模型。
PeerJ. 2024 Jul 19;12:e17729. doi: 10.7717/peerj.17729. eCollection 2024.
8
TriplEP-CPP: Algorithm for Predicting the Properties of Peptide Sequences.TriplEP-CPP:预测肽序列性质的算法。
Int J Mol Sci. 2024 Jun 22;25(13):6869. doi: 10.3390/ijms25136869.
9
Investigating molecular descriptors in cell-penetrating peptides prediction with deep learning: Employing N, O, and hydrophobicity according to the Eisenberg scale.利用深度学习研究细胞穿透肽预测中的分子描述符:根据艾森伯格标度采用氮、氧和疏水性。
PLoS One. 2024 Jun 13;19(6):e0305253. doi: 10.1371/journal.pone.0305253. eCollection 2024.
10
Interpretable molecular encodings and representations for machine learning tasks.用于机器学习任务的可解释分子编码和表示。
Comput Struct Biotechnol J. 2024 May 24;23:2326-2336. doi: 10.1016/j.csbj.2024.05.035. eCollection 2024 Dec.
一种具有生物信息学应用的新型分层选择性集成分类器。
Artif Intell Med. 2017 Nov;83:82-90. doi: 10.1016/j.artmed.2017.02.005. Epub 2017 Feb 27.
4
Fast Prediction of Protein Methylation Sites Using a Sequence-Based Feature Selection Technique.基于序列特征选择技术的蛋白质甲基化位点快速预测。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1264-1273. doi: 10.1109/TCBB.2017.2670558. Epub 2017 Feb 16.
5
PhosPred-RF: A Novel Sequence-Based Predictor for Phosphorylation Sites Using Sequential Information Only.PhosPred-RF:一种仅使用序列信息的基于序列的磷酸化位点新型预测工具。
IEEE Trans Nanobioscience. 2017 Jun;16(4):240-247. doi: 10.1109/TNB.2017.2661756. Epub 2017 Jan 31.
6
Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition.基于机器学习的蛋白质折叠识别方法的最新进展
Int J Mol Sci. 2016 Dec 16;17(12):2118. doi: 10.3390/ijms17122118.
7
Detection of tubule boundaries based on circular shortest path and polar-transformation of arbitrary shapes.基于圆形最短路径和任意形状极坐标变换的肾小管边界检测
J Microsc. 2016 Nov;264(2):127-142. doi: 10.1111/jmi.12421. Epub 2016 May 12.
8
iMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions.iMiRNA-SSF:通过结合不同分布的负集改进微小RNA前体的识别
Sci Rep. 2016 Jan 12;6:19062. doi: 10.1038/srep19062.
9
Predicting cancerlectins by the optimal g-gap dipeptides.通过最优g-间隙二肽预测癌凝集素
Sci Rep. 2015 Dec 9;5:16964. doi: 10.1038/srep16964.
10
On the Universality and Non-Universality of Spiking Neural P Systems With Rules on Synapses.关于具有突触规则的脉冲神经P系统的通用性和非通用性
IEEE Trans Nanobioscience. 2015 Dec;14(8):960-6. doi: 10.1109/TNB.2015.2503603. Epub 2015 Nov 25.