穿膜肽预测器：方法和数据集的比较分析。

Cell-penetrating peptides predictors: A comparative analysis of methods and datasets.

机构信息

Department of Computer Science, CICESE Research Center, Ensenada, 22860, Mexico.

Current address: School of Mathematics & Statistical Sciences, University of Galway, Galway, H91 TK33, Ireland.

出版信息

Mol Inform. 2023 Nov;42(11):e202300104. doi: 10.1002/minf.202300104. Epub 2023 Sep 6.

DOI:10.1002/minf.202300104

PMID:37672879

Abstract

Cell-Penetrating Peptides (CPP) are emerging as an alternative to small-molecule drugs to expand the range of biomolecules that can be targeted for therapeutic purposes. Due to the importance of identifying and designing new CPP, a great variety of predictors have been developed to achieve these goals. To establish a ranking for these predictors, a couple of recent studies compared their performances on specific datasets, yet their conclusions cannot determine if the ranking obtained is due to the model, the set of descriptors or the datasets used to test the predictors. We present a systematic study of the influence of the peptide sequence's similarity of the datasets on the predictors' performance. The analysis reveals that the datasets used for training have a stronger influence on the predictors performance than the model or descriptors employed. We show that datasets with low sequence similarity between the positive and negative examples can be easily separated, and the tested classifiers showed good performance on them. On the other hand, a dataset with high sequence similarity between CPP and non-CPP will be a hard dataset, and it should be the one to be used for assessing the performance of new predictors.

摘要

细胞穿透肽 (CPP) 作为小分子药物的替代品正在兴起，以扩大可用于治疗目的的生物分子范围。由于确定和设计新的 CPP 的重要性，已经开发了各种各样的预测器来实现这些目标。为了对这些预测器进行排名，最近的几项研究比较了它们在特定数据集上的性能，但它们的结论并不能确定所获得的排名是由于模型、描述符集还是用于测试预测器的数据集。我们对数据集的肽序列相似性对预测器性能的影响进行了系统研究。分析表明，用于训练的数据集对预测器性能的影响比所使用的模型或描述符更强。我们表明，阳性和阴性示例之间序列相似性低的数据集可以很容易地分离，并且测试的分类器在这些数据集上表现良好。另一方面，CPP 和非 CPP 之间序列相似性高的数据集将是一个困难的数据集，应该是用于评估新预测器性能的数据集。

相似文献

Cell-penetrating peptides predictors: A comparative analysis of methods and datasets.穿膜肽预测器：方法和数据集的比较分析。

Mol Inform. 2023 Nov;42(11):e202300104. doi: 10.1002/minf.202300104. Epub 2023 Sep 6.

Prediction of cell penetrating peptides by support vector machines.基于支持向量机的细胞穿透肽预测。

PLoS Comput Biol. 2011 Jul;7(7):e1002101. doi: 10.1371/journal.pcbi.1002101. Epub 2011 Jul 14.

SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.SkipCPP-Pred：一种改进的、有前途的基于序列的细胞穿透肽预测器。

BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.

Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space.使用机器学习算法预测细胞穿透肽并探索其化学空间。

Sci Rep. 2021 Apr 7;11(1):7628. doi: 10.1038/s41598-021-87134-w.

Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.基于机器学习的细胞穿透肽预测及其摄取效率的改进准确性。

J Proteome Res. 2018 Aug 3;17(8):2715-2726. doi: 10.1021/acs.jproteome.8b00148. Epub 2018 Jul 2.

The Development of Machine Learning Methods in Cell-Penetrating Peptides Identification: A Brief Review.机器学习方法在细胞穿透肽鉴定中的发展：简要综述。

Curr Drug Metab. 2019;20(3):217-223. doi: 10.2174/1389200219666181010114750.

MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.MLCPP 2.0：更新的细胞穿透肽及其摄取效率预测器。

J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28.

CPPpred: prediction of cell penetrating peptides.CPPpred：细胞穿透肽的预测。

Bioinformatics. 2013 Dec 1;29(23):3094-6. doi: 10.1093/bioinformatics/btt518. Epub 2013 Sep 23.

DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.DeepCPPred：一种用于区分细胞穿透肽及其摄取效率的深度学习框架。

IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2749-2759. doi: 10.1109/TCBB.2021.3102133. Epub 2022 Oct 10.

KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides.KELM-CPPpred：基于核极限学习机的细胞穿透肽预测模型。

J Proteome Res. 2018 Sep 7;17(9):3214-3222. doi: 10.1021/acs.jproteome.8b00322. Epub 2018 Aug 13.

引用本文的文献

A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides.细胞穿透肽的生物学机制及机器学习预测方法概述。

Front Artif Intell. 2025 Jan 7;7:1497307. doi: 10.3389/frai.2024.1497307. eCollection 2024.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

穿膜肽预测器：方法和数据集的比较分析。

Cell-penetrating peptides predictors: A comparative analysis of methods and datasets.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献