• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。

EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.

作者信息

Zhu Lun, Chen Zehua, Yang Sen

机构信息

School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou University, Changzhou, 213164, China.

The Affiliated Changzhou No. 2 People's Hospital of Nanjing Medical University, Changzhou, 213164, China.

出版信息

Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.

DOI:10.1007/s12539-024-00673-4
PMID:39714579
Abstract

Cell-Penetrating Peptides (CPPs) are a crucial carrier for drug delivery. Since the process of synthesizing new CPPs in the laboratory is both time- and resource-consuming, computational methods to predict potential CPPs can be used to find CPPs to enhance the development of CPPs in therapy. In this study, EnDM-CPP is proposed, which combines machine learning algorithms (SVM and CatBoost) with convolutional neural networks (CNN and TextCNN). For dataset construction, three previous CPP benchmark datasets, including CPPsite 2.0, MLCPP 2.0, and CPP924, are merged to improve the diversity and reduce homology. For feature generation, two language model-based features obtained from the Transformer architecture, including ProtT5 and ESM-2, are employed in CNN and TextCNN. Additionally, sequence features, such as CPRS, Hybrid PseAAC, KSC, etc., are input to SVM and CatBoost. Based on the result of each predictor, Logistic Regression (LR) is built to predict the final decision. The experiment results indicate that ProtT5 and ESM-2 fusion features significantly contribute to predicting CPP and that combining employed features and models demonstrates better association. On an independent test dataset comparison, EnDM-CPP achieved an accuracy of 0.9495 and a Matthews correlation coefficient of 0.9008 with an improvement of 2.23%-9.48% and 4.32%-19.02%, respectively, compared with other state-of-the-art methods. Code and data are available at https://github.com/tudou1231/EnDM-CPP.git .

摘要

细胞穿透肽(CPPs)是药物递送的关键载体。由于在实验室中合成新的CPPs的过程既耗时又耗资源,因此可使用预测潜在CPPs的计算方法来寻找CPPs,以促进CPPs在治疗中的发展。在本研究中,提出了EnDM-CPP,它将机器学习算法(支持向量机和CatBoost)与卷积神经网络(卷积神经网络和文本卷积神经网络)相结合。对于数据集构建,合并了三个先前的CPP基准数据集,包括CPPsite 2.0、MLCPP 2.0和CPP924,以提高多样性并减少同源性。对于特征生成,在卷积神经网络和文本卷积神经网络中采用了从Transformer架构获得的两种基于语言模型的特征,包括ProtT5和ESM-2。此外,将序列特征,如CPRS、混合伪氨基酸组成、KSC等,输入到支持向量机和CatBoost中。基于每个预测器的结果,构建逻辑回归(LR)来预测最终决策。实验结果表明,ProtT5和ESM-2融合特征对预测CPP有显著贡献,并且结合使用的特征和模型表现出更好的关联性。在独立测试数据集比较中,与其他现有方法相比,EnDM-CPP的准确率达到0.9495,马修斯相关系数达到0.9008,分别提高了2.23%-9.48%和4.32%-19.02%。代码和数据可在https://github.com/tudou1231/EnDM-CPP.git获取。

相似文献

1
EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information.EnDM-CPP:一种基于深度学习和机器学习的多视图可解释框架,用于使用Transformer识别细胞穿透肽并分析序列信息。
Interdiscip Sci. 2024 Dec 23. doi: 10.1007/s12539-024-00673-4.
2
CPPpred-En: Ensemble framework integrating a protein language model and conventional features for highly accurate cell-penetrating peptide prediction.CPPpred-En:集成蛋白质语言模型和传统特征以实现高精度细胞穿透肽预测的集成框架。
Comput Biol Med. 2025 Sep;195:110590. doi: 10.1016/j.compbiomed.2025.110590. Epub 2025 Jun 20.
3
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
4
Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning.皮肤 CAD:基于双高级 CNN 特征选择和迁移学习的皮肤镜图像皮肤癌可解释深度学习分类。
Comput Biol Med. 2024 Aug;178:108798. doi: 10.1016/j.compbiomed.2024.108798. Epub 2024 Jun 25.
5
Short-Term Memory Impairment短期记忆障碍
6
MoRF_ESM: Prediction of MoRFs in disordered proteins based on a deep transformer protein language model.MoRF_ESM:基于深度变压器蛋白质语言模型预测无序蛋白质中的分子识别特征片段
J Bioinform Comput Biol. 2024 Apr;22(2):2450006. doi: 10.1142/S0219720024500069. Epub 2024 May 28.
7
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
8
ToxinPred 3.0: An improved method for predicting the toxicity of peptides.ToxinPred 3.0:一种改进的多肽毒性预测方法。
Comput Biol Med. 2024 Sep;179:108926. doi: 10.1016/j.compbiomed.2024.108926. Epub 2024 Jul 21.
9
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
10
A novel double machine learning approach for detecting early breast cancer using advanced feature selection and dimensionality reduction techniques.一种使用先进特征选择和降维技术检测早期乳腺癌的新型双机器学习方法。
Sci Rep. 2025 Jul 2;15(1):22971. doi: 10.1038/s41598-025-06426-7.

引用本文的文献

1
AMCL: supervised contrastive learning with hard sample mining for multi-functional therapeutic peptide prediction.AMCL:用于多功能治疗性肽预测的带难样本挖掘的监督对比学习
BMC Biol. 2025 Jul 1;23(1):170. doi: 10.1186/s12915-025-02273-0.
2
MFP-MFL: Leveraging Graph Attention and Multi-Feature Integration for Superior Multifunctional Bioactive Peptide Prediction.MFP-MFL:利用图注意力和多特征整合实现卓越的多功能生物活性肽预测
Int J Mol Sci. 2025 Feb 4;26(3):1317. doi: 10.3390/ijms26031317.

本文引用的文献

1
In Silico Screening and Optimization of Cell-Penetrating Peptides Using Deep Learning Methods.基于深度学习方法的细胞穿透肽的计算机筛选和优化。
Biomolecules. 2023 Mar 13;13(3):522. doi: 10.3390/biom13030522.
2
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
3
Cell penetrating peptide: A potent delivery system in vaccine development.细胞穿透肽:疫苗研发中的一种有效递送系统。
Front Pharmacol. 2022 Nov 8;13:1072685. doi: 10.3389/fphar.2022.1072685. eCollection 2022.
4
MLCPP 2.0: An Updated Cell-penetrating Peptides and Their Uptake Efficiency Predictor.MLCPP 2.0:更新的细胞穿透肽及其摄取效率预测器。
J Mol Biol. 2022 Jun 15;434(11):167604. doi: 10.1016/j.jmb.2022.167604. Epub 2022 Apr 28.
5
ProteinBERT: a universal deep-learning model of protein sequence and function.蛋白质 BERT:一种通用的蛋白质序列和功能深度学习模型。
Bioinformatics. 2022 Apr 12;38(8):2102-2110. doi: 10.1093/bioinformatics/btac020.
6
Class imbalance in out-of-distribution datasets: Improving the robustness of the TextCNN for the classification of rare cancer types.分布外数据集的不平衡问题:提高 TextCNN 对罕见癌症类型分类的鲁棒性。
J Biomed Inform. 2022 Jan;125:103957. doi: 10.1016/j.jbi.2021.103957. Epub 2021 Nov 22.
7
Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space.使用机器学习算法预测细胞穿透肽并探索其化学空间。
Sci Rep. 2021 Apr 7;11(1):7628. doi: 10.1038/s41598-021-87134-w.
8
Cell-penetrating peptides (CPPs): an overview of applications for improving the potential of nanotherapeutics.细胞穿透肽(CPPs):提高纳米治疗药物潜力的应用概述
Biomater Sci. 2021 Feb 23;9(4):1153-1188. doi: 10.1039/d0bm01755h.
9
Cppsite 2.0: An Available Database of Experimentally Validated Cell-Penetrating Peptides Predicting their Secondary and Tertiary Structures.Cppsite 2.0:一个提供实验验证的细胞穿透肽数据库,预测其二级和三级结构。
J Mol Biol. 2021 May 28;433(11):166703. doi: 10.1016/j.jmb.2020.11.002. Epub 2020 Nov 10.
10
TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree.目标 CPP:使用梯度提升决策树从优化的多尺度特征中准确预测细胞穿透肽。
J Comput Aided Mol Des. 2020 Aug;34(8):841-856. doi: 10.1007/s10822-020-00307-z. Epub 2020 Mar 16.