EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides.

Authors

Ge Ruiquan, Feng Guanwen, Jing Xiaoyang, Zhang Renfeng, Wang Pu, Wu Qing

Affiliations

Key Laboratory of Complex Systems Modeling and Simulation, School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China.

Xi'an Key Laboratory of Big Data and Intelligent Vision, School of Computer Science and Technology, Xidian University, Xi'an, China.

Publication information

Front Genet. 2020 Jul 30;11:760. doi: 10.3389/fgene.2020.00760. eCollection 2020.

DOI:10.3389/fgene.2020.00760

PMID:32903636

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7438906/

Abstract

As cancer remains one of the main threats to human life, developing efficient cancer treatments is urgent. Anticancer peptides, which could overcome the significant side effects and poor outcomes of traditional cancer treatments, have emerged in recent years as a potential alternative. However, identifying anticancer peptides experimentally is time-consuming and resource-intensive, so it is important to develop effective computational tools that quickly and accurately identify potential anticancer peptides from amino acid sequences. For most current computational methods, feature representation plays a key role in their success. This study proposes a fast and accurate approach to identifying anticancer peptides using diversified feature representations and ensemble learning. For the feature representations, information is encoded from multidimensional feature spaces, including sequence composition, sequence order, and physicochemical properties, among others. To better model the potential relationships among peptides, multiple ensemble classifiers (LightGBMs) are first applied to the different feature sets. Their outputs are then used as inputs to a support vector machine classifier, which identifies anticancer peptides. Experimental results from cross-validation and independent test sets demonstrate that the method achieves performance better than or comparable to other state-of-the-art methods.
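
To make the idea of "diversified feature representations" concrete, the following is a minimal sketch of two common peptide encodings: amino acid composition (a sequence-composition view) and dipeptide composition (a simple local sequence-order view). The abstract does not list the exact encodings the authors used, so the function names `amino_acid_composition`, `dipeptide_composition`, and `feature_views`, and the choice of these two encodings, are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids, in a fixed order so feature positions are stable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs, e.g. "AA", "AC", ..., "YY".
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def amino_acid_composition(seq: str) -> list:
    """Return the fraction of each standard residue in seq (20 values)."""
    counts = Counter(seq.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[a] / total for a in AMINO_ACIDS]


def dipeptide_composition(seq: str) -> list:
    """Return the fraction of each adjacent residue pair in seq (400 values)."""
    seq = seq.upper()
    valid = set(DIPEPTIDES)
    # Slide a window of width 2 over the sequence; skip non-standard pairs.
    counts = Counter(
        seq[i:i + 2] for i in range(len(seq) - 1) if seq[i:i + 2] in valid
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard dipeptides")
    return [counts[d] / total for d in DIPEPTIDES]


def feature_views(seq: str) -> dict:
    """Hypothetical helper: one feature vector per view, keyed by view name."""
    return {
        "aac": amino_acid_composition(seq),
        "dpc": dipeptide_composition(seq),
    }


if __name__ == "__main__":
    views = feature_views("AAKK")  # toy peptide, not taken from any dataset
    print(len(views["aac"]), len(views["dpc"]))
```

In a two-layer setup like the one the abstract describes, each view would be passed to its own first-layer classifier (LightGBM in the paper), and the first-layer outputs would then form the input vector for the second-layer support vector machine. That second stage is not shown here.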
