使用多种伪氨基酸组成类型和不同的机器学习算法对古菌磷脂酶进行分类和预测。

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases.

作者信息

Samman Nour, Mohabatkar Hassan, Rabiei Parisa

机构信息

Department of Biotechnology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.

出版信息

Mol Biol Res Commun. 2023;12(3):117-126. doi: 10.22099/mbrc.2023.47756.1845.

DOI:10.22099/mbrc.2023.47756.1845

PMID:37525666

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10387176/

Abstract

Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea's proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to study of archaeal phospholipases' properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou's pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou's pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews's correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm.

摘要

磷脂酶作为重要的脂解酶，具有多种工业应用。关于嗜极端古菌蛋白质在恶劣条件下的稳定性，分析其蛋白质的异常特征对其利用具有重要意义。本研究旨在研究古菌磷脂酶的性质，并通过机器学习算法和周氏伪氨基酸组成概念开发一种将这些酶与其他古菌酶区分开来的开创性方法。收集了古菌磷脂酶的非冗余序列。使用BioSeq-Analysis服务器以及支持向量机（SVM）、随机森林（RF）、协方差判别（CD）和优化证据理论K近邻（OET-KNN）等强大的机器学习算法。此外，还采用了不同的周氏伪氨基酸组成模式，然后对序列进行5折交叉验证。根据我们的结果，在5折交叉验证中，OET-KNN预测器在SC-PseAAC模式下以96%的准确率表现最佳。该预测器还获得了非常高的特异性值（95%）、灵敏度（96%）、马修斯相关系数（0.92）和准确率（96%）。本研究利用PseAAC原则和OET-KNN机器学习算法建立了一个强大的古菌磷脂酶预测模型。

相似文献

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases.使用多种伪氨基酸组成类型和不同的机器学习算法对古菌磷脂酶进行分类和预测。

Mol Biol Res Commun. 2023;12(3):117-126. doi: 10.22099/mbrc.2023.47756.1845.

Prediction of metalloproteinase family based on the concept of Chou's pseudo amino acid composition using a machine learning approach.基于机器学习方法，利用周的伪氨基酸组成概念对金属蛋白酶家族进行预测。

J Struct Funct Genomics. 2011 Dec;12(4):191-7. doi: 10.1007/s10969-011-9120-4. Epub 2011 Dec 3.

Discrimination of acidic and alkaline enzyme using Chou's pseudo amino acid composition in conjunction with probabilistic neural network model.基于周氏伪氨基酸组成结合概率神经网络模型对酸性和碱性酶的鉴别

J Theor Biol. 2015 Jan 21;365:197-203. doi: 10.1016/j.jtbi.2014.10.014. Epub 2014 Oct 22.

Introducing of an integrated artificial neural network and Chou's pseudo amino acid composition approach for computational epitope-mapping of Crimean-Congo haemorrhagic fever virus antigens.介绍一种集成的人工神经网络和 Chou 的伪氨基酸组成方法，用于计算克里米亚-刚果出血热病毒抗原的计算表位图谱。

Int Immunopharmacol. 2020 Jan;78:106020. doi: 10.1016/j.intimp.2019.106020. Epub 2019 Nov 24.

Prediction of GABAA receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine.基于 Chou 的伪氨基酸组成和支持向量机预测 GABAA 受体蛋白。

J Theor Biol. 2011 Jul 21;281(1):18-23. doi: 10.1016/j.jtbi.2011.04.017. Epub 2011 Apr 28.

Computational prediction of antifungal peptides via Chou's PseAAC and SVM.基于周式伪氨基酸组成和支持向量机的抗真菌肽计算预测

J Bioinform Comput Biol. 2018 Aug;16(4):1850016. doi: 10.1142/S0219720018500166. Epub 2018 May 29.

DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.DPP-PseAAC：一种基于 Chou 的通用 PseAAC 的 DNA 结合蛋白预测模型。

J Theor Biol. 2018 Sep 7;452:22-34. doi: 10.1016/j.jtbi.2018.05.006. Epub 2018 May 16.

Using Chou's General Pseudo Amino Acid Composition to Classify Laccases from Bacterial and Fungal Sources via Chou's Five-Step Rule.利用周的通用伪氨基酸组成，通过周的五步法则对细菌和真菌来源的漆酶进行分类。

Appl Biochem Biotechnol. 2020 Mar;190(3):1035-1048. doi: 10.1007/s12010-019-03141-8. Epub 2019 Oct 28.

Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC.通过将二肽组成纳入周的通用 PseAAC，鉴定热休克蛋白家族和 J 蛋白类型。

Comput Methods Programs Biomed. 2015 Nov;122(2):165-74. doi: 10.1016/j.cmpb.2015.07.005. Epub 2015 Jul 22.

Discriminating outer membrane proteins with Fuzzy K-nearest Neighbor algorithms based on the general form of Chou's PseAAC.基于周式伪氨基酸组成通用形式，运用模糊K近邻算法鉴别外膜蛋白。

Protein Pept Lett. 2012 Apr;19(4):411-21. doi: 10.2174/092986612799789387.

引用本文的文献

In-silico comparison of fungal and bacterial asparaginase enzymes.真菌和细菌天冬酰胺酶的计算机模拟比较

Mol Biol Res Commun. 2024;13(4):183-191. doi: 10.22099/mbrc.2024.50123.1981.

本文引用的文献

Archaeal lipolytic enzymes: Current developments and further prospects.古菌脂肪酶：最新进展与未来展望。

Biotechnol Adv. 2022 Dec;61:108054. doi: 10.1016/j.biotechadv.2022.108054. Epub 2022 Oct 26.

ABLE: Attention based learning for enzyme classification.ABLE：基于注意力的酶分类学习。

Comput Biol Chem. 2021 Oct;94:107558. doi: 10.1016/j.compbiolchem.2021.107558. Epub 2021 Aug 19.

A generalized machine-learning aided method for targeted identification of industrial enzymes from metagenome: A xylanase temperature dependence case study.一种广义的机器学习辅助方法，用于从宏基因组中靶向鉴定工业酶：以木聚糖酶的温度依赖性为例。

Biotechnol Bioeng. 2021 Feb;118(2):759-769. doi: 10.1002/bit.27608. Epub 2020 Nov 14.

Computational prediction of antifungal peptides via Chou's PseAAC and SVM.基于周式伪氨基酸组成和支持向量机的抗真菌肽计算预测

J Bioinform Comput Biol. 2018 Aug;16(4):1850016. doi: 10.1142/S0219720018500166. Epub 2018 May 29.

BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches.生物序列分析：一个基于机器学习方法的 DNA、RNA 和蛋白质序列分析平台。

Brief Bioinform. 2019 Jul 19;20(4):1280-1294. doi: 10.1093/bib/bbx165.

Archaea Are Interactive Components of Complex Microbiomes.古菌是复杂微生物组的交互组成部分。

Trends Microbiol. 2018 Jan;26(1):70-85. doi: 10.1016/j.tim.2017.07.004. Epub 2017 Aug 18.

New Achievements in Bioinformatics Prediction of Post Translational Modification of Proteins.蛋白质翻译后修饰的生物信息学预测新进展

Curr Top Med Chem. 2017;17(21):2381-2392. doi: 10.2174/1568026617666170328100908.

Recombinant Lipases and Phospholipases and Their Use as Biocatalysts for Industrial Applications.重组脂肪酶和磷脂酶及其作为工业应用生物催化剂的用途。

Int J Mol Sci. 2015 Sep 1;16(9):20774-840. doi: 10.3390/ijms160920774.

Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.Pse-in-One：一个用于生成DNA、RNA和蛋白质序列各种伪组件模式的网络服务器。

Nucleic Acids Res. 2015 Jul 1;43(W1):W65-71. doi: 10.1093/nar/gkv458. Epub 2015 May 9.

The cell cycle of archaea.古菌的细胞周期。

Nat Rev Microbiol. 2013 Sep;11(9):627-38. doi: 10.1038/nrmicro3077. Epub 2013 Jul 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用多种伪氨基酸组成类型和不同的机器学习算法对古菌磷脂酶进行分类和预测。

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献