Suppr超能文献

使用多种伪氨基酸组成类型和不同的机器学习算法对古菌磷脂酶进行分类和预测。

Using several pseudo amino acid composition types and different machine learning algorithms to classify and predict archaeal phospholipases.

作者信息

Samman Nour, Mohabatkar Hassan, Rabiei Parisa

机构信息

Department of Biotechnology, Faculty of Biological Science and Technology, University of Isfahan, Isfahan, Iran.

出版信息

Mol Biol Res Commun. 2023;12(3):117-126. doi: 10.22099/mbrc.2023.47756.1845.

Abstract

Phospholipases, as important lipolytic enzymes, have diverse industrial applications. Regarding the stability of extremophilic archaea's proteins in harsh conditions, analyses of unusual features of their proteins are significantly important for their utilization. This research was accomplished to study of archaeal phospholipases' properties and to develop a pioneering method for distinguishing these enzymes from other archaeal enzymes via machine learning algorithms and Chou's pseudo-amino acid composition concept. The non-redundant sequences of archaeal phospholipases were collected. BioSeq-Analysis sever was used with Support Vector Machine (SVM), Random Forests (RF), Covariance Discrimination (CD), and Optimized Evidence-Theoretic K-nearest Neighbor (OET-KNN) as powerful machine learnings algorithms. Also, different Chou's pseudo-amino acid composition modes were performed and then, 5-fold cross-validation was applied to the sequences. Based on our results, the OET-KNN predictor, with 96% accuracy, yields the best performance in SC-PseAAC mode by 5-fold cross-validation. This predictor also achieved very high values of specificity (95%), sensitivity (96%), Matthews's correlation coefficient (0.92), and accuracy (96%). The present investigation yielded a robust anticipatory model for the archaeal phospholipase prediction utilizing the tenets PseAAC and OET-KNN machine learning algorithm.

摘要

磷脂酶作为重要的脂解酶,具有多种工业应用。关于嗜极端古菌蛋白质在恶劣条件下的稳定性,分析其蛋白质的异常特征对其利用具有重要意义。本研究旨在研究古菌磷脂酶的性质,并通过机器学习算法和周氏伪氨基酸组成概念开发一种将这些酶与其他古菌酶区分开来的开创性方法。收集了古菌磷脂酶的非冗余序列。使用BioSeq-Analysis服务器以及支持向量机(SVM)、随机森林(RF)、协方差判别(CD)和优化证据理论K近邻(OET-KNN)等强大的机器学习算法。此外,还采用了不同的周氏伪氨基酸组成模式,然后对序列进行5折交叉验证。根据我们的结果,在5折交叉验证中,OET-KNN预测器在SC-PseAAC模式下以96%的准确率表现最佳。该预测器还获得了非常高的特异性值(95%)、灵敏度(96%)、马修斯相关系数(0.92)和准确率(96%)。本研究利用PseAAC原则和OET-KNN机器学习算法建立了一个强大的古菌磷脂酶预测模型。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验