• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多标签分类器识别多功能酶

Identification of Multi-Functional Enzyme with Multi-Label Classifier.

作者信息

Che Yuxin, Ju Ying, Xuan Ping, Long Ren, Xing Fei

机构信息

School of Information Science and Technology, Xiamen University, Xiamen, Fujian 361005, China.

School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.

出版信息

PLoS One. 2016 Apr 14;11(4):e0153503. doi: 10.1371/journal.pone.0153503. eCollection 2016.

DOI:10.1371/journal.pone.0153503
PMID:27078147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4831692/
Abstract

Enzymes are important and effective biological catalyst proteins participating in almost all active cell processes. Identification of multi-functional enzymes is essential in understanding the function of enzymes. Machine learning methods perform better in protein structure and function prediction than traditional biological wet experiments. Thus, in this study, we explore an efficient and effective machine learning method to categorize enzymes according to their function. Multi-functional enzymes are predicted with a special machine learning strategy, namely, multi-label classifier. Sequence features are extracted from a position-specific scoring matrix with autocross-covariance transformation. Experiment results show that the proposed method obtains an accuracy rate of 94.1% in classifying six main functional classes through five cross-validation tests and outperforms state-of-the-art methods. In addition, 91.25% accuracy is achieved in multi-functional enzyme prediction, which is often ignored in other enzyme function prediction studies. The online prediction server and datasets can be accessed from the link http://server.malab.cn/MEC/.

摘要

酶是参与几乎所有活跃细胞过程的重要且有效的生物催化蛋白。识别多功能酶对于理解酶的功能至关重要。机器学习方法在蛋白质结构和功能预测方面比传统的生物湿实验表现更好。因此,在本研究中,我们探索一种高效且有效的机器学习方法,根据酶的功能对其进行分类。多功能酶采用一种特殊的机器学习策略进行预测,即多标签分类器。序列特征通过自协方差变换从位置特异性评分矩阵中提取。实验结果表明,所提出的方法通过五次交叉验证测试在对六个主要功能类别进行分类时获得了94.1%的准确率,并且优于现有方法。此外,在多功能酶预测中实现了91.25%的准确率,这在其他酶功能预测研究中常常被忽视。可通过链接http://server.malab.cn/MEC/访问在线预测服务器和数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/bb9d27a253f6/pone.0153503.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/b0e732111513/pone.0153503.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/bd8cf058a790/pone.0153503.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/bb9d27a253f6/pone.0153503.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/b0e732111513/pone.0153503.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/bd8cf058a790/pone.0153503.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b4f/4831692/bb9d27a253f6/pone.0153503.g003.jpg

相似文献

1
Identification of Multi-Functional Enzyme with Multi-Label Classifier.使用多标签分类器识别多功能酶
PLoS One. 2016 Apr 14;11(4):e0153503. doi: 10.1371/journal.pone.0153503. eCollection 2016.
2
HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source.HPSLPred:一种用于人类蛋白质亚细胞定位预测的集成多标签分类器,源数据不均衡。
Proteomics. 2017 Sep;17(17-18). doi: 10.1002/pmic.201700262.
3
Classifying Multifunctional Enzymes by Incorporating Three Different Models into Chou's General Pseudo Amino Acid Composition.通过将三种不同模型纳入周氏广义伪氨基酸组成对多功能酶进行分类
J Membr Biol. 2016 Aug;249(4):551-7. doi: 10.1007/s00232-016-9904-3. Epub 2016 Apr 25.
4
Highly accurate prediction of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC.通过将平均块和位置特异性得分矩阵(PSSM)信息纳入通用伪氨基酸组成(PseAAC)来实现对蛋白质自相互作用的高度准确预测。
J Theor Biol. 2017 Nov 7;432:80-86. doi: 10.1016/j.jtbi.2017.08.009. Epub 2017 Aug 9.
5
ELM-MHC: An Improved MHC Identification Method with Extreme Learning Machine Algorithm.ELM-MHC:一种基于极端学习机算法的 MHC 鉴定方法的改进。
J Proteome Res. 2019 Mar 1;18(3):1392-1401. doi: 10.1021/acs.jproteome.9b00012. Epub 2019 Feb 18.
6
DEEPre: sequence-based enzyme EC number prediction by deep learning.DEEPre:基于深度学习的酶 EC 号序列预测。
Bioinformatics. 2018 Mar 1;34(5):760-769. doi: 10.1093/bioinformatics/btx680.
7
A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation.基于自互协方差变换的新分类学蛋白质折叠识别方法。
Bioinformatics. 2009 Oct 15;25(20):2655-62. doi: 10.1093/bioinformatics/btp500. Epub 2009 Aug 25.
8
AOPs-SVM: A Sequence-Based Classifier of Antioxidant Proteins Using a Support Vector Machine.AOPs-SVM:一种基于序列的使用支持向量机的抗氧化蛋白分类器。
Front Bioeng Biotechnol. 2019 Sep 18;7:224. doi: 10.3389/fbioe.2019.00224. eCollection 2019.
9
A label distance maximum-based classifier for multi-label learning.一种基于标签距离最大化的多标签学习分类器。
Biomed Mater Eng. 2015;26 Suppl 1:S1969-76. doi: 10.3233/BME-151500.
10
Predicting human splicing branchpoints by combining sequence-derived features and multi-label learning methods.通过结合序列衍生特征和多标签学习方法预测人类剪接分支点
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):464. doi: 10.1186/s12859-017-1875-6.

引用本文的文献

1
PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes.PredictEFC:一种用于预测酶家族类别的快速高效的多标签分类器。
BMC Bioinformatics. 2024 Jan 30;25(1):50. doi: 10.1186/s12859-024-05665-1.
2
Antibody Class(es) Predictor for Epitopes (AbCPE): A Multi-Label Classification Algorithm.表位抗体类别预测器(AbCPE):一种多标签分类算法。
Front Bioinform. 2021 Sep 7;1:709951. doi: 10.3389/fbinf.2021.709951. eCollection 2021.
3
Machine learning for enzyme engineering, selection and design.机器学习在酶工程、选择和设计中的应用。

本文引用的文献

1
PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation.PseDNA-Pro:结合周氏伪氨基酸组成和物理化学距离变换的DNA结合蛋白鉴定方法
Mol Inform. 2015 Jan;34(1):8-17. doi: 10.1002/minf.201400025. Epub 2014 Sep 26.
2
Prediction and Validation of Disease Genes Using HeteSim Scores.使用异构相似性得分预测和验证疾病基因
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):687-695. doi: 10.1109/TCBB.2016.2520947. Epub 2016 Feb 12.
3
Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique.
Protein Eng Des Sel. 2021 Feb 15;34. doi: 10.1093/protein/gzab019.
4
Machine learning differentiates enzymatic and non-enzymatic metals in proteins.机器学习区分蛋白质中的酶促金属和非酶促金属。
Nat Commun. 2021 Jun 17;12(1):3712. doi: 10.1038/s41467-021-24070-3.
5
A hierarchical deep learning based approach for multi-functional enzyme classification.基于深度学习的多层次方法用于多功能酶分类。
Protein Sci. 2021 Sep;30(9):1935-1945. doi: 10.1002/pro.4146. Epub 2021 Jun 28.
6
Alignment-Free Method to Predict Enzyme Classes and Subclasses.无比对方法预测酶类和亚类。
Int J Mol Sci. 2019 Oct 29;20(21):5389. doi: 10.3390/ijms20215389.
7
mlDEEPre: Multi-Functional Enzyme Function Prediction With Hierarchical Multi-Label Deep Learning.mlDEEPre:基于分层多标签深度学习的多功能酶功能预测
Front Genet. 2019 Jan 22;9:714. doi: 10.3389/fgene.2018.00714. eCollection 2018.
8
iNuc-ext-PseTNC: an efficient ensemble model for identification of nucleosome positioning by extending the concept of Chou's PseAAC to pseudo-tri-nucleotide composition.iNuc-ext-PseTNC:一种通过将 Chou 的 PseAAC 概念扩展到伪三核苷酸组成来有效识别核小体定位的集成模型。
Mol Genet Genomics. 2019 Feb;294(1):199-210. doi: 10.1007/s00438-018-1498-2. Epub 2018 Oct 5.
9
ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature.ECPred:一种基于 EC 命名法预测蛋白质序列酶功能的工具。
BMC Bioinformatics. 2018 Sep 21;19(1):334. doi: 10.1186/s12859-018-2368-y.
10
Fuzzy Decision Making Approach to Identify Optimum Enzyme Targets and Drug Dosage for Remedying Presynaptic Dopamine Deficiency.用于确定纠正突触前多巴胺缺乏的最佳酶靶点和药物剂量的模糊决策方法
PLoS One. 2016 Oct 13;11(10):e0164589. doi: 10.1371/journal.pone.0164589. eCollection 2016.
使用具有特征选择技术的周氏伪氨基酸组成鉴定免疫球蛋白。
Mol Biosyst. 2016 Apr;12(4):1269-75. doi: 10.1039/c5mb00883b. Epub 2016 Feb 17.
4
iMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions.iMiRNA-SSF:通过结合不同分布的负集改进微小RNA前体的识别
Sci Rep. 2016 Jan 12;6:19062. doi: 10.1038/srep19062.
5
On the Universality and Non-Universality of Spiking Neural P Systems With Rules on Synapses.关于具有突触规则的脉冲神经P系统的通用性和非通用性
IEEE Trans Nanobioscience. 2015 Dec;14(8):960-6. doi: 10.1109/TNB.2015.2503603. Epub 2015 Nov 25.
6
DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.通过结合伪氨基酸组成和基于轮廓的蛋白质表示来鉴定DNA结合蛋白
Sci Rep. 2015 Oct 20;5:15479. doi: 10.1038/srep15479.
7
iEnhancer-2L: a two-layer predictor for identifying enhancers and their strength by pseudo k-tuple nucleotide composition.iEnhancer-2L:一种通过伪 k-元核苷酸组成识别增强子及其强度的两层预测器。
Bioinformatics. 2016 Feb 1;32(3):362-9. doi: 10.1093/bioinformatics/btv604. Epub 2015 Oct 17.
8
Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set.通过纳入高质量阴性集改进并有望实现人类微小RNA的鉴定
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jan-Feb;11(1):192-201. doi: 10.1109/TCBB.2013.146.
9
Identifying Antioxidant Proteins by Using Optimal Dipeptide Compositions.利用最优二肽组成鉴定抗氧化蛋白。
Interdiscip Sci. 2016 Jun;8(2):186-191. doi: 10.1007/s12539-015-0124-9. Epub 2015 Sep 7.
10
Enhanced Protein Fold Prediction Method Through a Novel Feature Extraction Technique.通过一种新型特征提取技术增强蛋白质折叠预测方法
IEEE Trans Nanobioscience. 2015 Sep;14(6):649-59. doi: 10.1109/TNB.2015.2450233.