• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 KNN 和最小距离的分类器在酶家族预测中的效率分析。

Efficiency analysis of KNN and minimum distance-based classifiers in enzyme family prediction.

机构信息

Dokuz Eylül University, Faculty of Arts and Sciences, Department of Statistics, 35160 Kaynaklar Campus, Izmir, Turkey.

出版信息

Comput Biol Chem. 2009 Dec;33(6):461-4. doi: 10.1016/j.compbiolchem.2009.09.002. Epub 2009 Sep 28.

DOI:10.1016/j.compbiolchem.2009.09.002
PMID:19853514
Abstract

Nearly all enzymes are proteins. They are the biological catalysts that accelerate the function of cellular reactions. Because of different characteristics of reaction tasks, they split into six classes: oxidoreductases (EC-1), transferases (EC-2), hydrolases (EC-3), lyases (EC-4), isomerases (EC-5), ligases (EC-6). Prediction of enzyme classes is of great importance in identifying which enzyme class is a member of a protein. Since the enzyme sequences increase day by day, contrary to experimental analysis in prediction of enzyme classes for a newly found enzyme sequence, providing from data mining techniques becomes very useful and time-saving. In this paper, two kinds of simple minimum distance-based classifier methods have been proposed. These methods and known K-nearest neighbor (KNN) classification algorithm have been performed in order to classify enzymes according to their amino acid composition. Performance measurements and elapsed time to execute algorithms have been compared. In addition, equality of two proposed approaches under special condition has been proved in order to be a guide for researchers.

摘要

几乎所有的酶都是蛋白质。它们是加速细胞反应功能的生物催化剂。由于反应任务的不同特点,它们分为六类:氧化还原酶(EC-1)、转移酶(EC-2)、水解酶(EC-3)、裂解酶(EC-4)、异构酶(EC-5)、连接酶(EC-6)。预测酶类对于识别蛋白质所属的酶类具有重要意义。由于酶序列的数量与日俱增,与针对新发现的酶序列进行实验分析相比,从数据挖掘技术中获取信息变得非常有用且节省时间。本文提出了两种基于简单最小距离的分类器方法。为了根据氨基酸组成对酶进行分类,将这些方法与已知的 K-最近邻(KNN)分类算法进行了比较。比较了性能测量和算法执行时间。此外,为了指导研究人员,还证明了在特殊条件下两种方法的相等性。

相似文献

1
Efficiency analysis of KNN and minimum distance-based classifiers in enzyme family prediction.基于 KNN 和最小距离的分类器在酶家族预测中的效率分析。
Comput Biol Chem. 2009 Dec;33(6):461-4. doi: 10.1016/j.compbiolchem.2009.09.002. Epub 2009 Sep 28.
2
Predicting enzyme family classes by hybridizing gene product composition and pseudo-amino acid composition.通过杂交基因产物组成和伪氨基酸组成预测酶家族类别。
J Theor Biol. 2005 May 7;234(1):145-9. doi: 10.1016/j.jtbi.2004.11.017. Epub 2005 Jan 26.
3
Accurate prediction of enzyme subfamily class using an adaptive fuzzy k-nearest neighbor method.使用自适应模糊k近邻法准确预测酶亚家族类别。
Biosystems. 2007 Sep-Oct;90(2):405-13. doi: 10.1016/j.biosystems.2006.10.004. Epub 2006 Oct 26.
4
Prediction of enzyme classes from 3D structure: a general model and examples of experimental-theoretic scoring of peptide mass fingerprints of Leishmania proteins.从三维结构预测酶类别:利什曼原虫蛋白质肽质量指纹图谱的通用模型及实验-理论评分示例
J Proteome Res. 2009 Sep;8(9):4372-82. doi: 10.1021/pr9003163.
5
Using Nearest Feature Line and Tunable Nearest Neighbor methods for prediction of protein subcellular locations.使用最近特征线和可调最近邻方法预测蛋白质亚细胞定位。
Comput Biol Chem. 2005 Oct;29(5):388-92. doi: 10.1016/j.compbiolchem.2005.08.002. Epub 2005 Oct 5.
6
Using ensemble classifier to identify membrane protein types.使用集成分类器识别膜蛋白类型。
Amino Acids. 2007;32(4):483-8. doi: 10.1007/s00726-006-0439-2. Epub 2006 Oct 12.
7
ECS: an automatic enzyme classifier based on functional domain composition.ECS:一种基于功能域组成的自动酶分类器。
Comput Biol Chem. 2007 Jun;31(3):226-32. doi: 10.1016/j.compbiolchem.2007.03.008. Epub 2007 Mar 30.
8
Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes.利用周氏两亲性伪氨基酸组成和支持向量机预测酶亚家族类别。
J Theor Biol. 2007 Oct 7;248(3):546-51. doi: 10.1016/j.jtbi.2007.06.001. Epub 2007 Jun 9.
9
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
10
Sequence and structural features of enzymes and their active sites by EC class.按酶的国际分类编号(EC编号)分类的酶及其活性位点的序列和结构特征
J Mol Biol. 2009 Mar 13;386(5):1423-36. doi: 10.1016/j.jmb.2008.11.057. Epub 2008 Dec 10.

引用本文的文献

1
Prediction of enzyme function using an interpretable optimized ensemble learning framework.使用可解释的优化集成学习框架预测酶的功能。
Chem Sci. 2025 Sep 1. doi: 10.1039/d5sc04513d.
2
Machine Learning-Guided Protein Engineering.机器学习引导的蛋白质工程
ACS Catal. 2023 Oct 13;13(21):13863-13895. doi: 10.1021/acscatal.3c02743. eCollection 2023 Nov 3.
3
Improving automatic GO annotation with semantic similarity.利用语义相似度提高 GO 自动注释的效果。
BMC Bioinformatics. 2022 Dec 12;23(Suppl 2):433. doi: 10.1186/s12859-022-04958-7.
4
Identification of Human Enzymes Using Amino Acid Composition and the Composition of -Spaced Amino Acid Pairs.利用氨基酸组成和 - 间隔氨基酸对组成鉴定人类酶。
Biomed Res Int. 2020 May 22;2020:9235920. doi: 10.1155/2020/9235920. eCollection 2020.
5
GrAPFI: predicting enzymatic function of proteins from domain similarity graphs.GrAPFI:基于结构域相似性图预测蛋白质的酶功能。
BMC Bioinformatics. 2020 Apr 29;21(1):168. doi: 10.1186/s12859-020-3460-7.
6
ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature.ECPred:一种基于 EC 命名法预测蛋白质序列酶功能的工具。
BMC Bioinformatics. 2018 Sep 21;19(1):334. doi: 10.1186/s12859-018-2368-y.
7
DEEPre: sequence-based enzyme EC number prediction by deep learning.DEEPre:基于深度学习的酶 EC 号序列预测。
Bioinformatics. 2018 Mar 1;34(5):760-769. doi: 10.1093/bioinformatics/btx680.
8
Application of a hierarchical enzyme classification method reveals the role of gut microbiome in human metabolism.一种分层酶分类方法的应用揭示了肠道微生物群在人体新陈代谢中的作用。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S16. doi: 10.1186/1471-2164-16-S7-S16. Epub 2015 Jun 11.
9
A survey of computational intelligence techniques in protein function prediction.蛋白质功能预测中的计算智能技术综述。
Int J Proteomics. 2014;2014:845479. doi: 10.1155/2014/845479. Epub 2014 Dec 11.
10
Computational Approaches for Automated Classification of Enzyme Sequences.酶序列自动分类的计算方法
J Proteomics Bioinform. 2011 Aug 23;4:147-152. doi: 10.4172/jpb.1000183.