Support Vector Machines Trained with Evolutionary Algorithms Employing Kernel Adatron for Large Scale Classification of Protein Structures.

Authors

Arana-Daniel Nancy, Gallegos Alberto A, López-Franco Carlos, Alanís Alma Y, Morales Jacob, López-Franco Adriana

Affiliation

Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara, Guadalajara, Jalisco, México.

Publication information

Evol Bioinform Online. 2016 Dec 4;12:285-302. doi: 10.4137/EBO.S40912. eCollection 2016.

DOI:10.4137/EBO.S40912

PMID:27980384

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5140013/

Abstract

As computing power has increased, the amount of data that can be processed in a short time has grown exponentially, and so has the importance of classifying large-scale data efficiently. Support vector machines have shown good results in classifying large amounts of high-dimensional data, such as data generated by protein structure prediction, spam recognition, medical diagnosis, optical character recognition, and text classification. Most state-of-the-art approaches to large-scale learning use traditional optimization methods, such as quadratic programming or gradient descent, which leaves the use of evolutionary algorithms for training support vector machines an area still to be explored. This paper proposes a simple-to-implement approach, based on evolutionary algorithms and Kernel-Adatron, for solving large-scale classification problems, with a focus on protein structure prediction. The functional properties of proteins depend on their three-dimensional structures. Knowing the structures of proteins is crucial for biology and can lead to improvements in areas such as medicine, agriculture, and biofuels.
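For readers unfamiliar with Kernel-Adatron, the following is a minimal sketch of the classic bias-free Kernel-Adatron update on its own. It is not the authors' method: the abstract gives no algorithmic details, and the evolutionary component is not shown. The RBF kernel, the function names (`rbf_kernel`, `kernel_adatron`, `predict`), and the parameters `gamma`, `lr`, and `epochs` are assumptions chosen for illustration.

```python
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B.

    The kernel choice is an assumption; the abstract does not name one.
    """
    sq_dist = (
        np.sum(A ** 2, axis=1)[:, None]
        + np.sum(B ** 2, axis=1)[None, :]
        - 2.0 * (A @ B.T)
    )
    # Clip tiny negative values caused by floating-point rounding.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))


def kernel_adatron(K, y, lr=0.1, epochs=200):
    """Classic bias-free Kernel-Adatron on a precomputed kernel matrix K.

    y holds labels in {-1, +1}. Each multiplier alpha[i] is nudged so that
    the margin of sample i moves toward 1, then clipped at zero.
    """
    n = len(y)
    alpha = np.zeros(n)
    for _ in range(epochs):
        for i in range(n):
            # Margin of sample i under the current multipliers.
            margin = y[i] * np.dot(alpha * y, K[:, i])
            # Additive update followed by projection onto alpha >= 0.
            alpha[i] = max(0.0, alpha[i] + lr * (1.0 - margin))
    return alpha


def predict(alpha, y_train, K_test_train):
    """Sign of the kernel expansion; K_test_train has shape (n_test, n_train)."""
    return np.sign(K_test_train @ (alpha * y_train))


if __name__ == "__main__":
    # Toy, linearly separable data (made up purely for illustration).
    X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
                  [3.0, 3.0], [3.0, 4.0], [4.0, 3.0]])
    y = np.array([-1.0, -1.0, -1.0, 1.0, 1.0, 1.0])
    K = rbf_kernel(X, X, gamma=0.5)
    alpha = kernel_adatron(K, y)
    print("alpha:", alpha)
    print("train predictions:", predict(alpha, y, K))
```

The sketch works on a precomputed kernel matrix, which keeps the update rule easy to read but needs memory quadratic in the number of samples. A large-scale setting such as the one the abstract targets would need a different strategy for evaluating the kernel.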
