• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ESVM:用于微阵列数据自动特征选择与分类的进化支持向量机

ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data.

作者信息

Huang Hui-Ling, Chang Fang-Lin

机构信息

Department of Information Management, Jin Wen Institute of Technology, and Department of Anesthesiology, Tri-Service General Hospital, Taipei, Taiwan.

出版信息

Biosystems. 2007 Sep-Oct;90(2):516-28. doi: 10.1016/j.biosystems.2006.12.003. Epub 2006 Dec 16.

DOI:10.1016/j.biosystems.2006.12.003
PMID:17280775
Abstract

An optimal design of support vector machine (SVM)-based classifiers for prediction aims to optimize the combination of feature selection, parameter setting of SVM, and cross-validation methods. However, SVMs do not offer the mechanism of automatic internal relevant feature detection. The appropriate setting of their control parameters is often treated as another independent problem. This paper proposes an evolutionary approach to designing an SVM-based classifier (named ESVM) by simultaneous optimization of automatic feature selection and parameter tuning using an intelligent genetic algorithm, combined with k-fold cross-validation regarded as an estimator of generalization ability. To illustrate and evaluate the efficiency of ESVM, a typical application to microarray classification using 11 multi-class datasets is adopted. By considering model uncertainty, a frequency-based technique by voting on multiple sets of potentially informative features is used to identify the most effective subset of genes. It is shown that ESVM can obtain a high accuracy of 96.88% with a small number 10.0 of selected genes using 10-fold cross-validation for the 11 datasets averagely. The merits of ESVM are three-fold: (1) automatic feature selection and parameter setting embedded into ESVM can advance prediction abilities, compared to traditional SVMs; (2) ESVM can serve not only as an accurate classifier but also as an adaptive feature extractor; (3) ESVM is developed as an efficient tool so that various SVMs can be used conveniently as the core of ESVM for bioinformatics problems.

摘要

基于支持向量机(SVM)的预测分类器的优化设计旨在优化特征选择、SVM参数设置和交叉验证方法的组合。然而,支持向量机不具备自动进行内部相关特征检测的机制。其控制参数的适当设置通常被视为另一个独立的问题。本文提出了一种进化方法来设计基于支持向量机的分类器(名为ESVM),通过使用智能遗传算法同时优化自动特征选择和参数调整,并结合k折交叉验证作为泛化能力的估计器。为了说明和评估ESVM的效率,采用了一个使用11个多类数据集进行微阵列分类的典型应用。通过考虑模型的不确定性,使用一种基于频率的技术,对多组潜在信息特征进行投票,以识别最有效的基因子集。结果表明,对于这11个数据集,平均使用10折交叉验证时,ESVM使用仅10.0个选定基因就能获得96.88%的高精度。ESVM的优点有三个方面:(1)与传统支持向量机相比,ESVM中嵌入的自动特征选择和参数设置可以提高预测能力;(2)ESVM不仅可以作为一个准确的分类器,还可以作为一个自适应特征提取器;(3)ESVM被开发为一个高效的工具,因此各种支持向量机可以方便地用作ESVM的核心来解决生物信息学问题。

相似文献

1
ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data.ESVM:用于微阵列数据自动特征选择与分类的进化支持向量机
Biosystems. 2007 Sep-Oct;90(2):516-28. doi: 10.1016/j.biosystems.2006.12.003. Epub 2006 Dec 16.
2
ProLoc: prediction of protein subnuclear localization using SVM with automatic selection from physicochemical composition features.ProLoc:利用支持向量机并从物理化学组成特征中自动选择来预测蛋白质亚核定位。
Biosystems. 2007 Sep-Oct;90(2):573-81. doi: 10.1016/j.biosystems.2007.01.001. Epub 2007 Jan 4.
3
Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers.从微阵列数据中选择最少数量的相关基因以设计精确的组织分类器。
Biosystems. 2007 Jul-Aug;90(1):78-86. doi: 10.1016/j.biosystems.2006.07.002. Epub 2006 Jul 10.
4
An evolutionary approach for gene selection and classification of microarray data based on SVM error-bound theories.一种基于支持向量机误差界理论的基因选择及微阵列数据分类的进化方法。
Biosystems. 2010 Apr;100(1):39-46. doi: 10.1016/j.biosystems.2009.12.006. Epub 2010 Jan 4.
5
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。
Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.
6
Bias in error estimation when using cross-validation for model selection.在使用交叉验证进行模型选择时误差估计中的偏差。
BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.
7
Advances in metaheuristics for gene selection and classification of microarray data.元启发式算法在基因选择和微阵列数据分析分类中的应用进展。
Brief Bioinform. 2010 Jan;11(1):127-41. doi: 10.1093/bib/bbp035. Epub 2009 Sep 29.
8
Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates.基于类别优化基因和概率估计的支持向量机进行多类别癌症分类
J Theor Biol. 2009 Aug 7;259(3):533-40. doi: 10.1016/j.jtbi.2009.04.013. Epub 2009 May 3.
9
Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis.用于微阵列数据分析的具有准确且紧凑模糊规则库的可解释基因表达分类器。
Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21.
10
An integrated scheme for feature selection and parameter setting in the support vector machine modeling and its application to the prediction of pharmacokinetic properties of drugs.支持向量机建模中特征选择与参数设置的集成方案及其在药物药代动力学性质预测中的应用
Artif Intell Med. 2009 Jun;46(2):155-63. doi: 10.1016/j.artmed.2008.07.001. Epub 2008 Aug 12.

引用本文的文献

1
Determination of biomarkers from microarray data using graph neural network and spectral clustering.基于图神经网络和谱聚类的基因表达谱数据中生物标志物的确定。
Sci Rep. 2021 Dec 13;11(1):23828. doi: 10.1038/s41598-021-03316-6.
2
A framework model using multifilter feature selection to enhance colon cancer classification.基于多滤波器特征选择的结肠癌分类增强框架模型。
PLoS One. 2021 Apr 16;16(4):e0249094. doi: 10.1371/journal.pone.0249094. eCollection 2021.
3
DQB: A novel dynamic quantitive classification model using artificial bee colony algorithm with application on gene expression profiles.
DQB:一种使用人工蜂群算法的新型动态定量分类模型及其在基因表达谱中的应用
Saudi J Biol Sci. 2018 Jul;25(5):932-946. doi: 10.1016/j.sjbs.2018.01.017. Epub 2018 Feb 9.
4
Co-ABC: Correlation artificial bee colony algorithm for biomarker gene discovery using gene expression profile.协同人工蜂群算法(Co-ABC):利用基因表达谱发现生物标志物基因的相关人工蜂群算法
Saudi J Biol Sci. 2018 Jul;25(5):895-903. doi: 10.1016/j.sjbs.2017.12.012. Epub 2018 Jan 3.
5
CancerDiscover: an integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data.《癌症发现》:一种用于从高通量测序数据预测癌症生物标志物和癌症类型的综合流程。
Oncotarget. 2017 Dec 20;9(2):2565-2573. doi: 10.18632/oncotarget.23511. eCollection 2018 Jan 5.
6
A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data.一种基于模糊的独立成分子空间特征选择方法,用于微阵列数据的机器学习分类。
Genom Data. 2016 Feb 23;8:4-15. doi: 10.1016/j.gdata.2016.02.012. eCollection 2016 Jun.
7
mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.mRMR-ABC:一种利用微阵列基因表达谱进行癌症分类的混合基因选择算法。
Biomed Res Int. 2015;2015:604910. doi: 10.1155/2015/604910. Epub 2015 Apr 15.
8
Discovery of prognostic biomarkers for predicting lung cancer metastasis using microarray and survival data.利用微阵列和生存数据发现预测肺癌转移的预后生物标志物。
BMC Bioinformatics. 2015 Feb 21;16:54. doi: 10.1186/s12859-015-0463-x.
9
A hybrid BPSO-CGA approach for gene selection and classification of microarray data.一种用于基因选择和微阵列数据分类的混合BPSO-CGA方法。
J Comput Biol. 2012 Jan;19(1):68-82. doi: 10.1089/cmb.2010.0064. Epub 2011 Jan 6.
10
Automated classification of fMRI data employing trial-based imagery tasks.采用基于试验的成像任务对功能磁共振成像(fMRI)数据进行自动分类。
Med Image Anal. 2009 Jun;13(3):392-404. doi: 10.1016/j.media.2009.01.001. Epub 2009 Jan 16.